Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp4468533imu; Tue, 18 Dec 2018 15:44:59 -0800 (PST) X-Google-Smtp-Source: AFSGD/Vd7WsaA826sCDqnRJrxXq/7CScVHytk9yrSk842uRu2HbHUS3O1u8Q+xaEfVffuirhQOUY X-Received: by 2002:a63:a112:: with SMTP id b18mr17542876pgf.440.1545176698899; Tue, 18 Dec 2018 15:44:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1545176698; cv=none; d=google.com; s=arc-20160816; b=zAilewhzOgsR5LVOAqzsrtweiJRzC/oHmMRg7ucTWtIqe8Ykvvkw3CIczR4gvMvVI0 ixt1vaffynEEMs5nHzjjGZwp2LdnCXnL4folETRTYntdrnYnZ/XQ3kTOOna/TFfb7N0R Z6wTc+O5i43XTYOzWdRS4SoAgFVTEgbQrxFSp6//Zajwf8V6vXzBWmragacuj/A5TDE3 k56DOzsGKg/LEc5T1nQr6CIoj0GCoYlUMxYHG4ehci0Y8AQ/MwD4BmJeSM383QsdDSQa /HDpqtzc+20T8OsM51YZdCXzZDubwUDP7h/LaWKWYB3zOt+wTJIMwj0beJLTg+VW+xFv UTCg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:mail-followup-to :message-id:subject:cc:to:from:date:dkim-signature; bh=95UFh51dOarBCVSprS1HjbeMrf84jNZ9XmlpVgV2LxY=; b=Y9QbbDURwzU3tdM6dPlHky2cPdNLEUO23nOIG018mcL/Tt1HfYLTsd0Zuc5wStoOAc E2PhnuImtZ6que0MieuJWH4tiJaFjC4kOwV7c3Kq+DK2ArCmf5HHl5W7Kxdmm5A0mpDy ty3NUi2KimdN6B9vRSJ46JdF9TwxvqN3BuT81N9Wg65k5+cCz49nnxs7J3AZRlmU8qDs lPC3r5rmUjOGmj/u94rEAAflnez5bIIiTs7U8k77nBVaPYLPJutFzANe1Cb6M9upGP+l dBJMyoJA4+2sAwawxA1jIbYvKp5UlwKMZ/EnM50JaSnaXGm4dBmlDEIKuNpqKrQkmwTz ychg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=tVyYNvYI; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m4si14383464pgj.61.2018.12.18.15.44.42; Tue, 18 Dec 2018 15:44:58 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=tVyYNvYI; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727663AbeLRWxH (ORCPT + 99 others); Tue, 18 Dec 2018 17:53:07 -0500 Received: from mail-pl1-f176.google.com ([209.85.214.176]:40080 "EHLO mail-pl1-f176.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726716AbeLRWxH (ORCPT ); Tue, 18 Dec 2018 17:53:07 -0500 Received: by mail-pl1-f176.google.com with SMTP id u18so8494193plq.7 for ; Tue, 18 Dec 2018 14:53:06 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:cc:subject:message-id:mail-followup-to:references :mime-version:content-disposition:in-reply-to:user-agent; bh=95UFh51dOarBCVSprS1HjbeMrf84jNZ9XmlpVgV2LxY=; b=tVyYNvYIrH1dYWfD7NSr/AkwKrAJSdrYwacxww4pHUYTG7gxjg/Ci/h56FZXSltqbW tudWv4Go/4Tj8NmRRTEidvMwE7fSRdri8uX/5wCwsymlK10O7lsIpsiYngChDfvWjnaW 4a65IqMs1lZD648AcfBa3Wj5dVvwTnwA3nhqq/UkML3rQGp18mqT9NBgHF7GpQCpEGpr F0JptXrjju14iPYGOHWpWYKeO77nbm0AxVvjKa28oaMNa6E4czWQHbBmxeVm/kmsnYrK W/xheO1piZV/2syz0V+c06X/10lOd5RmsQQPswkurJgWyrYqIdZ9sfuoS5PjYyWizbI7 ZzOQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id :mail-followup-to:references:mime-version:content-disposition :in-reply-to:user-agent; bh=95UFh51dOarBCVSprS1HjbeMrf84jNZ9XmlpVgV2LxY=; b=MEq8qPm9VJmimuRPXjC/P6waOB44XqYhQ6K5L+6qLApbYwh0V+5m9bzIEgHNFq53hV SWZ238+2Vy2nEtPHLrZvxY3oQp5AJh0iprVtB8z35lf5q3a64Adu6ie+qorBxKjKvDro FbStGISp3Ol8S1NLsR9rYBCE2MwM7GL+c0kKN49byfCMIY2rYN7HbNHVqhaqMvp/8nN+ MjYOLmCw8tbAfcCKEy5K55NVSL1RGSIPwK5x0KXeOV4UKPwF4bLZqGgQyvA0aPdp/P3m 09/BfTkuRfAVSFX6JPDl9rf1XvNBUhzGpkPZmIIRevwppIImIMIpb1Rljb3RYxhLFbfu IAaQ== X-Gm-Message-State: AA+aEWa5+4Jjpu7ivyjva2ZSuent+hxzHwe2PJ/ZYcv2w/4VlvMLxTQ5 Ln13mbSXsVzjVN61J5z1LK6N6lzH X-Received: by 2002:a17:902:6b0c:: with SMTP id o12mr18449303plk.291.1545173586086; Tue, 18 Dec 2018 14:53:06 -0800 (PST) Received: from gmail.com (cpe-98-150-136-16.hawaii.res.rr.com. [98.150.136.16]) by smtp.gmail.com with ESMTPSA id s37sm21375055pgm.19.2018.12.18.14.53.04 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Tue, 18 Dec 2018 14:53:05 -0800 (PST) Date: Tue, 18 Dec 2018 12:53:03 -1000 From: Joey Pabalinas To: Jasper Spaans Cc: Joey Pabalinas , Joe Perches , Linux Kernel Mailing List Subject: Re: [RFC] LKML Archive in Maildir Format Message-ID: <20181218225303.jxgwf76wm4uls4fi@gmail.com> Mail-Followup-To: Joey Pabalinas , Jasper Spaans , Joe Perches , Linux Kernel Mailing List References: <20181216190639.6safwjqwdphkce67@gmail.com> <20181216192135.hc7gykmwkfgil2j5@gmail.com> <20181218202627.j6d2jgxercylclpc@jasper.es> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha512; protocol="application/pgp-signature"; boundary="mopxlzhr3jjalvzf" Content-Disposition: inline In-Reply-To: <20181218202627.j6d2jgxercylclpc@jasper.es> User-Agent: NeoMutt/20180716 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org --mopxlzhr3jjalvzf Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Tue, Dec 18, 2018 at 09:26:27PM +0100, Jasper Spaans wrote: > Now you've caught my attention; first of all, there are more than 3M > messages stored in the lkml.org datase, so I guess you've missed some > messages or something is really broken. >=20 > Besides, unless you figured out how to get to the raw data, you've just > scraped a rendering which discards stuff like pgp signatures etc and has > very incomplete headers. Unless you don't care for those of course :) >=20 > Note that I've also been toying with the lore dataset, and wrote a tiny t= ool > to get Maildir-like data out of it; this code is a bit of a single-use-jig > so you'll need to do some coding if you really want to use it. Attached > anyway. Yeah, after looking closer at it last week, something here is very weird. This is definitely far from complete. When I have some free time I'm just going to give it another go with the public-inbox conversion. --=20 Cheers, Joey Pabalinas --mopxlzhr3jjalvzf Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQIzBAEBCgAdFiEENpTlWU2hUK/KMvHp2rGdfm9DtVIFAlwZek4ACgkQ2rGdfm9D tVIrFw/+OnMAOWfJodnabA2ECMj5pFjFGAgltlvj+FiUjKVlj6SkD6B6Mz4LH/nx 9sZZx/U+Nesb0ftCekmqPlMMQI1mJTC+gnNSmktqyOUtNNzCgFC9/wlhLK2zGKSC O6HkVvar/UwmkGHWgFWXmbvnvds2HcsyATND5r08ZiTrXHVitYk22nezhm5UufQU v6tWcFxWMDne+JR/JTI8qraSH5f+5lZ6TnUuLhAjxOJzj4tv4zhhxzEbThqy8Zw2 LxYOm9j+8D/t/9Q5zhIf4h3AltcJVKnLTIVlHEoX1OHH5Vah4Vfwf5fopCym0nhs QvBPN3VbCojjQl++oUNUxNdYtskUq3thWgYHwf4N+7yxI1htATMaa/rDDuPq8JkO HLyLd5NUaMp6gjp2lmjkgBRjLOICVHRWcTTSB3ds+w/Atg3FrGwj/yY1y9OiHO/e bvohIgQp+sVUClOO/qNuBksJmbRR786tTeH7xoe6aG9uOQfVwY0QqMU7VZxZHG6Q r32rPpU4nyT60dcsdcv6al3nTDJBEMaGTb93UTt3tBuzfUbGkyk5NufvcA6IPaib CTk5kJwlSOTX1HoOWs04bzO4sPAMLMr7Irkqbtk7CGG8BwUJISgNov+IAxdYV/Bn aGFr0smRDfPRsKiEYhSoUADq+FTqD79GcvaShDw+I2I/y3sUQss= =NlF+ -----END PGP SIGNATURE----- --mopxlzhr3jjalvzf--