Date: Sun, 16 Dec 2018 14:46:49 -0500
From: Konstantin Ryabitsev
To: Joey Pabalinas, Linux Kernel Mailing List, kernelnewbies@kernelnewbies.org, Linus Torvalds, Greg Kroah-Hartman
Subject: Re: [RFC] LKML Archive in Maildir Format
Message-ID: <20181216194649.GA7732@pure.paranoia.local>
References: <20181216190639.6safwjqwdphkce67@gmail.com>
In-Reply-To: <20181216190639.6safwjqwdphkce67@gmail.com>
User-Agent: Mutt/1.10.1 (2018-07-13)
X-Mailing-List: linux-kernel@vger.kernel.org

On Sun, Dec 16, 2018 at 09:06:39AM -1000, Joey Pabalinas wrote:
> I spent a lot of time trying to find an LKML archive in Maildir format
> that I could use for local searches with notmuch or something, but all
> the links I was able to find were all dead.
>
> I ended up just compiling one myself and I currently host it at:
>
> https://alyptik.org/lkml.tar.xz

You seem to have duplicated a lot of effort that has already been done to
compile the archive on lore.kernel.org.

> It's possible I'm the only weirdo who finds this kind of thing useful, but
> I figured I should share it just in case I'm not.

The maildir format is kind of terrible for LKML, because having millions
of messages in a single directory is very hard on the underlying FS. If
you break it up into multiple folders, then it becomes difficult to
search. This is the main reason why we have chosen to go with the
public-inbox format, which solves both of these problems and allows for
very efficient archive updating and replication using git.

> It's about 1.1 million files, I was wondering if anyone had an idea of a
> better way to host this? I've tried GitHub and GitLab, but they don't
> appreciate repos with that many files, hah.

Like I said, you seem to be going down the road we've already tried and
rejected. :)

-K
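
To make the trade-off above concrete, here is a rough sketch of the "break it up into multiple folders" approach. It is only an illustration, not how public-inbox or lore.kernel.org store mail: the function names, the SHA-1 bucketing scheme, and the example.invalid Message-IDs are all made up for the example. It spreads about 1.1 million synthetic Message-IDs over 256 buckets, which keeps each directory to roughly 4,300 entries on average, but any search that does not already know the Message-ID still has to visit every bucket.

```python
import hashlib
from collections import Counter


def shard_for(message_id: str, width: int = 2) -> str:
    """Return a hypothetical bucket name for a message.

    The bucket is the first `width` hex digits of the SHA-1 of the
    Message-ID, so width=2 gives at most 256 buckets.
    """
    digest = hashlib.sha1(message_id.encode("utf-8")).hexdigest()
    return digest[:width]


def shard_counts(message_ids, width: int = 2) -> Counter:
    """Count how many messages would land in each bucket."""
    return Counter(shard_for(mid, width) for mid in message_ids)


if __name__ == "__main__":
    # Synthetic Message-IDs standing in for ~1.1 million archived mails.
    ids = (f"<{n}@example.invalid>" for n in range(1_100_000))
    counts = shard_counts(ids)
    print(len(counts), "buckets; largest holds", max(counts.values()), "messages")
```

Each directory stays small this way, but a full-text or by-date search has to walk all of the buckets, which is the "difficult to search" half of the problem.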