Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp349526imu; Thu, 3 Jan 2019 21:51:10 -0800 (PST) X-Google-Smtp-Source: ALg8bN5EpE+Tm/SnElnwKYp0dSL8WXMmlTOfzL0UnVOrLabhFS1a3NZg9uzLQINMjogj66VAIw9f X-Received: by 2002:a65:6684:: with SMTP id b4mr525373pgw.55.1546581070364; Thu, 03 Jan 2019 21:51:10 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1546581070; cv=none; d=google.com; s=arc-20160816; b=yRoezWLCCNviK1yw0lQxE8TuXIBdYNgEvHlK9o7W2kVHno9rtOjUH8mDr2vHDEMIR3 b96ES/Dh2qrzdvC9jAwC1lueuyZYkLrdDeYY2oOi2Bd64PacC4x3uaXrj1SfK9NRZ5BW iqw6nKESli6SXFDLvTY+cg9XYsOpkMpaZ7j23vNC8ou+1nYlTl3jtqvaeYp4Sc1IlR5M +X32rChXerqnu0wr02+wyKjMG0EzbND9/AP+6Js0Pvq0cVot8+KxM978kt5kNJxQcOGB xINQSImBhrzpHAXa+D/y/aorBgLl/7bRwcB/PxIj0AhASR75KtdCd059Sm1wufkbpe6r 2Uqw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=5Bil8wLpfEts5QpgI//T+RmEgalW7vdceFUoN2wjUE0=; b=Y6x7dZJlPLc6niZpITVbUyHCjQX5QFDy3oJvZPv0yFBep2MncQIB5ukhUUB2rZEge9 aa3qy5eUAqa+X0KSBaUKaAqnt25sXzyn9oqdwriK0n6+p1KtC/GkWKNcBs11svcs5wbw NYQ0tyhhA9HHk3hvbHsVKhBvPujWpQ2+u0aCHzdDLNrPUxpP06CgKM4nvTHCeqVuoyb3 KrGrPggxyEzUOuog68BvhYjXS4IRa8RkCz/yzNRKiionhGFafnCgyUKUNxHkMoTaXZfK kAen35Tg6tz76GEST1SZfQAW+nLNonPp12V4thyO1vET688jXONl9jsaUQJh1zuCj11Z y7Lw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o22si1929954pgb.584.2019.01.03.21.50.52; Thu, 03 Jan 2019 21:51:10 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727834AbfADBfX (ORCPT + 99 others); Thu, 3 Jan 2019 20:35:23 -0500 Received: from dcvr.yhbt.net ([64.71.152.64]:58138 "EHLO dcvr.yhbt.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727196AbfADBfW (ORCPT ); Thu, 3 Jan 2019 20:35:22 -0500 Received: from localhost (dcvr.yhbt.net [127.0.0.1]) by dcvr.yhbt.net (Postfix) with ESMTP id 5E47C1F6A9; Fri, 4 Jan 2019 01:35:22 +0000 (UTC) Date: Fri, 4 Jan 2019 01:35:22 +0000 From: Eric Wong To: Joey Pabalinas Cc: linux-kernel@vger.kernel.org, kernelnewbies@kernelnewbies.org, Linus Torvalds , Greg Kroah-Hartman Subject: Re: [RFC] LKML Archive in Maildir Format Message-ID: <20190104013522.stng6gwauwnr6wbi@starla> References: <20181216190639.6safwjqwdphkce67@gmail.com> <20181216194649.GA7732@pure.paranoia.local> <20181216195343.idnt2y5y5wjky5gu@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20181216195343.idnt2y5y5wjky5gu@gmail.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Joey Pabalinas wrote: > My only comment on the public-mailbox choice is that the documentation > is very sparse and erratic. Myself and a couple other people just > couldn't figure out how to convert that format to Maildir or some other > format you could feed into a reader like neomutt. Sorry, I didn't notice this before. I started making some attempts at improving documentation (among other things, when time permits) to public-inbox: https://public-inbox.org/meta/20190102083305.30473-1-e@80x24.org/ And without knowing anything about git or public-inbox, you can get NNTP messages into Maildir or mboxrd pretty easily. Nothing new to learn :) I wrote a one-off Ruby years ago (before public-inbox) for converting slrnspools to Maildir (sample slrnpull.conf below). But yeah, I wouldn't recommend 3M+ messages in a Maildir... ==> slrnspool2maildir <== #!/usr/bin/ruby require 'socket' require 'fileutils' HOSTNAME = Socket.gethostname usage = "Usage #$0 " spooldir = ARGV[0] or abort usage maildir = ARGV[1] or abort usage f = base = nil nr = 0 %w(cur new tmp).each { |x| FileUtils.mkpath("#{maildir}/#{x}") } Dir.glob("#{spooldir}/*").each do |src| File.file?(src) or next base = File.basename(src) dest = "#{maildir}/new/#{Time.now.to_i}_#{base}_0.#{HOSTNAME}:2," begin File.link(src, dest) rescue Errno::EEXIST warn "#{dest} already exists" next end File.unlink(src) end __END__ ==> slrnpull.conf <== # group_name max expire headers_only inbox.com.example.news.group.name 1000000000 1000000000 0 # usage: slrnpull -d $PWD -h news.example.com --no-post # Wouldn't be hard to script something using Net::NNTP in Perl # to write directly to Maildirs, either.