[Python-Dev] Accessing mailing list archives

Victor Stinner vstinner at redhat.com
Mon Jul 30 18:26:57 EDT 2018


Hi Bob,

I wrote a basic script to compute the number of emails per PEP. It requires downloading the gzipped mbox files from the monthly archive web page, then decompressing them: https://github.com/vstinner/misc/blob/master/python/parse_mailman_mbox_peps.py

Results: https://mail.python.org/pipermail/python-committers/2018-April/005310.html
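The approach described above can be sketched with the standard library alone. This is a minimal sketch, not the linked script: it assumes a monthly archive was saved locally under a name such as 2018-July.txt.gz, and it only looks for PEP numbers in the Subject header, which may differ from what the linked script does. The names gunzip_archive, peps_in_subject and count_emails_per_pep are made up for illustration.

```python
import gzip
import mailbox
import re
import shutil
import tempfile
from collections import Counter

# Matches "PEP 572", "PEP-572", "pep572", "PEP 0572" (assumed spellings).
PEP_RE = re.compile(r"\bPEP[\s-]*(\d+)\b", re.IGNORECASE)


def gunzip_archive(gz_path):
    """Decompress a gzipped mbox file to a temporary file and return its path."""
    with gzip.open(gz_path, "rb") as src, tempfile.NamedTemporaryFile(
        suffix=".mbox", delete=False
    ) as dst:
        shutil.copyfileobj(src, dst)
        return dst.name


def peps_in_subject(subject):
    """Return the set of PEP numbers mentioned in one Subject header."""
    return {int(number) for number in PEP_RE.findall(subject)}


def count_emails_per_pep(mbox_path):
    """Count emails per PEP number; each email counts once per PEP it names."""
    counts = Counter()
    box = mailbox.mbox(mbox_path, create=False)
    try:
        for message in box:
            # A missing Subject header yields None, hence the "or".
            subject = str(message["Subject"] or "")
            counts.update(peps_in_subject(subject))
    finally:
        box.close()
    return counts
```

To cover several months, call count_emails_per_pep() on each decompressed file and add the resulting Counter objects together.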

Victor

On Monday, July 30, 2018, Bob Purvy <bpurvy at gmail.com> wrote:

Hi all,

I've been trying to figure out how to access the archives programmatically. I'm sure this is easy once you know, but googling various things hasn't worked.

What I want to do is graph the number of messages about PEP 572 by time. (Or has someone already done that?)

I installed GNU Mailman, and downloaded the gzip'ed archives for a number of months and unzipped them, and I suspect that there's some way to get them all into a single database, but it hasn't jumped out at me. If I count the "Message-ID" lines, the "Subject:" lines, and the "\nFrom " lines in one of those text files, I get slightly different numbers for each.

Alternatively, they're maybe already in a database, and I just need API access to do the querying?

Can someone help me out?

Bob
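One plausible reason the raw line counts disagree (not verified against these archives) is that strings such as "Subject:" or "Message-ID" can also appear inside quoted or forwarded message bodies. Letting Python's standard mailbox module split the file into messages avoids counting lines by hand. The following is a minimal sketch of the per-day count that could feed a graph; the function name messages_per_day is hypothetical, and matching only on the Subject header is an assumption.

```python
import mailbox
import re
from collections import Counter
from email.utils import parsedate_to_datetime


def messages_per_day(mbox_path, pep_number=572):
    """Count messages whose Subject names the given PEP, keyed by YYYY-MM-DD."""
    # Accepts "PEP 572", "PEP-572", "pep572" and zero-padded forms.
    pattern = re.compile(r"\bPEP[\s-]*0*%d\b" % pep_number, re.IGNORECASE)
    per_day = Counter()
    box = mailbox.mbox(mbox_path, create=False)
    try:
        for message in box:
            if not pattern.search(str(message["Subject"] or "")):
                continue
            raw_date = message["Date"]
            if raw_date is None:
                continue  # no Date header: skip rather than guess
            try:
                sent = parsedate_to_datetime(str(raw_date))
            except (TypeError, ValueError):
                continue  # unparseable Date header
            # The day is taken in the sender's own timezone.
            per_day[sent.date().isoformat()] += 1
    finally:
        box.close()
    return per_day
```

Summing the returned Counter objects over all downloaded months gives one series that any plotting tool can draw.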