<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Great!</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Thank you again, Christoph and Niels, later I'll try it.</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Best,</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Riccardo</div>
<div id="appendonsend"></div>
<hr style="display:inline-block;width:98%" tabindex="-1">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" style="font-size:11pt" color="#000000"><b>Da:</b> Niels ten Oever <mail@nielstenoever.net><br>
<b>Inviato:</b> venerdì 23 aprile 2021 11:50<br>
<b>A:</b> Riccardo Nanni <riccardo.nanni9@unibo.it>; Christoph Becker <chrbecker01@gmail.com><br>
<b>Cc:</b> bigbang-user@data-activism.net <bigbang-user@data-activism.net><br>
<b>Oggetto:</b> Re: R: R: [Bigbang-user] Issue with listserv fetching (3GPP)</font>
<div> </div>
</div>
<div class="BodyFragment"><font size="2"><span style="font-size:11pt;">
<div class="PlainText">Thanks Christoph!<br>
<br>
This was the content of the file example.py:<br>
<br>
import bigbang<br>
from bigbang import listserv<br>
from bigbang.listserv import ListservArchive, ListservList, ListservMessage<br>
<br>
url_archive = "<a href="https://list.etsi.org/scripts/wa.exe?">https://list.etsi.org/scripts/wa.exe?</a>"<br>
url_list = url_archive + "A0=3GPP_TSG_CT_WG6"<br>
<br>
ListservArchive.from_url(<br>
name="3GPP",<br>
url_root=url_archive,<br>
url_home=url_archive + "HOME",<br>
login=auth_key_mock,<br>
instant_save=True,<br>
only_mlist_urls=False,<br>
)<br>
<br>
<br>
Best,<br>
<br>
Niels<br>
<br>
On 23-04-2021 09:25, Riccardo Nanni wrote:<br>
> Dear Niels and Christoph,<br>
> <br>
> thanks a lot for your help!<br>
> I tried Niels' way and I keep getting the 'instant_dump'. <br>
> I did 'git branch' and it shows the following:<br>
> <br>
> *main<br>
> master<br>
> <br>
> I understand I am on the 'main' branch, is it right?<br>
> Then I tried 'git pull' again and it says it is already updated, but it keeps showing the 'instant_dump' message when I try the usual command.<br>
> <br>
> @Christoph: thank you for sharing the file on the alternative way to gather listserv emails, but I don't think it came through: all I can find is an error message that says an attachment was detected as malware (guess my computer 'misread' your file?). Any
chance you can share it again, please?<br>
> <br>
> Thanks a lot again, you're all very helpful! As I'm better at cooking than programming, when you come to Italy I owe you a dinner 🙂🙂<br>
> Cheers,<br>
> <br>
> Riccardo<br>
> ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------<br>
> *Da:* Christoph Becker <chrbecker01@gmail.com><br>
> *Inviato:* venerdì 23 aprile 2021 00:23<br>
> *A:* Niels ten Oever <mail@nielstenoever.net><br>
> *Cc:* Riccardo Nanni <riccardo.nanni9@unibo.it>; bigbang-user@data-activism.net <bigbang-user@data-activism.net><br>
> *Oggetto:* Re: R: [Bigbang-user] Issue with listserv fetching (3GPP)<br>
> <br>
> Hi Niels & Riccardo,<br>
> the argument 'instant_dump' for the ListservArchive class object does not exist anymore in the up-to-date 'main' branch of the git repo.<br>
> @Niels: Do you mean that you did a 'git pull' and encountered the TypeError caused by missing 'instant_dump' too?<br>
> <br>
> But as I said in another message, we are not quite there yet for 3GPP and IEEE to use the 'conventional' method on how BigBang scrapes archives such as W3C.<br>
> I attached a small examples that shows how you can currently scrape the 3GPP archive and save it to mbox files in the CONFIG.mail_path folder.<br>
> Be aware that this could take very long and could use a lot of memory.<br>
> <br>
> Best Wishes,<br>
> Christoph<br>
> <br>
> <br>
> Op do 22 apr. 2021 om 17:17 schreef Niels ten Oever <mail@nielstenoever.net <<a href="mailto:mail@nielstenoever.net">mailto:mail@nielstenoever.net</a>>>:<br>
> <br>
> Hi Riccardo and Christoph,<br>
> <br>
> I see there might be an issue with the usage of special characters in the mailinglist URLs, to get it working I had to put a '\' in front on the '?', but this could also be fixed by using " " around the URL. However, after that fetching did not work either
- so let's ask Christoph (cc).<br>
> <br>
> Cheers,<br>
> <br>
> Niels<br>
> <br>
> <br>
> <br>
> <br>
> <br>
> <br>
> On 22-04-2021 17:43, Riccardo Nanni wrote:<br>
> > Hi Niels,<br>
> ><br>
> > thanks for your answer!<br>
> > I did, and I found the changes I can see in Github (e.g. the listserv.3GPP.txt file, etc.).<br>
> > I did it again when I saw it didn't work and it says 'già aggiornato' (already updated).<br>
> ><br>
> > Riccardo<br>
> ><br>
> ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------<br>
> > *Da:* Bigbang-user <bigbang-user-bounces@data-activism.net <<a href="mailto:bigbang-user-bounces@data-activism.net">mailto:bigbang-user-bounces@data-activism.net</a>>> per conto di Niels ten Oever <mail@nielstenoever.net <<a href="mailto:mail@nielstenoever.net">mailto:mail@nielstenoever.net</a>>><br>
> > *Inviato:* giovedì 22 aprile 2021 17:38<br>
> > *A:* bigbang-user@data-activism.net <<a href="mailto:bigbang-user@data-activism.net">mailto:bigbang-user@data-activism.net</a>> <bigbang-user@data-activism.net <<a href="mailto:bigbang-user@data-activism.net">mailto:bigbang-user@data-activism.net</a>>><br>
> > *Oggetto:* Re: [Bigbang-user] Issue with listserv<br>
> > <br>
> > Hi Riccardo,<br>
> ><br>
> > This is not a very informed response - but did you first do:<br>
> ><br>
> > git pull<br>
> ><br>
> > to ensure that you have the latest version with all the recent changes?<br>
> ><br>
> > Best,<br>
> ><br>
> > Niels<br>
> ><br>
> > On 22-04-2021 17:31, Riccardo Nanni wrote:<br>
> >> Dear all,<br>
> >><br>
> >> how are you?<br>
> >> I tried to collect email from 3GPP by running these commands:<br>
> >> python bin/collect_mail.py -u <a href="https://list.etsi.org/scripts/wa.exe">
https://list.etsi.org/scripts/wa.exe</a> <<a href="https://list.etsi.org/scripts/wa.exe">https://list.etsi.org/scripts/wa.exe</a>>? <<a href=""></a>https://list.etsi.org/scripts/wa.exe <<a href="https://list.etsi.org/scripts/wa.exe>?">https://list.etsi.org/scripts/wa.exe>?</a>>
<<a href=""></a>https://list.etsi.org/scripts/wa.exe <<a href="https://list.etsi.org/scripts/wa.exe">https://list.etsi.org/scripts/wa.exe</a>>? <<a href=""></a>https://list.etsi.org/scripts/wa.exe <<a href="https://list.etsi.org/scripts/wa.exe>?">https://list.etsi.org/scripts/wa.exe>?</a>>>;<br>
> >> python3 bin/collect_mail.py -u <a href="https://list.etsi.org/scripts/wa.exe">
https://list.etsi.org/scripts/wa.exe</a> <<a href="https://list.etsi.org/scripts/wa.exe">https://list.etsi.org/scripts/wa.exe</a>>? <<a href=""></a>https://list.etsi.org/scripts/wa.exe <<a href="https://list.etsi.org/scripts/wa.exe>?">https://list.etsi.org/scripts/wa.exe>?</a>>
<<a href=""></a>https://list.etsi.org/scripts/wa.exe <<a href="https://list.etsi.org/scripts/wa.exe">https://list.etsi.org/scripts/wa.exe</a>>? <<a href=""></a>https://list.etsi.org/scripts/wa.exe <<a href="https://list.etsi.org/scripts/wa.exe>?">https://list.etsi.org/scripts/wa.exe>?</a>>><br>
> >> AND<br>
> >> python3 bin/collect_mail.py -f examples/url_collections/listserv.3GPP.txt<br>
> >><br>
> >> Also tried to scrape a specific group's list with the same commands: <a href="https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN">
https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN</a> <<a href="https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN">https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN</a>> <<a href=""></a>https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN <<a href="https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN">https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN</a>>>
<<a href=""></a>https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN <<a href="https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN">https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN</a>> <<a href=""></a>https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN
<<a href="https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN">https://list.etsi.org/scripts/wa.exe?A0=3GPP_TSG_RAN</a>>>><br>
> >><br>
> >> I get the following error:<br>
> >> TypeError: from_url() got an unexpected keyword argument 'instant_dump'<br>
> >><br>
> >> I don't understand what I'm missing. Can you help me, please?<br>
> >> Thanks a lot in advance! The only similar argument I could find on Stackoverflow has no answers...<br>
> >><br>
> >> Riccardo<br>
> >><br>
> >><br>
> >><br>
> >><br>
> >> _______________________________________________<br>
> >> Bigbang-user mailing list<br>
> >> Bigbang-user@data-activism.net <<a href="mailto:Bigbang-user@data-activism.net">mailto:Bigbang-user@data-activism.net</a>><br>
> >> <a href="https://lists.ghserv.net/mailman/listinfo/bigbang-user">https://lists.ghserv.net/mailman/listinfo/bigbang-user</a> <<a href="https://lists.ghserv.net/mailman/listinfo/bigbang-user">https://lists.ghserv.net/mailman/listinfo/bigbang-user</a>>
<<a href=""></a>https://lists.ghserv.net/mailman/listinfo/bigbang-user <<a href="https://lists.ghserv.net/mailman/listinfo/bigbang-user">https://lists.ghserv.net/mailman/listinfo/bigbang-user</a>>><br>
> >><br>
> ><br>
> > --<br>
> > Niels ten Oever, PhD<br>
> > Postdoctoral Researcher - Media Studies Department - University of Amsterdam<br>
> > Research Fellow - Centre for Internet and Human Rights - European University Viadrina<br>
> > Associated Scholar - Centro de Tecnologia e Sociedade - Fundação Getúlio Vargas<br>
> ><br>
> > <a href="https://nielstenoever.net">https://nielstenoever.net</a> <<a href="https://nielstenoever.net">https://nielstenoever.net</a>> <<a href=""></a>https://nielstenoever.net <<a href="https://nielstenoever.net">https://nielstenoever.net</a>>> - mail@nielstenoever.net
<<a href="mailto:mail@nielstenoever.net">mailto:mail@nielstenoever.net</a>> - @nielstenoever - +31629051853<br>
> > PGP: 2458 0B70 5C4A FD8A 9488 643A 0ED8 3F3A 468A C8B3<br>
> ><br>
> > Read my latest article on Internet infrastructure governance in New Media & Society here:
<a href="https://journals.sagepub.com/doi/full/10.1177/1461444820929320">https://journals.sagepub.com/doi/full/10.1177/1461444820929320</a> <<a href="https://journals.sagepub.com/doi/full/10.1177/1461444820929320">https://journals.sagepub.com/doi/full/10.1177/1461444820929320</a>>
<<a href=""></a>https://journals.sagepub.com/doi/full/10.1177/1461444820929320 <<a href="https://journals.sagepub.com/doi/full/10.1177/1461444820929320">https://journals.sagepub.com/doi/full/10.1177/1461444820929320</a>>><br>
> ><br>
> > _______________________________________________<br>
> > Bigbang-user mailing list<br>
> > Bigbang-user@data-activism.net <<a href="mailto:Bigbang-user@data-activism.net">mailto:Bigbang-user@data-activism.net</a>><br>
> > <a href="https://lists.ghserv.net/mailman/listinfo/bigbang-user">https://lists.ghserv.net/mailman/listinfo/bigbang-user</a> <<a href="https://lists.ghserv.net/mailman/listinfo/bigbang-user">https://lists.ghserv.net/mailman/listinfo/bigbang-user</a>>
<<a href=""></a>https://lists.ghserv.net/mailman/listinfo/bigbang-user <<a href="https://lists.ghserv.net/mailman/listinfo/bigbang-user">https://lists.ghserv.net/mailman/listinfo/bigbang-user</a>>><br>
> <br>
> -- <br>
> Niels ten Oever, PhD<br>
> Postdoctoral Researcher - Media Studies Department - University of Amsterdam<br>
> Research Fellow - Centre for Internet and Human Rights - European University Viadrina<br>
> Associated Scholar - Centro de Tecnologia e Sociedade - Fundação Getúlio Vargas<br>
> <br>
> <a href="https://nielstenoever.net">https://nielstenoever.net</a> <<a href="https://nielstenoever.net">https://nielstenoever.net</a>> - mail@nielstenoever.net <<a href="mailto:mail@nielstenoever.net">mailto:mail@nielstenoever.net</a>> - @nielstenoever
- +31629051853<br>
> PGP: 2458 0B70 5C4A FD8A 9488 643A 0ED8 3F3A 468A C8B3<br>
> <br>
> Read my latest article on Internet infrastructure governance in New Media & Society here:
<a href="https://journals.sagepub.com/doi/full/10.1177/1461444820929320">https://journals.sagepub.com/doi/full/10.1177/1461444820929320</a> <<a href="https://journals.sagepub.com/doi/full/10.1177/1461444820929320">https://journals.sagepub.com/doi/full/10.1177/1461444820929320</a>><br>
> <br>
> <br>
> <br>
> -- <br>
> <><><><><><><><><><><><><><><><><br>
> /Christoph Becker /(/he/him/his/)///<br>
> PhD at the<br>
> /<br>
> /Institute for Data Science and/<br>
> /Institute for Computational Cosmology/<br>
> /Durham University/<br>
> /United Kingdom/<br>
> //christovis.github.io// <<a href="http://christovis.github.io">http://christovis.github.io</a>><br>
<br>
-- <br>
Niels ten Oever, PhD<br>
Postdoctoral Researcher - Media Studies Department - University of Amsterdam<br>
Research Fellow - Centre for Internet and Human Rights - European University Viadrina<br>
Associated Scholar - Centro de Tecnologia e Sociedade - Fundação Getúlio Vargas<br>
<br>
<a href="https://nielstenoever.net">https://nielstenoever.net</a> - mail@nielstenoever.net - @nielstenoever - +31629051853<br>
PGP: 2458 0B70 5C4A FD8A 9488 643A 0ED8 3F3A 468A C8B3<br>
<br>
Read my latest article on Internet infrastructure governance in New Media & Society here:
<a href="https://journals.sagepub.com/doi/full/10.1177/1461444820929320">https://journals.sagepub.com/doi/full/10.1177/1461444820929320</a><br>
</div>
</span></font></div>
</body>
</html>