[Bigbang-dev] Big Bang Notebook Questions Corinne

Niels ten Oever niels at article19.org
Tue Jul 17 13:57:24 CEST 2018


The discrepancy between the two hrpc mailinglist archives is a bug on the IETF side, and will be fixed soon!

Cheers,

Niels

⁣Sent from Blue ​

On Jul 17, 2018, 03:40, at 03:40, Corinne Cath <corinnecath at gmail.com> wrote:
>Dear all,
>
>
>
>Thanks a ton for putting together the notebook based on my question.
>Niels
>got me all set up and I have been working with it over the last days.
>It is
>excellent! I have some questions about further refinements that would
>be
>very useful for my project and other similar research.
>
>
>
>I will indicate per box.
>
>
>
>*In [6]: top senders over a time period*
>
>Currently, it gives the absolute numbers, for example, niels ten oever
>77.0
>
>It would be great to know what percentage that represents of the total
>number of emails sent in the period specified. So, those 77 emails are
>they
>0.5, 5, 50% etc of the total over that period?
>
>
>
>*In [7]: number of emails in a time frame*
>
>I was wondering if it would also be possible to indicate the number of
>threads versus single emails (with no responses) to get a sense of how
>responsive a mailing list is in a certain time period.
>
>
>
>In [8]: I would be interested in, for instance, the average number of
>emails per user, across multiple years.
>
>It seems that currently, the numbers presented are not the average
>across
>the time specified but the absolute. That might also be because for the
>test run, I set
>
>
>
>date_from = pd.datetime(2014,10,1,tzinfo=pytz.utc)
>
>date_to = pd.datetime(2015,11,30,tzinfo=pytz.utc)
>
>which is only a little over a year.
>
>
>
>Then there is the separate question Niels posted to the list regarding
>which mailing list is authoritative because there seem to be twoo.
>
>
>
>Because the mailing list archives used for big bang start in 2015 (with
>the
>official start of the hrpc at irtf list, whereas my qualitative analysis
>starts in 2014 with the hrpc at article19 mailing list.) This means that
>it is
>hard for me to use the outputs from bigbang, because it reflects a
>slightly
>less complete mailing list than the rest of my analysis does.
>
>
>
>Hope this is clear if not do ping me so we can further improve bigbang!
>
>Best,
>
>-- 
>Corinne Cath
>Ph.D. Candidate, Oxford Internet Institute & Alan Turing Institute
>
>Web: www.oii.ox.ac.uk/people/corinne-cath
>Email: ccath at turing.ac.uk & corinnecath at gmail.com
>Twitter: @C_Cath
>
>
>------------------------------------------------------------------------
>
>_______________________________________________
>Bigbang-dev mailing list
>Bigbang-dev at data-activism.net
>https://lists.ghserv.net/mailman/listinfo/bigbang-dev
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.ghserv.net/pipermail/bigbang-dev/attachments/20180717/85971b9e/attachment.html>


More information about the Bigbang-dev mailing list