[Bigbang-dev] Big Bang Notebook Questions Corinne
Beraldo, Davide
d.beraldo at uva.nl
Tue Jul 24 17:24:15 CEST 2018
Hi Corinne,
Sorry, I totally forgot to get back to you on this. I'm responding to the coding-related questions.
In [6]: top senders over a time period
Currently, it gives the absolute numbers, for example, niels ten oever 77.0
It would be great to know what percentage that represents of the total number of emails sent in the period specified. So, are those 77 emails 0.5%, 5%, 50%, etc. of the total over that period?
Easy peasy
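A minimal sketch of how the share could be computed with pandas. It assumes the archive is a DataFrame with a "From" column and a timezone-aware "Date" column; those column names and the function name are illustrative assumptions, not necessarily what the notebook uses.

```python
import pandas as pd

def top_senders_with_share(emails, date_from, date_to, n=10):
    """Return the n most active senders with absolute counts and % of the total.

    `emails` is assumed to have a "From" column and a tz-aware "Date" column
    (hypothetical names chosen for this sketch).
    """
    # Keep only the messages sent inside the requested period.
    in_range = emails[(emails["Date"] >= date_from) & (emails["Date"] <= date_to)]
    # value_counts() returns the senders sorted by number of emails, descending.
    counts = in_range["From"].value_counts()
    out = counts.to_frame(name="emails")
    # Share of each sender relative to all emails in the period.
    out["percent"] = 100 * out["emails"] / out["emails"].sum()
    return out.head(n)
```

The percentage is taken over all emails in the period, not only over the top n senders, so the column does not have to sum to 100.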
In [7]: number of emails in a time frame
I was wondering if it would also be possible to indicate the number of threads versus single emails (with no responses) to get a sense of how responsive a mailing list is in a certain time period.
Yes
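One way this could be done, as a rough sketch: treat a message as the start of a conversation if it has no parent in the selected period, then check whether anything replies to it. The column names "Message-ID" and "In-Reply-To" and the function name are assumptions for illustration; the DataFrame is expected to be already filtered to the time frame.

```python
import pandas as pd

def thread_vs_single_counts(emails):
    """Count conversation starters that got a reply vs. those that did not.

    `emails` is assumed to have "Message-ID" and "In-Reply-To" columns
    (hypothetical names) and to be already restricted to the time frame.
    """
    ids = set(emails["Message-ID"])
    parents = emails["In-Reply-To"]
    # A root has no parent, or its parent falls outside the selected messages.
    is_root = parents.isna() | ~parents.isin(ids)
    # Every Message-ID that some other message replies to directly.
    answered = set(parents.dropna())
    root_ids = emails.loc[is_root, "Message-ID"]
    has_reply = root_ids.isin(answered)
    return {
        "threads": int(has_reply.sum()),
        "single_emails": int((~has_reply).sum()),
    }
```

Note that a reply whose parent was sent before the period starts is counted here as a conversation starter; whether that is the desired behaviour is a design choice.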
In [8]: I would be interested in, for instance, the average number of emails per user, across multiple years.
It seems that currently the numbers presented are not the average across the time specified but the absolute counts. That might also be because, for the test run, I set
date_from = pd.datetime(2014,10,1,tzinfo=pytz.utc)
date_to = pd.datetime(2015,11,30,tzinfo=pytz.utc)
which is only a little over a year.
OK, I think I misunderstood your question before. So you want, for each user, to count how many emails she sent in the timeframe and divide that by the number of years? If so, be aware that if the timeframe is not a whole number of years, the partial years at either end will skew the averages. Also, do you mean a year as in 365 days from date_from, or a calendar year as in 2015, 2016, 2017, ...?
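To make the two readings of "year" concrete, here is a small sketch of both. It assumes a DataFrame with a "From" column and a timezone-aware "Date" column; the column names and both function names are hypothetical.

```python
import pandas as pd

def emails_per_user_per_calendar_year(emails):
    """Table of email counts: one row per sender, one column per calendar year."""
    with_year = emails.assign(year=emails["Date"].dt.year)
    return with_year.groupby(["From", "year"]).size().unstack(fill_value=0)

def average_per_elapsed_year(emails, date_from, date_to):
    """Emails per sender divided by the window length in 365-day years."""
    in_range = emails[(emails["Date"] >= date_from) & (emails["Date"] <= date_to)]
    # Fractional number of 365-day years between the two bounds.
    n_years = (date_to - date_from) / pd.Timedelta(days=365)
    return in_range["From"].value_counts() / n_years
```

With the calendar-year table, `table.mean(axis=1)` gives the average per sender, but a first or last year that is only partly covered by the data will pull that average down. The elapsed-year version avoids this by dividing by the fractional length of the window.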
I'll try to incorporate these changes tomorrow, hope it's not too late!
Cheers,
Davide