[Bigbang-dev] Big Bang Notebook Questions Corinne

Corinne Cath corinnecath at gmail.com
Tue Jul 24 17:51:22 CEST 2018


Hi Davide,

Many many thanks! So for the last one, I would like for each user how many
emails they sent in each year and how those relate to each other. My
phrasing is a bit clumsy, my apologies allow me to demonstrate:

So if Corinne sent 10 emails in year 1 and 20 in year two, that is a 50%
increase of her emails. I guess I could also do that by head/calculator.

I will also send a longer email to the list explaining some issues I ran
into today!

Best,



On Tue, Jul 24, 2018 at 5:24 PM, Beraldo, Davide <d.beraldo at uva.nl> wrote:

> Hi Corinne,
>
> sorry I totally forgot to get back on this. Responding on coding-related
> questions.
>
>
> *    In [6]: top senders over a time period*
>
>     Currently, it gives the absolute numbers, for example, niels ten oever
> 77.0
>
>     It would be great to know what percentage that represents of the total
> number of emails sent in the     period specified. So, those 77 emails are
> they 0.5, 5, 50% etc of the total over that period?
>
>
> Easy peasy
>
>
> *    In [7]: number of emails in a time frame*
>
>     I was wondering if it would also be possible to indicate the number of
> threads versus single emails (with no responses) to get a sense of how
> responsive a mailing list is in a certain time period.
>
>
> Yes
>
>
>
>     In [8]: I would be interested in, for instance, the average number of
> emails per user, across multiple years.
>
>     It seems that currently, the numbers presented are not the average
> across the time specified but the absolute. That might also be because for
> the test run, I set
>
>
>
>
>     date_from = pd.datetime(2014,10,1,tzinfo=pytz.utc)
>
>     date_to = pd.datetime(2015,11,30,tzinfo=pytz.utc)
>
>     which is only a little over a year.
>
>
> OK I think I misunderstood your question before. So you want to do: for
> each user, count how many emails she sent in a timeframe, divide by number
> of years? If so, be aware that if you don't use a round number of years you
> would get inconsistent results on the tails. Also, do you mean year as in
> 365 days from date_from, or year as in 2015,2016,2017,...?
>
>
> I'll try to incorporate these changes tomorrow, hope it's not too late!
>
>
>
> Cheers,
>
> Davide
>



-- 
Corinne Cath
Ph.D. Candidate, Oxford Internet Institute & Alan Turing Institute

Web: www.oii.ox.ac.uk/people/corinne-cath
Email: ccath at turing.ac.uk & corinnecath at gmail.com
Twitter: @C_Cath
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.ghserv.net/pipermail/bigbang-dev/attachments/20180724/2d323734/attachment-0001.html>


More information about the Bigbang-dev mailing list