[Bigbang-dev] Organization metadata into bigbang.datasets ?

Sebastian Benthall sbenthall at gmail.com
Mon Apr 18 20:16:36 CEST 2022


Hello,

This issue #509 <https://github.com/datactive/bigbang/issues/509> was
perhaps misclassified as a later milestone and so hasn't come up in our
recent meetings.

It is essentially a proposal that we use submodules to make curated
datasets available via bigbang. Example documentation for data that links
email domains to a category is here:
https://bigbang-py.readthedocs.io/en/latest/datasets.html#

My understanding is that the richest dataset in the BigBang repository is
the organizations data, which is currently in examples/organizations:

https://github.com/datactive/bigbang/blob/main/examples/organizations/organization_categories.csv

How would you feel about a PR that moved this, and its .md metadata
document, to bigbang.datasets.organizations ?

I would quite like to include this dataset as part of the tutorial at CLBE,
so I would try to have this in by the end of the week for inclusion in the
0.4 release.

Cheers,
Seb
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.ghserv.net/pipermail/bigbang-dev/attachments/20220418/57140aba/attachment.htm>


More information about the Bigbang-dev mailing list