<div dir="ltr"><div dir="ltr">Hello,<div><br></div><div>We now have the start of a data file that sorts organizations into categories such as "Telecommunications Provider", "Network Equipment Vendor", and so on.</div><div><br></div><div><a href="https://github.com/datactive/bigbang/pull/396">https://github.com/datactive/bigbang/pull/396</a></div><div><br></div><div>I have tried using it to resolve the affiliations from the datatracker (shown in the image in the previous email in this thread).</div><div><br></div><div>However, there are some difficulties. There is a "long tail" of organizations, and it is hard to categorize them all. Below is a list of the top ~40 organizations and categories in _this_ data set, which is different from the one the original list was based on.</div><div><br></div><div>How can we develop a system for classifying these organizations?</div><div>The main take-home message, for what it's worth, is that the IETF is dominated by networking equipment vendors.</div><div><div><pre><b>Networking equipment vendor 4025</b>
<b>Telecommunications Provider 482</b>
Google 236
<b>Consumer hardware and software vendor 252</b>
<b>Research Institution 129</b>
Alcatel-Lucent 124
<b>China Mobile 123</b>
<b>Internet Registry 122</b>
Content Distribution Network 105
ZTE Corporation 84
Independent 83
VeriSign, Inc. 73
Oracle 68
Comcast 67
Old Dog Consulting 60
<b>Individual 58</b>
InterDigital Communications, LLC 56
Arrcus, Inc. 53
Qualcomm 52
<b>Software provider 49</b>
Intel 49
Internet Governance Body 48
Vigil Security 45
Telefonica 45
Isode Ltd 44
Universitaet Bremen TZI 42
Nortel Networks 41
IBM 40
Sun Microsystems 38
ETH Zurich 37
Neustar 36
NTT Communications 36
University of Aberdeen 35
Dell EMC 33
INRIA 33
Beijing Jiaotong University 32
Telecom Italia 32</pre></div></div></div></div>
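<div>One possible starting point for the classification question above, as a minimal sketch only: normalize each affiliation string, look it up in a category table, and keep the unmatched "long tail" in its own counter so it can be reviewed by hand, largest first. The names <code>normalize</code>, <code>resolve</code>, <code>SUFFIXES</code>, and <code>CATEGORY_BY_ORG</code> are hypothetical, and the three table entries are illustrative assumptions, not the contents of the data file in the pull request.</div>

```python
from collections import Counter

# Hypothetical list of legal suffixes to ignore when matching names.
SUFFIXES = ("inc", "llc", "ltd", "corporation")


def normalize(name):
    """Lowercase a name, drop commas and periods, and strip trailing legal suffixes."""
    words = name.lower().replace(",", " ").replace(".", " ").split()
    while words and words[-1] in SUFFIXES:
        words.pop()
    return " ".join(words)


# Illustrative lookup table (an assumption, not the real data file).
CATEGORY_BY_ORG = {
    normalize("China Mobile"): "Telecommunications Provider",
    normalize("Telefonica"): "Telecommunications Provider",
    normalize("ETH Zurich"): "Research Institution",
}


def resolve(affiliation_counts, lookup=CATEGORY_BY_ORG):
    """Split counts into per-category totals and still-unmatched organizations."""
    resolved = Counter()
    unmatched = Counter()
    for name, count in affiliation_counts.items():
        category = lookup.get(normalize(name))
        if category is None:
            unmatched[name] += count
        else:
            resolved[category] += count
    return resolved, unmatched


# Counts taken from the list in this email.
counts = {"China Mobile": 123, "Telefonica": 45, "ETH Zurich": 37, "Vigil Security": 45}
resolved, unmatched = resolve(counts)
print(resolved.most_common())
print(unmatched.most_common())  # review queue for manual categorization
```

<div>Keeping the unmatched names separate, rather than dropping them, means the next organizations to categorize are always the ones with the highest counts, so the long tail can be worked down gradually.</div>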