<div dir="ltr"><div dir="ltr">Hello,<div><br></div><div>We now have a start to a data file that splits organizations into categories, like "Telecommunications Provider", "Network Equipment Vendor", and so on.</div><div><br></div><div><a href="https://github.com/datactive/bigbang/pull/396">https://github.com/datactive/bigbang/pull/396</a></div><div><br></div><div>I have tried using this to resolve the affiliations from the datatracker (represented in the image in the previous email on this thread). </div><div><br></div><div>However, there are some difficulties. There is a "long tail" of organizations, and it is hard to categorize them all. This is a list of the top ~40 organizations in _this_ data set, which is different from the one that the original list was based on.</div><div><br></div><div>How can we develop a system for classifying these organizations?</div><div>The main take home message is that IETF is dominated by the networking equipment vendors, for what it's worth.</div><div><div><pre><b>Networking equipment vendor              4025</b>
<b>Telecommunications Provider               482</b>
Google                                    236
<b>Consumer hardware and software vendor     252</b>
<b>Research Institution                      129</b>
Alcatel-Lucent                            124
<b>China Mobile                              123</b>
<b>Internet Registry                         122</b>
Content Distribution Network              105
ZTE Corporation                            84
Independent                                83
VeriSign, Inc.                             73
Oracle                                     68
Comcast                                    67
Old Dog Consulting                         60
<b>Individual                                 58</b>
InterDigital Communications, LLC           56
Arrcus, Inc.                               53
Qualcomm                                   52
<b>Software provider                          49</b>
Intel                                      49
Internet Governance Body                   48
Vigil Security                             45
Telefonica                                 45
Isode Ltd                                  44
Universitaet Bremen TZI                    42
Nortel Networks                            41
IBM                                        40
Sun Microsystems                           38
ETH Zurich                                 37
Neustar                                    36
NTT Communications                         36
University of Aberdeen                     35
Dell EMC                                   33
INRIA                                      33
Beijing Jiaotong University                32
Telecom Italia                             32</pre></div></div></div><div class="gmail_quote"><div dir="ltr" class="gmail_attr"><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
</blockquote></div>
</blockquote></div></div>