[liberationtech] Fw: Concept for takedown-resistant publishing

Daniel Margo dmargo at eecs.harvard.edu
Thu Feb 2 17:02:37 PST 2012


Let me try to explain what is and isn't technically difficult about this 
idea. Feel free to pass it around. I have cc'd the liberationtech list.

Torrent is an open protocol. There are numerous open source torrent 
libraries. Opera can download torrents natively, for Firefox there are 
extensions to do this, and so far as I know every browser has some kind 
of plug-able framework for MIME type (Internet media type) handling. To 
have a browser download and then open a torrent of a page is totally 
realistic, and probably not that much code.

The problem is that this is not the actual technical challenge. The 
actual challenges are these:
1. Name resolution: How do I find the torrent file, and then the P2P 
cloud itself?
2. Updates: How do I share a Web site as a living, updating document?
3. Server backend: Is it realistic to run a modern Web site without a 
client-server relationship?

1. Name resolution comes in two parts: finding the torrent file, and 
then finding the actual P2P cloud. Web sites like The Pirate Bay 
function as name resolution services for finding torrent files, but 
obviously any single such server can be blocked. What I presume we 
actually desire is a name resolution service just like DNS for Web 
sites: you type in a URL, it gets resolved to a torrent file.

URLs have a hostname and a path, e.g. 
"www.hostname.com/path/to/torrent.file". "www.hostname.com" is resolved 
by DNS to a server, and then "/path/to/torrent.file" is resolved by that 
server. So unfortunately, you don't gain any takedown resistance by 
hosting your torrent file at "www.myserver.com/my.torrent", because if 
"www.myserver.com" is taken down, then "/my.torrent" can't be resolved. 
So what you would actually need is for DNS to provide hostname 
resolution directly into torrents. This is a sweet idea, but changing 
DNS is hard.

Even if you could do that, well, what's in a "torrent file"? A torrent 
file contains information to get you joined into the P2P cloud; 
specifically, it contains the address of a tracker, a central server 
that gates entry into the cloud. If the tracker is down or blocked by 
some technology, you can't enter the cloud. This is primarily a weakness 
of the BitTorrent protocol, and not the idea itself; there are 
extensions to BitTorrent and other P2P protocols that are resilient to 
this weakness. But fundamentally, finding your way into the P2P cloud is 
an act of name resolution to find other peers in the cloud, which is 
most easily done by a name resolution server, in this case the tracker. 
If we're going about making changes to DNS, probably the most 
technically sane (but politically unrealistic) solution would be for DNS 
itself to provide tracking capability for the cloud. Again, that is a 
sweet idea, but changing DNS is hard.

I don't mean to suggest these issues are insurmountable, merely that 
this is the actual Hard Part.

2.  In order to support dynamic content there has to be a cryptographic 
distinction between updates coming from the legitimate publisher of the 
site (Alice), and subterfuge coming from Evil Eve. Cryptographic 
authentication might be authenticated by the data being shared (e.g. 
this data is signed by Alice, and everybody knows her signature) or by 
the P2P network itself (e.g. in addition to sharing the torrent, the 
servers also provide a distributed authentication service that will only 
accept updates signed by Alice). I am by no means an expert on this 
subject, so I will refrain from talking about it extensively, but I 
bring it up merely because cryptography is non-optional for any dynamic 
scheme, and I'm not aware of any update-able, cryptographically-secured 
P2P torrents. It sounds like maybe they should exist? It also sounds 
Hard, and Google isn't turning anything up.

Again, I'm not suggesting these issues are insurmountable; in some sense 
Google Docs does all this. But they do it with a pretty sophisticated 
backend that glues many technologies together (I guarantee Google has a 
killer internal name resolution and authentication service), and I have 
no idea how Hard it would be to make those parts takedown-resistant (in 
the sense that there are no central servers. There are unquestionably 
central servers at Google).

3. The real elephant in the room is that modern Web sites are best 
thought of as programs, not files, and program distribution is 
infinitely harder than file distribution. When you visit a Wordpress 
blog, what you appear to receive is an HTML file: but in actuality that 
HTML was streamed by a PHP script running on the server talking to a 
MySQL database. The number of layers in this onion is arbitrary; it's 
anything you could run on a computer. Arbitrary code, or I could hook 
MSPaint up to the thing if I wanted it enough. Distributing *and 
executing* arbitrary code like this is Quite Possibly Impossible, and if 
it is possible it is Very Far Away. At any rate, BitTorrent can't do this.

The word "arbitrary" is important. In specific cases, you can certainly 
find a case-specific resolution. Diaspora is building something like a 
distributed social network "program", and I do honestly believe that 
with a lot of code and hard thinking, you could distribute a Wordpress 
blog's backend on P2P. It would probably require a total rewrite of 
Wordpress such that it wouldn't even be the same piece of software, and 
you would have to solve the other Hard Problems above, but I think it is 
technically possible at this time. But in the general case, you might 
have more luck working towards a Singularity and then asking the 
Machine-Gods for an answer.

What seems more realistic to me is taking snapshots of a Web site and 
distributing those as files instead, rather than trying to distribute 
the actual Web site program. That's why we designed Mirror As You Link 
to work that way.

These are all the hard issues I can think of, as a technical person with 
some distributed systems background. There may be others, since in some 
cases we're really exploring uncharted waters here.
- Daniel Margo

On 02/02/2012 10:40 AM, Kendra Albert wrote:
> Some interesting stuff
>
> ---
> Kendra Albert
>
> Research Assistant to Jonathan Zittrain
> Berkman Center for Internet and Society
> Harvard University
>
> Forwarded message:
>
>> *From:* Jonathan Zittrain <zittrain at law.harvard.edu>
>> *To:* Rob Faris <rfaris at cyber.law.harvard.edu>
>> *Cc:* emstark at gmail.com, Kendra Albert 
>> <kalbert at cyber.law.harvard.edu>, Alicia Solow-Niederman (Google Docs) 
>> <asolowniederman at gmail.com>
>> *Date:* Wednesday, February 1, 2012 6:08:49 PM
>> *Subject:* Re: Fwd: [liberationtech] Concept for takedown-resistant 
>> publishing
>>
>> thanks
>>
>> At UTC-5 03:51 PM 2/1/2012, Rob Faris wrote:
>>> I have no idea if this has any merit - but think of you when I see 
>>> content mirroring.
>>>
>>>
>>> -------- Original Message --------
>>> Subject: [liberationtech] Concept for takedown-resistant publishing
>>> Date: Wed, 01 Feb 2012 21:49:42 +0100
>>> From: Fabio Pietrosanti (naif) <lists at infosecurity.ch> 
>>> <mailto:lists at infosecurity.ch>
>>> To: liberationtech at lists.stanford.edu 
>>> <mailto:liberationtech at lists.stanford.edu>
>>>
>>>
>>>
>>> Hi all,
>>>
>>> with a brainstorming with various persons it went out an idea of a
>>> concept of a takedown-resistat publishing system for website.
>>>
>>> I'm writing here to submit the concept idea to contribution/critics,
>>> maybe someone will do it someday (if feasible).
>>>
>>> I tried to abstract two concept:
>>>
>>> a) What is a website?
>>> A website is a set of information (text/images) that can be accessed by
>>> a user trough a web browser.
>>>
>>> b) What do you need to be takedown-resistant?
>>> Have a lot of mirrors, on as many server as possible around the globe.
>>>
>>> c) How do you make a lot of mirror available?
>>> By making use a very simple and diffused technology for the mirroring,
>>> possibly something that's already diffused across a lot of users.
>>>
>>> d) Which is the most known/used information distribution/mirroring
>>> system used nowday?
>>>
>>> Bittorrent P2P system.
>>> Bittorrent is the system that today is most used to do file-sharing,
>>> de-facto spreading, distributing, mirroring files.
>>> Bittorrent software is widely available and heavily diffused among
>>> users.
>>>
>>> So, if it would be possible to let a browser easily access the content
>>> available on bittorrent, it would be possible to easily create a
>>> scalable, distributed and resistant network of mirrors capable of
>>> resisting takedown.
>>>
>>> Obviously this require:
>>> - That the content on bittorrent should be file containing all the
>>> elements of the website glued together
>>> - That the web browser should have capability to access a such content
>>> on Bittorrent
>>>
>>> I am not entering into the technological discussion, if it's feasible or
>>> not feasible (maybe not, maybe yes).
>>>
>>> As a concept, what do you think? Other ideas/extensions?
>>>
>>> -naif
>>> _______________________________________________
>>> liberationtech mailing list
>>>
>>> liberationtech at lists.stanford.edu  <mailto:liberationtech at lists.stanford.edu>
>>>
>>> Should you need to change your subscription options, please go to:
>>>
>>>
>>> https://mailman.stanford.edu/mailman/listinfo/liberationtech
>>>
>>> If you would like to receive a daily digest, click "yes" (once
>>> you click above) next to "would you like to receive list mail
>>> batched in a daily digest?"
>>>
>>> You will need the user name and password you receive from the list
>>> moderator in monthly reminders.
>>>
>>> Should you need immediate assistance, please contact the list moderator.
>>>
>>> Please don't forget to follow us on
>>>
>>> http://twitter.com/#!/Liberationtech  <http://twitter.com/#%21/Liberationtech>
>



More information about the liberationtech mailing list