Elliott C. Back: In Aere Aedificare

Hitman iTunes Download Free

Posted in Deals & Savings, Apple, bit torrent, bittorrent by Elliott Back 1 week, 2 days ago.

Don’t download the Hitman movie with Bittorrent when Apple will give it to you free on iTunes. Just hit “Browse” on the Quick Links on the right side of the main iTunes home page. Then, select “Movies” and “Thriller.” You’ll see the title “Hitman” for free, just 1.1GB away:

hitman-free-itunes.png

You might not want to watch it, Rottentomatoes gave it a 15%.

Mark Cuban’s P2P Ideas Suck

Posted in P2P, Scalability, Performance, bit torrent, bittorrent, Celebrities by Elliott Back on November 25th, 2007.

In a three-part rant about peer-to-peer technologies (1, 2, 3), Mark Cuban demands that peer-to-peer technologies “die a quick death” in order to”speed up [his own] internet connection.” He suggests that “Google Video is a far better solution for audio and video distribution than any P2P solution” and that cable companies “charge for upstream bandwidth usage.”

Guess what–I already get charged for all the bandwidth I use, either up or down. When Verizon strings a fiberoptic cable to my home, I’m getting a certain amount of fixed capacity into the greater internet at large. If I want to trade a little upstream capacity for greater downstream capacity, that’s my call! Have you ever noticed that downloading over http is typically slow because there are 100s of clients and 1 host? If I download the same information over bittorrent, I can sustain 12Mbs because everyone is a server–including me. Distributed protocols, such as the ones powering Amazon Dynamo or bittorrent, are more efficient, cost effective, and fault tolerant than single-server models.

Reactions around the blogosphere indicate that Mark Cuban’s thoughts on P2P are nonsensical rubbish. Mashable calls him “a guy who does not understand how P2P works, and yet he wants it shut down.” Ars Technica notes that “if users who are currently saturating their connections with BitTorrent start saturating their connections with Google Video content, the end result is more or less the same.” And a slashdotter comments, “Just imagine how fast the internet would be if there were no content to view. After P2Ps gone, get rid of all these freeloading websites, emails, etc. and it will be blisteringly fast.”

My guess is that billionaire Mark Cuban has a slow, shared cable internet connection at home, the modern equivalent of a party line. This might lead him to confuse his own slow internet connection with a greater systemic problem. What he should be complaining about is why Verizon hasn’t strung fiber in his area yet.

Inside Elite P2P Filesharing Networks

Posted in Computers & Technology, Cornell University, Law, Scandal, bit torrent, bittorrent, Copyright by Elliott Back on September 1st, 2006.

An Introduction

You’ve heard that private file sharing networks exist, but you’ve probably never had a chance to explore one from the inside. These networks of software, music, television, and movie pirates often are run on the internal network infrastructure of private educational institutions. Because a university network has a fixed set of IP addresses, college pirates can run DC++ and write simple scripts to only allow users from the internal IP pool, or even the residential dormitory pool. This prevents unwanted interference (RIAA, MPAA, Police) with the network by simply making it invisible to the outside world. Also, most university networks are lightly-satured high-speed ethernet, giving student pirates the bandwidth to share large files.

riaa.gifWhile I attended Cornell University, students there ran a large DC++ hub to share files. There were anywhere between 1000 and 2000 users of the DC++ hub, which provided access to terabytes of shared files. Before I left the University to work, I transfered a complete set of users’ file lists to my home computer for later analysis. With 1215 XML file lists from DC++, I wrote a few perl scripts to calculate metrics on the 600mb data set.

Interestingly, the DC++ hub appears to still be around at its old redirect address thchub.no-ip.com:3307. Apparently a student r253141224 is hosting the service on his dorm computer 128.253.141.224.

Data From 20,000 Feet

From the file lists I have, there were 2,456,462 unique files, 5,424,446 total files, 19.07 unique terabytes, and 75.55 total terabytes. Here’s a histogram and data listing of the most popular file types:

file-types-histogram.jpg

mp3	1857432
jpg	828815
m4a	312173
png	264820
gif	224034
avi	203304
dll	133889
wma	116851
htm	82130
zip	79114

The file types follow a classic long-tail distribution, and let us query the data in more interesting ways. For example, for avi movie files, what were the most popular file names? Here’s the top 20:

crash.avi	90
pulp fiction.avi	76
garden state.avi	74
office space.avi	74
good will hunting.avi	72
wedding crashers.avi	67
sin city.avi	66
lost - 2x05 - ...and found.avi	65
super troopers.avi	63
zoolander.avi	60
robin hood - men in tights.avi	59
lost - 2x09 - what kate did.avi	58
eternal sunshine of the spotless mind.avi	57
lost - 2x04 - everybody hates hugo.avi	57
memento.avi	57
american beauty.avi	55
batman begins.avi	55
mean girls.avi	55
lost - 2x07 - the other 48 days.avi	54
old school.avi	54

We can take advantage of common patterns in the data to try and find other patterns, but I’ll save that for another day, and another post in what will undoubtably become a series.

Leveraging the Sharing Culture

Posted in Music, P2P, YouTube, bit torrent, bittorrent by Elliott Back on July 10th, 2006.

It is inevitable that piracy occur. My age group, the college student or young professional, almost exclusively acquires media through illegitimate channels. It doesn’t make sense to pay for what we can get for free, especially when downloading requires less effort and gives more immediate gratification. A trip to the record store: 30 minutes of your time gone. A download of a new album: 5 minutes of computer time gone. So, the clear choice is casual music piracy, a fact of modern American youth culture that the RIAA has yet to properly address.

piracy.jpg

  • The problem: not selling records.
  • The solution: not selling records.

As piracy leads to the commodization of popular music, revenue cannot be generated from sale of albums. Rather, popular music will be created for relicensing and branding. Commercial deals, inclusion in movie soundtracks, elevator music, radio royalties will be the primary source of income for records. Eventually there will be a “song” or “band” of Coca Cola company, ties between musicians, labels, and other corporations that deliver content to the public. It is these ties, not the content itself, which can be negotiated for money.

Conversely, selective cooperation with music piracy could lead to labels and artists reaping greater profits. As the revenue a song or album brings in becomes tied to its popularity, leaking a hot single before an album release becomes smart. Take, for example, the widely-released new single from Justin Timberlake, Sexy Back. It was released on MySpace, leaked to the torrent sites, and played all over the radio. While releasing singles before albums is not a new strategy, letting the internet believe they’ve stolen something hot, secret, and new will improve its appearance with the masses.

A song sampled in stolen solitude seems a lot better than something the industry is force feeding its market.

Bandwidth & Connections Limiting: A How To Guide

Posted in My Blog, Computers & Technology, Code, Scalability, Performance, bit torrent, bittorrent by Elliott Back on June 20th, 2006.

I wanted to host for my brother the latest WoW patch, but at a hefty 450 MB, I didn’t want to blow all my bandwidth on it either. I am setting a limit for the download at 2.5 TB of bandwidth, and limiting it to 5mbs at 10 connections a second. My hardware is RHEL 4 running on a P4 with Plesk 8. In the guide to follow, you’ll see a few steps that wouldn’t be needed on a non-plesk system. To setup bandwidth limiting for the host, we need to be able to compile a new apache module against our system, and then install and configure it.

Step 1: The prereqs

First I needed to setup a yum repository for FC4. This can be accomplished by issuing an RPM command:

rpm -ivh http://rpm.livna.org/livna-release-4.rpm

Now that you can access the RPM packages, we need to install http-devel using yum:

yum install httpd-devel

If all went well, we can now extract, compile, and install mod_cband for apache:

cd /tmp
wget http://cband.linux.pl/download/mod-cband-0.9.7.4.tgz
tar xzvf mod-cband-0.9.7.4.tgz
cd mod-cband-0.9.7.4
./configure
make
make install

If all went well, restart apache with the new module (you can check httpd.conf to make sure the module is going to be loaded):

/etc/init.d/httpd restart

Step 2: Configuration

Here’s where the Plesk part gets annoying. We can’t just edit our vhosts file, because plesk writes over it all the time. Instead, we edit a /home/httpd/vhosts/*/subdomains/*/conf/vhost.conf file. Mine looks like this:

CBandScoreboard /var/www/scoreboard
CBandPeriod 4W
CBandDefaultExceededCode 509
CBandLimit 2500G
CBandSpeed 5000 5 10
CBandRemoteSpeed 1600 3 1
<Location /cband-status>
  SetHandler cband-status
</Location>
<Location /cband-status-me>
  SetHandler cband-status-me
</Location>

What does this mean? (1) Use /var/www/scoreboard to log usage and limits (2) Reset the limit count every 4 weeks (3) Throw a 509 error when the limits are exceeded (4) Allow 2.5TB per period (5) Allow 5mbs with 5 requests a second and 10 connections at a time oeverall (6) Allow 1.6mbs with 3 requests a second and 1 connections at a time per client (7) Allow us to access a page at /cband-status to view the status.

You’ll also need to issue commands to make the scoreboard directory, and to allow apache to own it:

mkdir /var/www/scoreboard
chown apache:apache /var/www/scoreboard

Finally, Plesk requires you to inform it of your work:

/usr/local/psa/admin/sbin/websrvmng -u --vhost-name=yourdomain.com

If you restart Apache now, everything should work!

/etc/init.d/httpd restart

Step 3: An Example

You can take a peek at my status page right now if you’d like. It looks a bit like this:

cband-panel.jpg

Update: Fixed a little formatting.

Next Page »