Elliott C. Back: In Aere Aedificare

How many users does DIGG have?

Posted in Blogging, Science, Web 2.0, Scalability, Quantitative by Elliott Back on February 3rd, 2008.

When John Graham-Cumming asked the question How Many Users Does Digg Have?, there were a few things he couldn’t tell you, since his data consisted of randomly self-sampled users. Well, with the power of two PHP scripts, we can pull large amounts of user data and form queries. Our first question is how has DIGG grown over time?

digg-users-over-time.png
A graph of 187,054 digg users, randomly plotted against when they joined

This doesn’t tell us much, though, about how many DIGG users there actually are, or how active they are, so I plotted a histogram of the number of times these 200k users’ profiles had been viewed; the answer, unsurprisingly, is not very often in most cases:

digg-profile-views-histogram.png
83% of users had less than 50 profile views

And what about users who are active? How many people are digging stories every day? The answer is very few. I took a sample of 29,225 users from the previous sample (randomly) and used the DIGG API to query for their last digg. It turns out 31% (9125) had never dugg anything! After I removed those, here is the histogram I got:

digg-last-dugg.png
About 15% of Digg users dugg a story in the last week

Concluding thoughts

Digg boasts an official tally of 2.2M users, but at most 20% of them can be considered real, active users. That would bring their user count down to 440,000, far far less than a popular web 2.0 boom child can boast about, and significantly hurting that $300M (or ~$700 a user) valuation that they keep trying to get.

Code Appendix

The {digg user, time joined, digg id, profile page views} information was gathered by the following script:

<?php
    error_reporting
(E_ALL);
    
ini_set(‘user_agent’‘My-Application/2.5′);
    
ini_set(“include_path”“.:/usr/share/pear”);
    require_once 
‘Services/Digg.php’;
    require_once 
‘Services/Digg/Response/php.php’;

    $base ‘http://services.digg.com/users/?appkey=http://example.com&type=php’;
    
$data unserialize(file_get_contents($base.‘&count=0′));
    
    
$total $data->total;
    echo 
“There are $total total users\n”;
    echo 
“ID,Number,Name,Date,Views\n”;

    for($i 0$i 1000$i++){
        
$offset rand(0$total 100);
        
$data unserialize(@file_get_contents($base.‘&count=100&offset=’.$offset));

        $j 0;
        foreach(
$data->users as $user){
            
$page = @file_get_contents(‘http://digg.com/users/’.$user->name.‘/’);

            if(!$page)
                continue;

            preg_match(‘/id=”userid” value=”(\d+)”/i’$page$matches);
            echo 
$matches[1] . “,”;
            echo (
$offset $j++) . “,”
            echo 
$user->name “,”;
            echo 
$user->registered “,”;
            echo 
$user->profileviews .“\n”;
        }
    }
?>

This entry was posted on Sunday, February 3rd, 2008 at 2:34 pm and is tagged with users profiles, php scripts, john graham, histogram, profile views, digg, cumming, page views, pear, tally, appendix, graph, api, boom, lt, queries, nbsp. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback.

Leave a Reply

Powered by WP Hashcash

Fresh, related resources:

Supplied by Google Blog Search
  • Buried as Inaccurate
    Does Digg's favorite candidate, Obama, campaign by insulting those above him? No, he works his ass off and climbs his way up. If you admire Obama like I do, then be like him and take the high road. Just keep submitting, trying different ...
  • Conquering Kings
    The key is you won’t beat Digg or any of the others if you take the same approach as they do. There really isn’t that much differentiation between them so another site in this field that does what they do with one more added feature ...
  • The Rise of Contextual User Interfaces
    It was considered unorthodox and even dangerous to present the interface in non-standard ways because everyone believed that users were, to be frank, stupid, and wouldn't want to deal with anything other than what they were used to. ...
  • The Soul of the Internet
    Simple. Twitter is allowing for the increased connection between people, and the increased empathy of users. On no other platform have I seen groups so quickly mobilize to help each other. It is about people helping people, ...
  • Life on the investment trail
    It is ok if you localize when you have built the product, but at least make it very easy to do by separating the language text files of the interface. Obvious? Yes. Do not forget that many languages have words much longer than english ...