Continuing my partnership with the great blog directory Blogwise, I analysed generic first name data on 24,846 different blogs. My goal was to determine how many men versus how many women are blogging, and what the most popular blogger names are for each gender are. To do this, I compared the raw data against US Census first names data, which lists 1000 male and female names, ranked by frequency. If there is a tie between these lists for a given name, I choose the one with a frequency twice as much as the other, or else ignore it. This rarely occurs. Then, there are plenty of other first names that are harder to classify. Those fall into the backup lists, which are simply long lists of male / female names I’ve trolled off the internet. Finally, if it can’t match that, it gives up. This brings me to the following “raw numbers” result:
That’s right. Out of the 18,938 names I could identify, 76.8% of them are male and only 23.2% female:
Both men and women’s name form a typically power law distribution, where the few most popular names account for most of all names. Note that these are log-y graphs:
And finally, the 10 most popular men’s names:
And the 10 most popular women-blogger’s names:
If this makes you feel lonely, women, it should. 50% of america is female. 50% of bloggers of bloggers should be female. I won’t even *start* to hand out blame or speculate why–I’ll leave that for the comments–but it looks like there’s work to be done. For more information about blogging and naming, see What do you call your blog.
|This entry was posted on Saturday, July 23rd, 2005 at 2:08 am and is tagged with james jason, power law distribution, lonely women, us census, raw numbers, names john, first names, many men, sarah michelle, raw data, blogger, bloggers, graphs, blogging, heather, men and women, partnership, chris mark. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback.|