We have a GoogleBOT
The Google Blog announces a wall mural of their Googlebot at one of the datacenters:

I suppose they want to humanize their lil’ crawler.
Privacy in the marketplace
The privacy pundits over at BoingBoing are hot and bothered because a tanning salon requires fingerprint identification to authenticate its customers. In the post, the original author writes:
WAYNE: “Hi, do you require a thumbprint scan to get a tan there?”
TANNING BIMBO: “Yes, sir, we do.”[…]
I think the Arkansas chapter of the ACLU and the Arkansas state attorney general’s office need to be contacted
I think the answer to this is that you don’t need to use that tanning salon. If you dislike their “invasion” of your biometric privacy, you’ll have to go somewhere else.
Latent Semantic Indexing can improve your WordPress search results
Latent Semantic Indexing (LSI) can improve the quality of WordPress search results dramatically. Rather than just looking for any one of a set of keywords in the body of your posts, LSI creates a low-rank approximation of the relationship between your blog posts and the words you use. Since the approximation has much lower rank than the original term-document matrix, words with related semantic value (e.g., “Microsoft” and “Bill Gates”) become associated, and a search for one term will also return posts that are closely related to it.
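To make the idea concrete, here is a minimal sketch in Python with NumPy. It is not the code behind the results on this blog: the five toy "posts", the choice of k = 2, and the helper names `fold_in` and `search` are all assumptions made up for illustration. It builds a small term-document matrix, keeps the top k singular vectors, and ranks posts by cosine similarity to the query in that reduced space.

```python
import numpy as np

# Toy corpus (hypothetical posts, invented for this sketch).
docs = [
    "microsoft windows software",
    "bill gates microsoft founder",
    "bill gates philanthropy",
    "rap music lyrics",
    "ludacris rap lyrics",
]

vocab = sorted({w for d in docs for w in d.split()})
index = {w: i for i, w in enumerate(vocab)}

# Term-document matrix: one row per term, one column per post.
A = np.zeros((len(vocab), len(docs)))
for j, d in enumerate(docs):
    for w in d.split():
        A[index[w], j] += 1.0

# Rank-k approximation via the SVD (k = 2 is an arbitrary choice here).
k = 2
U, s, Vt = np.linalg.svd(A, full_matrices=False)
Uk = U[:, :k]

# Each post projected onto the k latent term directions.
doc_vecs = (Uk.T @ A).T  # shape: (number of posts, k)


def fold_in(query):
    """Project a keyword query into the same k-dimensional space."""
    q = np.zeros(len(vocab))
    for w in query.split():
        if w in index:
            q[index[w]] += 1.0
    return Uk.T @ q


def search(query):
    """Return (post index, cosine similarity) pairs, best match first."""
    qv = fold_in(query)
    qnorm = np.linalg.norm(qv)
    if qnorm == 0.0:
        return []  # no query word appears in the vocabulary
    sims = doc_vecs @ qv / (np.linalg.norm(doc_vecs, axis=1) * qnorm)
    order = np.argsort(-sims)
    return [(int(i), float(sims[i])) for i in order]


for i, score in search("gates"):
    print(f"{score:6.3f}  {docs[i]}")
```

A plain keyword search for "gates" would only match the two posts that contain the word. In this sketch the "microsoft windows software" post scores just as highly, because "microsoft" co-occurs with "bill gates" elsewhere in the corpus, while the two rap posts score near zero.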
Some examples:
Take a look at this query for “writing good code”. Naturally, you would like WordPress to return articles about coding practices, or at least about computer programming! However, the first three matches are Ludacris lyrics, ethical blogging, and finally something useful: Microsoft interview tales. Now, take a look at what I get back with LSI: Google Desktop Search, Heavyweight Categories plugin, and Things I want to do for WordPress. These, to me anyway, seem a little more relevant. And, if you do try looking for “rap music” with the LSI technique, one of your results is Pot Smokers = Psychotic. Now how relevant is that?
If you need more proof, “pop culture” gives me Paris Hilton, and “sex” returns The “really big” boys get it wrong.
Some downsides:
To do LSI, you have to create a term-document matrix, which will be really big. Mine is 12,525 x 726, and takes up 40 MB of space in full form. Of course, it’s a very sparse matrix, so you can store it in a sparse structure and save most of the space. However, you still have to compute the SVD of that huge matrix, and do a number of painful matrix multiplications and solves. In other words, LSI is a little slow for a web application. Queries on my P4 here at home take as long as a minute to run; imagine the wait on a loaded server!
Still, the results are astounding, and the WP devs should definitely code up a hack!
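For a rough sense of the storage numbers, here is a back-of-the-envelope sketch. The 4-byte entry size, the coordinate-list (row, column, value) layout, and the 1% density are assumptions picked for illustration; the actual number of non-zero entries in my matrix is not given here.

```python
ROWS, COLS = 12_525, 726  # matrix dimensions quoted above


def dense_bytes(rows, cols, bytes_per_value=4):
    """Storage for a full matrix, assuming 4-byte entries."""
    return rows * cols * bytes_per_value


def coo_bytes(nonzeros, bytes_per_index=4, bytes_per_value=4):
    """Storage for a coordinate list: (row, col, value) per non-zero entry."""
    return nonzeros * (2 * bytes_per_index + bytes_per_value)


dense = dense_bytes(ROWS, COLS)

# Hypothetical density: assume 1% of the entries are non-zero.
assumed_density = 0.01
nonzeros = int(ROWS * COLS * assumed_density)
sparse = coo_bytes(nonzeros)

print(f"dense:  {dense / 1e6:.1f} MB")
print(f"sparse: {sparse / 1e6:.1f} MB at {assumed_density:.0%} density")
```

With 4-byte entries the dense figure lands in the same ballpark as the 40 MB quoted above. The sparse figure depends entirely on the assumed density, so treat it as an order-of-magnitude guess.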
A Jordanian man shot dead his divorced sister …
A Jordanian man shot dead his divorced sister after seeing her photo on his friend’s camera-equipped cellphone in the latest “honour” killing in the kingdom, hospital officials said Monday.
The unidentified man shot the 31-year-old mother twice in the head Sunday night and then turned himself in to police saying he committed the murder to “cleanse his family’s honour”.
The incident is the fifth example of a so-called honour killing in Jordan this year. Those found guilty usually face sentences of a maximum of one year in jail under Jordanian law.
Google Adsense: Three Facts & 1 Theory
Three facts:
Fact: Google needs to post significantly greater-than-average earnings to meet high market price points. If its stock is going to exceed $250 anytime soon, earnings must be exceptional. [source]
Fact: Google’s cut of Adsense revenue has been shown to vary unpredictably. [source]
Fact: Most of Google’s income comes directly from Advertising. [source]
One Theory:
By taking a percentage of Adsense revenue according to a specified random distribution, Google can infuse cash into its company at any time, without raising suspicion. Its profits are exceptional, and no one will notice. Even more, if it takes a bigger cut from smaller customers, it can leverage the long tail effect of blog traffic to maximize profits. People who aren’t getting lots of traffic won’t expect to make much anyway.

