Elliott C. Back: Internet & Technology

Cuil Sucks At Search (Go Google!)

Posted in Google, Search by Elliott Back on July 29th, 2008.

I love the idea behind Cuil, the latest search engine in a long list of failures (Mahalo, Ask, Powerset) to challenge Google. As Mashable explains, they are pulling out all the stops to hit Google from multiple directions across their core search competency:

Enter Cuil, a very serious competitor, packed with ex-Googlers (Tom Costello and Anna Patterson are the backbone of Cuil, and they’ve both worked at Google), and claiming to have the largest index of websites – 120 billion – in the world.

It doesn’t end there: Cuil pulls pretty much every trick in the book. Big claims about the biggest index, privacy concerns (IP addresses of users aren’t saved, making it impossible for a third party to request it from them), semi-semantic approach (Cuil’s engine recognizes the relations between certain words on a web site, which helps it rank pages better). Hell, they even pulled the energy-saving trick: the front page of Cuil is completely black, in contrast to Google’s eye-poking whiteness.

Check out the Slashdottie thread for more discussion. I’m not interested in going there; rather I’m more concerned with how relevant the results from Cuil are, compared to Google, in a stricter context of information retrieval. After all, a search engine is about finding information.

Let’s start with a query “how to rip a dvd” in Cuil and Google:

Cuil on “How To Rip a DVD”

cuil-how-to-rip-a-dvd.png

4 of the 9 total results are spam from Ebooksbay. An additional 4 are for converting MP3s. The final result (which is quite spammy) is for ripping DVDs to a variety of formats. Score: 11%.
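The 11% figure matches a simple precision calculation: one relevant result out of nine. A minimal sketch of that arithmetic follows; the function name `precision` and the hand-assigned labels are illustrative assumptions, not a formal evaluation method used in this post.

```python
def precision(labels):
    """Return the fraction of results judged relevant.

    `labels` is a list of booleans, one per search result,
    where True means the result was judged relevant by hand.
    """
    if not labels:
        return 0.0
    return sum(labels) / len(labels)


# Hypothetical hand labels for the nine Cuil results described above:
# 4 Ebooksbay spam results, 4 MP3 converters, 1 DVD-ripping result.
cuil_labels = [False] * 4 + [False] * 4 + [True]

print(f"Cuil precision: {precision(cuil_labels):.0%}")
```

Labelling each result by hand is subjective, so a number like this is only a rough guide to relevance.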

Google on “How To Rip a DVD”

google-how-to-rip-a-dvd.png

Google gives you 7 DVD ripping guides and three spam sites for ripping software. Essentially, you have to give it a Score: 100%, since it’s pretty much the baseline in our test. Just based on what I’ve seen so far, this will be a comparison not of relative merits, but of how much less relevant the results from Cuil are compared to Google.

Cuil on “ConcurrentHashMap”

cuil-concurrenthashmap.png

Wait, what is that in the rightmost result!!!? Yes, that winsome young woman is carefully inspecting a ConcurrentHashMap! Ahem, bad image / search result correlations aside, the search listings fail to list the authoritative Java documentation source (Sun’s website) and instead list 2 mirrors (Java 5 and 6), 4 bug reports, 3 mailing list discussions, and 2 random libraries with a similarly named class. Score: 50%.

Google on “ConcurrentHashMap”

google-concurrenthashmap.png

Google nicely gives us the Sun Java page as the first result, 2 snippets of code using this class, 6 guides to using concurrent hash maps, a benchmark, one of the same random libraries as Cuil (Oswego), and a different random library (backport-util). I’d give them Score: 80% at this task.

Anyway, I’m getting tired of writing this. Cuil just doesn’t deliver fast, consistent, high-quality search results. The relevance is quite low, in spite of the interface improvements and searching / clustering / recommendation features.

SearchMe: Visual, Clustering search

Posted in Apple, Google, Interface, Search, Web 2.0 by Elliott Back on April 27th, 2008.

The more I look at visual search engine SearchMe, the more I like it. In a way that text-based search engine Google has never done, SearchMe brings thumbnails to search results without losing any of the textual indicators we need to judge relevance. SearchMe is also innovating in clustering search results into categories or topics, something Google has experimented with in their Sets demo but never built into the larger search engine. Perhaps the best way to show you how much more relevant SearchMe can be is through a short example, searching for “Obama.”

searchme-obama-1.png

The first thing I get, as I type “Obama,” is a list of categories that SearchMe finds relevant. I click on “Politicians” and it takes me to the next screen, the main area for exploring search results:

searchme-obama-2.png

There are a few features you should note that set the SearchMe results apart from their competition. First, they keep the list of categories you’re interested in just one click away from instant filtering at the top of the results. Second, all of the available space of the page is filled with a gigantic preview of the search results. The title of the website is shown at the bottom, along with the site URL when you mouseover the results. Essentially, their search results are a better version of Apple’s coverflow, applied to websites. Clicking on a preview will take you directly to the page of interest, in the same tab, just like most search engines do today.

searchme-scientology.png

Their dynamic snippets code is nice, as well, highlighting the search terms you used in multiple colours. It appears to have been implemented directly in the coverflow-like flash engine, or behind the scenes is coming back as a new layer of image, as it loads only after the high resolution preview has loaded. An unfortunate side-effect of their highlighting algorithm is that when searching for multiple words, like “Calderon de la Barca,” the words will be highlighted separately, even if found next to each other.
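One way to avoid that side-effect is to try the whole phrase before falling back to single words. The sketch below is only an illustration of that idea using Python's `re` module. It is not SearchMe's implementation, and the `highlight` function and its bracket markers are made up for this example.

```python
import re


def highlight(text, query, open_mark="[", close_mark="]"):
    """Wrap query matches in markers, preferring the full phrase.

    The full phrase is listed first in the alternation, so adjacent
    query words are highlighted as one unit. Single words are only
    matched where the whole phrase does not occur.
    """
    words = query.split()
    if not words:
        return text
    # Full phrase, allowing any run of whitespace between the words.
    phrase = r"\s+".join(re.escape(w) for w in words)
    # Individual words as a fallback, longest first.
    singles = sorted((re.escape(w) for w in set(words)), key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join([phrase] + singles) + r")\b",
        re.IGNORECASE,
    )
    return pattern.sub(lambda m: open_mark + m.group(0) + close_mark, text)


print(highlight("Pedro Calderon de la Barca wrote La vida es sueno",
                "Calderon de la Barca"))
```

Here the name is marked as a single span, while the stray "La" later in the sentence is still picked up as a single-word match.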

searchme-china.png

Not all their results work well; for example, searching for “China” leads me into irrelevance, regardless of the category I choose, and also brings up this half-rendered view of NBA China, which my own browser renders properly. Other search terms also return odd categories and funny previews, but I imagine that this is something that will improve over time. The big problems for a search engine, responsiveness and interface, are already solved, as SearchMe is both lightning fast and beautiful.

If you’re interested, you can go check out their blog or sign up for the private beta. Apparently, the venture is Sequoia-backed, according to TechCrunch, which probably means it’s serious about being a big web search contender in the future. According to Louis Gray, the SearchMe spider is aggressively hitting his blog, too. It will be interesting to come back in a year and see how SearchMe has evolved. The most likely outcome is an acquisition by one of the big four (Facebook, Google, Yahoo, or Microsoft), since it’s hard to imagine unseating any of them in the popular mindset.

Google Supplemental Link Units

Posted in Google, Optimization, Search by Elliott Back on March 11th, 2008.

Oh yeah, I’m a I’m a baller! You know you’ve made it when you get your own supplemental link unit section from Google! I’ve been waiting a long time for these, and now I’ve finally got them, even if they are a little bit incorrect. I think I’ve got some 301 redirects that need to be changed…

elliott-back-supplemental-links.png
