<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Elliott C. Back &#187; Search</title>
	<atom:link href="http://elliottback.com/wp/category/computers-technology/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://elliottback.com/wp</link>
	<description>Internet &#38; Technology</description>
	<lastBuildDate>Tue, 03 Nov 2009 23:59:21 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Cuil Sucks At Search (Go Google!)</title>
		<link>http://elliottback.com/wp/cuil-sucks-at-search-go-google/</link>
		<comments>http://elliottback.com/wp/cuil-sucks-at-search-go-google/#comments</comments>
		<pubDate>Wed, 30 Jul 2008 02:16:20 +0000</pubDate>
		<dc:creator>Elliott Back</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://elliottback.com/wp/archives/2008/07/29/cuil-sucks-at-search-go-google/</guid>
		<description><![CDATA[I love the idea behind Cuil, the latest search engine in a long list of failures (Mahalo, Ask, Powerset) to challenge Google.  As Mashable explains, they are pulling out all the stops to hit Google from multiple directions across their core search competency:
Enter Cuil, a very serious competitor, packed with ex-Googlers (Tom Costello and [...]]]></description>
			<content:encoded><![CDATA[<p>I love the <em>idea</em> behind <a href="http://www.cuil.com/">Cuil</a>, the latest search engine in a long list of failures (Mahalo, Ask, Powerset) to challenge Google.  As <a href="http://mashable.com/2008/07/28/cuil-search-google/">Mashable explains</a>, they are pulling out all the stops to hit Google from multiple directions across their core search competency:</p>
<blockquote><p>Enter Cuil, a very serious competitor, packed with ex-Googlers (Tom Costello and Anna Patterson are the backbone of Cuil, and they’ve both worked at Google), and claiming to have the largest index of websites &#8211; 120 billion &#8211; in the world.</p>
<p>It doesn’t end there: Cuil pulls pretty much every trick in the book. Big claims about the biggest index, privacy concerns (IP addresses of users aren’t saved, making it impossible for a third party to request it from them), semi-semantic approach (Cuil’s engine recognizes the relations between certain words on a web site, which helps it rank pages better). Hell, they even pulled the energy-saving trick: the front page of Cuil is completely black, in contrast to Google’s eye-poking whiteness. </p></blockquote>
<p>Check out the <a href="http://tech.slashdot.org/article.pl?sid=08/07/28/068211&#038;from=rss">Slashdottie thread</a> for more discussion.  I&#8217;m not interested in going there; rather I&#8217;m more concerned with how relevant the results from Cuil are, compared to Google, in a stricter context of information retrieval.  After all, a search engine is about finding information.</p>
<p>Let&#8217;s start with a query &#8220;how to rip a dvd&#8221; in <a href="http://www.cuil.com/search?q=how%20to%20rip%20a%20dvd&#038;sl=long">Cuil</a> and <a href="http://www.google.com/search?q=how+to+rip+a+dvd">Google</a>:</p>
<p><strong><a href="http://www.cuil.com/search?q=how%20to%20rip%20a%20dvd&#038;sl=long">Cuil on &#8220;How To Rip a DVD&#8221;</a></strong></p>
<p><img id="image2704" src="http://elliottback.com/wp/wp-content/uploads/2008/07/cuil-how-to-rip-a-dvd.png" alt="cuil-how-to-rip-a-dvd.png" /></p>
<p>4 of the 9 total results are spam from Ebooksbay.  An additional 4 are for converting MP3s.  The final result (which is quite spammy) is for ripping DVDs to a variety of formats.  <strong>Score: 11%</strong>.</p>
<p><strong><a href="http://www.google.com/search?q=how+to+rip+a+dvd">Google on &#8220;How To Rip a DVD&#8221;</a></strong></p>
<p><img id="image2705" src="http://elliottback.com/wp/wp-content/uploads/2008/07/google-how-to-rip-a-dvd.png" alt="google-how-to-rip-a-dvd.png" /></p>
<p>Google gives you 7 DVD ripping guides, and three spams site of ripping software.  Essentially, you have to give it a <strong>Score: 100%</strong>, since it&#8217;s pretty much the baseline in our test.  Just based on what I&#8217;ve seen so far, this will be a comparison not of relative merits, but of how much less relevant the results from Cuil are compared to Google.</p>
<p><strong><a href="http://www.cuil.com/search?q=ConcurrentHashMap&#038;sl=long">Cuil on &#8220;ConcurrentHashMap&#8221;</a></strong></p>
<p><img id="image2706" src="http://elliottback.com/wp/wp-content/uploads/2008/07/cuil-concurrenthashmap.png" alt="cuil-concurrenthashmap.png" /></p>
<p>Wait, what is that in the rightmost result!!!?  Yes, that winsome young woman is carefully inspecting a ConcurrentHashMap!  Ahm, bad image / search results correlations aside, the search listings fail to list the authority Java documentation source (Sun&#8217;s website) and instead list 2 mirrors (java 5 and 6), 4 bug reports, 3 mailing list discussions, and 2 random libraries with a similarly named class.  <strong>Score: 50%</strong>.</p>
<p><strong><a href="http://www.google.com/search?q=ConcurrentHashMap">Google on &#8220;ConcurrentHashMap&#8221;</a></strong></p>
<p><img id="image2707" src="http://elliottback.com/wp/wp-content/uploads/2008/07/google-concurrenthashmap.png" alt="google-concurrenthashmap.png" /></p>
<p>Google nicely gives us the Sun Java page as the first result, 2 snippets of code using this class, 6 guides to using concurrent hash maps, a benchmark, one of the same random libraries as Cuil (Oswego), and a different random library (backport-util).  I&#8217;d give them <strong>Score: 80%</strong> at this task.</p>
<p>Anyway, I&#8217;m getting tired of writing this.  Cuil just doesn&#8217;t deliver fast, consistent, high-quality search results.  The relevance is quite low, in spite of the interface improvements and searching / clustering / recommendation features.</p>
]]></content:encoded>
			<wfw:commentRss>http://elliottback.com/wp/cuil-sucks-at-search-go-google/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>SearchMe: Visual, Clustering search</title>
		<link>http://elliottback.com/wp/searchme-visual-clustering-search/</link>
		<comments>http://elliottback.com/wp/searchme-visual-clustering-search/#comments</comments>
		<pubDate>Sun, 27 Apr 2008 05:28:44 +0000</pubDate>
		<dc:creator>Elliott Back</dc:creator>
				<category><![CDATA[Apple]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Interface]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://elliottback.com/wp/archives/2008/04/27/searchme-visual-clustering-search/</guid>
		<description><![CDATA[The more I look at visual search engine SearchMe, the more I like it.  In a way that text-based search engine Google has never done, SearchMe brings thumbnails to search results without losing any of the textual indicators we need to process relevance.  SearchMe is also innovating in clustering search results into categories [...]]]></description>
			<content:encoded><![CDATA[<p>The more I look at visual search engine <a href="http://searchme.com">SearchMe</a>, the more I like it.  In a way that text-based search engine Google has never done, SearchMe brings thumbnails to search results without losing any of the textual indicators we need to process relevance.  SearchMe is also innovating in clustering search results into categories or topics, something Google has experimented with <a href="http://labs.google.com/sets">their sets demo</a> but never implemented into the larger search engine.  Perhaps the best way to show you how much more relevant SearchMe can be is through a short example, searching for &#8220;Obama.&#8221;</p>
<p><img id="image2618" src="http://elliottback.com/wp/wp-content/uploads/2008/04/searchme-obama-1.png" alt="searchme-obama-1.png" /></p>
<p>The first thing I get, as I type &#8220;Obama,&#8221; is a list of categories that SearchMe finds relevant.  I click on &#8220;Politicians&#8221; and it takes me to the next screen, the main area for exploring search results:</p>
<p><img id="image2619" src="http://elliottback.com/wp/wp-content/uploads/2008/04/searchme-obama-2.png" alt="searchme-obama-2.png" /></p>
<p>There are a few features you should note that set the SearchMe results apart from their competition.  First, they keep the list of categories you&#8217;re interested in just one click away from instant filtering at the top of the results.  Second, all of the available space of the page is filled with a gigantic preview of the search results.  The title of the website is shown at the bottom, along with the site URL when you mouseover the results.  Essentially, their search results are a better version of Apple&#8217;s coverflow, applied to websites.  Clicking on a preview will take you directly to the page of interest, in the same tab, just like most search engines do today.</p>
<p><img id="image2621" src="http://elliottback.com/wp/wp-content/uploads/2008/04/searchme-scientology.png" alt="searchme-scientology.png" /></p>
<p>Their dynamic snippets code is nice, as well, highlighting the search terms you used in multiple colours.  It appears to have been implemented directly in the coverflow-like flash engine, or behind the scenes is coming back as a new layer of image, as it loads only after the high resolution preview has loaded.  An unfortunate side-effect of their highlighting algorithm is that when searching for multiple words, like &#8220;Calderon de la Barca,&#8221; the words will be highlighted separately, even if found next to each other.</p>
<p><img id="image2620" src="http://elliottback.com/wp/wp-content/uploads/2008/04/searchme-china.png" alt="searchme-china.png" /></p>
<p>Not all their results work well; for example, searching for &#8220;China&#8221; leads me into irrelevance, regardless of the category I choose, and also brings up this half-rendered view of NBA China, that my own browser renders properly.  Other search terms also return odd categories and funny previews, but I imagine that this is something that will improve over time.  The big problems for a search engine, responsiveness and interface, are already solved as SearchMe is both lightning fast and beautiful.</p>
<p>If you&#8217;re interested, you can go <a href="http://blog.searchme.com/">check out their blog</a> or signup to the <a href="http://beta.searchme.com/">private beta</a>.  Apparently, <a href="http://www.techcrunch.com/2008/03/11/searchme-launches-new-search-engine-with-heavy-backing-from-sequoia/">the venture is Sequoia backed</a>, according to Techcrunch, which probably means it&#8217;s serious about being a big web search contender in the future.  According to Louis Grey, the <a href="http://www.louisgray.com/live/2008/02/limelight-networks-searchme-spider.html">searchme spider is aggressively hitting his blog</a>, too.  It will be interesting to come back and a year and see how SearchMe has evolved.  The most likely outcome for this is being acquired by one of the big four&#8211;Facebook, Google, Yahoo, or Microsoft&#8211;since it&#8217;s hard to imagine unseating any of them in the popular mindset.</p>
]]></content:encoded>
			<wfw:commentRss>http://elliottback.com/wp/searchme-visual-clustering-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Supplemental Link Units</title>
		<link>http://elliottback.com/wp/google-supplemental-link-units/</link>
		<comments>http://elliottback.com/wp/google-supplemental-link-units/#comments</comments>
		<pubDate>Tue, 11 Mar 2008 22:45:07 +0000</pubDate>
		<dc:creator>Elliott Back</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Optimization]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://elliottback.com/wp/archives/2008/03/11/google-supplemental-link-units/</guid>
		<description><![CDATA[Oh yeah, I&#8217;m a I&#8217;m a baller!  You know you&#8217;ve made it when you get your own supplemental link unit section from Google!  I&#8217;ve been waiting a long time for these, and now I&#8217;ve finally got them, even if they are a little bit incorrect.  I think I&#8217;ve got some 301 redirects [...]]]></description>
			<content:encoded><![CDATA[<p>Oh yeah, I&#8217;m a I&#8217;m a baller!  You know you&#8217;ve made it when you get your own supplemental link unit section from Google!  I&#8217;ve been waiting a long time for these, and now I&#8217;ve finally got them, even if they are a little bit incorrect.  I think I&#8217;ve got some 301 redirects that need to be changed&#8230;</p>
<p><img id="image2578" src="http://elliottback.com/wp/wp-content/uploads/2008/03/elliott-back-supplemental-links.png" alt="elliott-back-supplemental-links.png" /></p>
]]></content:encoded>
			<wfw:commentRss>http://elliottback.com/wp/google-supplemental-link-units/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Ask&#8217;s Search Suggest is Hilarious</title>
		<link>http://elliottback.com/wp/asks-search-suggest-is-hilarious/</link>
		<comments>http://elliottback.com/wp/asks-search-suggest-is-hilarious/#comments</comments>
		<pubDate>Fri, 02 Nov 2007 23:45:33 +0000</pubDate>
		<dc:creator>Elliott Back</dc:creator>
				<category><![CDATA[Humour]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://elliottback.com/wp/archives/2007/11/02/asks-search-suggest-is-hilarious/</guid>
		<description><![CDATA[If you go to Ask.com and start typing, it will try to help you find what you&#8217;re looking for by displaying a list of suggestions it thinks are relevant to your query (and give the arrogance of Ask&#8217;s marketing department, probably your life as well).
But not every suggestion is a hit; most are ok, reasonable [...]]]></description>
			<content:encoded><![CDATA[<p>If you go to <a href="http://www.ask.com/?o=0">Ask.com</a> and start typing, it will try to help you find what you&#8217;re looking for by displaying a list of suggestions it thinks are relevant to your query (and give the arrogance of Ask&#8217;s marketing department, probably your life as well).</p>
<p>But not every suggestion is a hit; most are ok, reasonable extensions, but in <em>nearly every</em> phrase I type in there is something hilarious.  Take, for example, &#8220;learn to speak&#8221; which has the usual suspects Spanish, English, Italian, German, and the most unusual <strong>Gibberish</strong>:</p>
<p><img id="image2471" src="http://elliottback.com/wp/wp-content/uploads/2007/11/ask-gibberish.png" alt="ask-gibberish.png" /><br />
<small>If you ask me, it&#8217;s the most expressive language in the world</small></p>
<p>While most people want to learn to play a guitar, freestyle, or draw, little Johnny is learning to <strong>levitate objects</strong>:</p>
<p><img id="image2472" src="http://elliottback.com/wp/wp-content/uploads/2007/11/ask-learn-to.png" alt="ask-learn-to.png" /><br />
<small>With his mind, what else?</small></p>
<p>Americans love to sue you.  They&#8217;ll do it for wrongful imprisonment, slander, or just about any kind of distress they can of, including <strong>tubal ligation failure</strong>:</p>
<p><img id="image2474" src="http://elliottback.com/wp/wp-content/uploads/2007/11/ask-sue-for.png" alt="ask-sue-for.png" /><br />
<small>I&#8217;d rather check her phone logs&#8230;</small></p>
<p>Ask&#8217;ers don&#8217;t know much about themselves, so naturally they don&#8217;t know if they are pregnant or not, as seen in this query for &#8220;how can i:&#8221;</p>
<p><img id="image2475" src="http://elliottback.com/wp/wp-content/uploads/2007/11/ask-pregnant.png" alt="ask-pregnant.png" /><br />
<small>Also, your dog is expecting!</small></p>
<p>A typo on &#8220;how do i find&#8221; led me to this gem of <strong>feng shui confusion</strong>:</p>
<p><img id="image2476" src="http://elliottback.com/wp/wp-content/uploads/2007/11/ask-fung.png" alt="ask-fung.png" /><br />
<small>Since when is this a verb?</small></p>
<p>I saved the best for last, which is the auto-complete results for &#8220;is it legal to,&#8221; a horrifying look into modern culture:</p>
<p><img id="image2477" src="http://elliottback.com/wp/wp-content/uploads/2007/11/ask-is-it-legal.png" alt="ask-is-it-legal.png" /><br />
<small>The answer is a categorical NO!</small></p>
<p>There&#8217;s an indication that this data is filtered&#8211;try starting lesbian, for example&#8211;so I&#8217;m not sure if I believe these results.  My operating assumption is that this is a marketing gimmick to try and get some churn on the blogosphere and free PR.  It wouldn&#8217;t be hard to drop in a single unlikely phrase to spice things up, now would it?</p>
<p>p.s., don&#8217;t type &#8220;how should I&#8221; or &#8220;can I sell my&#8221; or &#8220;ask me about&#8221; because they&#8217;re sophomoric.  I bet Google&#8217;s users are equally idiotic, but they certainly don&#8217;t go out of their way to show that to us!</p>
]]></content:encoded>
			<wfw:commentRss>http://elliottback.com/wp/asks-search-suggest-is-hilarious/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Google Pagerank Falls on Paid Links, Blogs</title>
		<link>http://elliottback.com/wp/google-pagerank-falls-on-paid-links-blogs/</link>
		<comments>http://elliottback.com/wp/google-pagerank-falls-on-paid-links-blogs/#comments</comments>
		<pubDate>Thu, 25 Oct 2007 15:53:25 +0000</pubDate>
		<dc:creator>Elliott Back</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://elliottback.com/wp/archives/2007/10/24/google-pagerank-falls-on-paid-links-blogs/</guid>
		<description><![CDATA[The blogosphere today is in collective shock after Google downgraded the pagerank of many leading blogs and news sources.  The response tends to fall into several categories: we knew it was coming, pagerank doesn&#8217;t matter, and we deserved it.  Techcrunch does a pretty good job of examining the evidence behind the update:
The only [...]]]></description>
			<content:encoded><![CDATA[<p>The blogosphere today is in <a href="http://andybeard.eu/2007/10/pagerank-update.html">collective shock</a> after Google downgraded the pagerank of many leading blogs and news sources.  The response tends to fall into several categories: <a href="http://www.copyblogger.com/thanks-google/">we knew it was coming</a>, <a href="http://www.searchengineguide.com/jennifer-laycock/anyone-have-some-boots-i-could-borrow.php">pagerank doesn&#8217;t matter</a>, and <a href="http://www.webpronews.com/blogtalk/2007/10/24/google-page-ranks-been-dead-for-quite-some-time">we deserved it</a>.  Techcrunch does a pretty good job of examining the <a href="http://www.techcrunch.com/2007/10/24/google-declares-jihad-on-blog-link-farms/">evidence behind the update</a>:</p>
<blockquote><p>The only clear change appears to be among large scale blog networks and similar link farms, where each site in the network provides hundreds of outgoing links on each page of the blog to other blogs in the network, in some cases creating tens, even hundred of thousands of cross links. Previously such behavior has been rewarded by Google with high page rank, although it would now appear that this loop hole may now be shut.</p></blockquote>
<p>Here&#8217;s a table of pagerank changes organized by the percent difference:</p>
<table style="width:100%;">
<tr>
<td style="background:#ffabab; padding:5px; text-align:center;">Pagerank -4</td>
<td style="background:#fff8a4; padding:5px; text-align:center;">Pagerank -3</td>
<td style="background:#c4ffa4; padding:5px; text-align:center;">Pagerank -2</td>
</tr>
<tr>
<td style="background:#ffabab; padding:5px; text-align:center;">Statcounter</td>
<td style="background:#fff8a4; padding:5px; text-align:center;">SEO Rountable<br />Search Engine Journal<br />Quickonline Tips</td>
<td style="background:#c4ffa4; padding:5px; text-align:center;">Forbes<br />SF Gate<br />The Washington Post<br />Engadget<br />The Blog Herald<br />Autoblog<br />Problogger<br />Joystiq<br />The Unofficial Apple Weblog</td>
</tr>
</table>
<p>An interesting tidbit comes from <a href="http://www.syntagmamedia.com/2007/10/24/google-attacks-its-competitors-with-pr-meltdown/">Syntagma</a> who note that &#8220;the majority of these decreases happened after a human review.&#8221;  So, it might not be easy for you to fix your linking strategy and regain Pagerank automatically.</p>
<p>Ironically, this coincides with <a href="http://googlified.com/2007goog-at-666/">GOOG hitting $666</a> today.  And, <a href="http://svextra.com/blogs/gmsv/2007/10/googles_reappraisal_leaves_many_sites_pagerankled.html">Silicon Valley</a> is calling us &#8220;Pagerankled.&#8221;  For you people out there running blogs, an immediate solution is the following:</p>
<ul>
<li>Make sure you nofollow any links that you don&#8217;t directly control</li>
<li>Avoid using static link-farms like directories, like linking to every blog in your network from every page</li>
<li>Don&#8217;t let your commenters add links to their sites</li>
</ul>
<p>Here&#8217;s an example of the link distribution of my site after I&#8217;ve properly annotated some links with nofollow:</p>
<p><img id="image2461" src="http://elliottback.com/wp/wp-content/uploads/2007/10/link-types.png" alt="link-types.png" /></p>
<p>The green areas (header, footer, content, and some meta data) represent regular links, the red areas (advertising, sidebar links, tags, and related stories) are nofollow links, and the blue areas are dynamic links (javascript widgets) which don&#8217;t need updating.  I am not sure if I want to nofollow anything else&#8211;what do you think?</p>
<p><strong>Update:</strong> Forbes <a href="http://www.forbes.com/home/technology/2007/10/24/google-web-traffic-tech-cx_ag_1024google2.html">weighs in</a>, &#8220;it could also be Google simply taking into account the growth of the Internet.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://elliottback.com/wp/google-pagerank-falls-on-paid-links-blogs/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
