<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.2" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: To Crawl or not to Crawl?  The Alts Speak Out!</title>
	<link>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/</link>
	<description>The most wonderful search engines you've never seen!</description>
	<pubDate>Mon, 13 Oct 2008 17:07:04 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.2</generator>

	<item>
		<title>By: Alt Search Engines &#187; Blog Archive &#187; Guest Author: Crawling and Indexing the Web</title>
		<link>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6868</link>
		<author>Alt Search Engines &#187; Blog Archive &#187; Guest Author: Crawling and Indexing the Web</author>
		<pubDate>Thu, 06 Sep 2007 16:06:50 +0000</pubDate>
		<guid>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6868</guid>
		<description>[...] Today we are very fortunate to have another blogger&#8217;s perspective on the post that we did on alternative search engines, asking which of the &#8221;Alts&#8221; crawled the web and why:&#8220;To crawl or not to crawl? The Alts speak out!&#8221;  [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] Today we are very fortunate to have another blogger&#8217;s perspective on the post that we did on alternative search engines, asking which of the &#8221;Alts&#8221; crawled the web and why:&#8220;To crawl or not to crawl? The Alts speak out!&#8221;  [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tom Dibaja</title>
		<link>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6160</link>
		<author>Tom Dibaja</author>
		<pubDate>Tue, 28 Aug 2007 16:47:35 +0000</pubDate>
		<guid>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6160</guid>
		<description>Surprising words from Migoa.... surely they ought to know that we - Properazzi - have a proprietary crawler. Not least because we're the world's largest real estate search engine and we're both based in Barcelona! :) 

Plus, there's at least half-a-dozen other real estate sites in Europe with crawlers...</description>
		<content:encoded><![CDATA[<p>Surprising words from Migoa&#8230;. surely they ought to know that we - Properazzi - have a proprietary crawler. Not least because we&#8217;re the world&#8217;s largest real estate search engine and we&#8217;re both based in Barcelona! <img src='http://altsearchengines.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>Plus, there&#8217;s at least half-a-dozen other real estate sites in Europe with crawlers&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Oli</title>
		<link>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6126</link>
		<author>Oli</author>
		<pubDate>Mon, 27 Aug 2007 20:33:40 +0000</pubDate>
		<guid>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6126</guid>
		<description>If they'd all crawl, some small sites may get the majority of their traffic from spiders ;-) 

Anyway, IMO it makes more sense for a small SE to just try to directly work on Alexa's indices/data repositories than crawling _yet again_ and collecting all the data all others are collecting as well (what a waste of bandwidth). With more and more SEs coming out, there must be a market for crawled data, particularly pre-extracted data (e.g. extracted text from pdf, OCRed text, speech-to-text on audio files, other "strange" document formats, etc. which is harder to collect than plain vanilla HTML).</description>
		<content:encoded><![CDATA[<p>If they&#8217;d all crawl, some small sites may get the majority of their traffic from spiders <img src='http://altsearchengines.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p>Anyway, IMO it makes more sense for a small SE to just try to directly work on Alexa&#8217;s indices/data repositories than crawling _yet again_ and collecting all the data all others are collecting as well (what a waste of bandwidth). With more and more SEs coming out, there must be a market for crawled data, particularly pre-extracted data (e.g. extracted text from pdf, OCRed text, speech-to-text on audio files, other &#8220;strange&#8221; document formats, etc. which is harder to collect than plain vanilla HTML).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tommy Chieng</title>
		<link>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6103</link>
		<author>Tommy Chieng</author>
		<pubDate>Mon, 27 Aug 2007 05:12:55 +0000</pubDate>
		<guid>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6103</guid>
		<description>Thanks for visiting my blog &#62;&#62; http://www.crispnetworks.com

Your blog looks interesting. Definitely a source for me to know more about alternate search engines

Cheers.</description>
		<content:encoded><![CDATA[<p>Thanks for visiting my blog &gt;&gt; <a href="http://www.crispnetworks.com" rel="nofollow">http://www.crispnetworks.com</a></p>
<p>Your blog looks interesting. Definitely a source for me to know more about alternate search engines</p>
<p>Cheers.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yakov</title>
		<link>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6102</link>
		<author>Yakov</author>
		<pubDate>Mon, 27 Aug 2007 04:17:35 +0000</pubDate>
		<guid>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6102</guid>
		<description>To crawl or not to crawl is more a business question that anything else. For those search engines that want to create the most long-term value for their shareholders, it is a 'must have' to be independent from the others, i.e. to crawl and index the Web.</description>
		<content:encoded><![CDATA[<p>To crawl or not to crawl is more a business question that anything else. For those search engines that want to create the most long-term value for their shareholders, it is a &#8216;must have&#8217; to be independent from the others, i.e. to crawl and index the Web.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: University Update - Open Source - To Crawl or not to Crawl? The Alts Speak Out!</title>
		<link>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6097</link>
		<author>University Update - Open Source - To Crawl or not to Crawl? The Alts Speak Out!</author>
		<pubDate>Sun, 26 Aug 2007 23:47:26 +0000</pubDate>
		<guid>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6097</guid>
		<description>[...]                           To Crawl or not to Crawl? The Alts Speak Out! &#187;  This Summary is from an article posted at Alt Search Engines  on Sunday, August 26, 2007     This [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;]                           To Crawl or not to Crawl? The Alts Speak Out! &#187;  This Summary is from an article posted at Alt Search Engines  on Sunday, August 26, 2007     This [&#8230;]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: University Update - Yahoo - To Crawl or not to Crawl? The Alts Speak Out!</title>
		<link>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6091</link>
		<author>University Update - Yahoo - To Crawl or not to Crawl? The Alts Speak Out!</author>
		<pubDate>Sun, 26 Aug 2007 21:46:47 +0000</pubDate>
		<guid>http://altsearchengines.com/2007/08/26/to-crawl-or-not-to-crawl-the-alts-speak-out/#comment-6091</guid>
		<description>[...]                           To Crawl or not to Crawl? The Alts Speak Out! &#187;  This Summary is from an article posted at Alt Search Engines  on Sunday, August 26, 2007     This [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;]                           To Crawl or not to Crawl? The Alts Speak Out! &#187;  This Summary is from an article posted at Alt Search Engines  on Sunday, August 26, 2007     This [&#8230;]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
