<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: AdSense Bot Working Overtime</title>
	<atom:link href="http://www.3dogmedia.com/adsense-bot-working-overtime/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.3dogmedia.com/adsense-bot-working-overtime/</link>
	<description></description>
	<lastBuildDate>Thu, 17 Dec 2009 08:46:34 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: vitaplease</title>
		<link>http://www.3dogmedia.com/adsense-bot-working-overtime/#comment-1223</link>
		<dc:creator>vitaplease</dc:creator>
		<pubDate>Thu, 20 Apr 2006 19:53:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.gregboser.com/2006/04/14/adsense-bot-working-overtime/#comment-1223</guid>
		<description>Nice find WG,

So will the trend be adding Adsense blended into hiding for quicker indexing of content change? (not that I see anything explicitly wrong with mediabot helping out indexing).

A while ago:

&lt;a href=&quot;http://www.webmasterworld.com/forum89/14-2-16.htm&quot; rel=&quot;nofollow&quot;&gt;
http://www.webmasterworld.com/forum89/14-2-16.htm&lt;/a&gt;

Googleguy:

&lt;blockquote&gt;It&#039;s not a quicker way into the index, vitaplease. It&#039;s a separate crawl&lt;/blockquote&gt;</description>
		<content:encoded><![CDATA[<p>Nice find WG,</p>
<p>So will the trend be adding Adsense blended into hiding for quicker indexing of content change? (not that I see anything explicitly wrong with mediabot helping out indexing).</p>
<p>A while ago:</p>
<p><a href="http://www.webmasterworld.com/forum89/14-2-16.htm" ><br />
</a><a href="http://www.webmasterworld.com/forum89/14-2-16.htm" >http://www.webmasterworld.com/forum89/14-2-16.htm</a></p>
<p>Googleguy:</p>
<blockquote><p>It&#8217;s not a quicker way into the index, vitaplease. It&#8217;s a separate crawl</p></blockquote>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Thies</title>
		<link>http://www.3dogmedia.com/adsense-bot-working-overtime/#comment-1222</link>
		<dc:creator>Dan Thies</dc:creator>
		<pubDate>Tue, 18 Apr 2006 22:15:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.gregboser.com/2006/04/14/adsense-bot-working-overtime/#comment-1222</guid>
		<description>Yeah, it &lt;a href=&quot;http://www.shoemoney.com/2006/04/18/matt-cutts-confirms-media-bot-crawling-for-big-daddy/&quot; rel=&quot;nofollow&quot;&gt;looks like I guessed wrong&lt;/a&gt;. Thanks for posting that clarification, Greg. What a really, really stupid idea Google has had here. I misunderestimated them.

I&#039;m sure you&#039;re quaking in your black boots about &quot;Matt also stated that you will gain zero advantage in search listings however if you are serving different content to MediaBot then to Googlebot then you could be in trouble.&quot; Yep, quaking and shivering. :D</description>
		<content:encoded><![CDATA[<p>Yeah, it <a href="http://www.shoemoney.com/2006/04/18/matt-cutts-confirms-media-bot-crawling-for-big-daddy/" >looks like I guessed wrong</a>. Thanks for posting that clarification, Greg. What a really, really stupid idea Google has had here. I misunderestimated them.</p>
<p>I&#8217;m sure you&#8217;re quaking in your black boots about &#8220;Matt also stated that you will gain zero advantage in search listings however if you are serving different content to MediaBot then to Googlebot then you could be in trouble.&#8221; Yep, quaking and shivering. <img src='http://www.3dogmedia.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: WebGuerrilla</title>
		<link>http://www.3dogmedia.com/adsense-bot-working-overtime/#comment-1221</link>
		<dc:creator>WebGuerrilla</dc:creator>
		<pubDate>Tue, 18 Apr 2006 21:12:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.gregboser.com/2006/04/14/adsense-bot-working-overtime/#comment-1221</guid>
		<description>&lt;blockquote&gt;&lt;em&gt;My guess is that the Adsense bot isn&#039;t grabbing pages to be indexed, just updating the cached version.&lt;/em&gt;&lt;/blockquote&gt;

The search results are reflecting the content of the cached page. It would certainly be a bit easier to tell if the mediabot template had words on it that were not included in the googlebot template, (which isn&#039;t the case) but you can still see that what is indexed and what is cached is the same because the googlebot does have unique words on it.

&lt;strong&gt;Example:&lt;/strong&gt;

Search for &lt;a href=&quot;http://www.google.com/search?&amp;q=godaddy+sucks&quot; rel=&quot;nofollow&quot;&gt;godaddy sucks&lt;/a&gt;.

The #6 listing is a post that got hit by the mediabot. Notice that the title for the listing is being generated by the first words on the page. That&#039;s because the mediabot template doesn&#039;t have any page titles or heading tags.

Now do a search for &lt;a href=&quot;http://www.google.com/search?&amp;q=%22why+godaddy+sucks%22&quot; rel=&quot;nofollow&quot;&gt;&quot;why godaddy sucks&quot;&lt;/a&gt; (Using quotes)

The exact phrase is now bolded in the url, but the excerpt doesn&#039;t show any other occurences of the exact phrase, despite the fact that the phrase appears in both the title and the heading tag of the page that was served to googlebot.

Now search for &lt;a href=&quot;http://www.google.com/search?&amp;q=%22why+godaddy+sucks%22+%22theme+developed+by+webguerrilla%22&quot; rel=&quot;nofollow&quot;&gt;&quot;why godaddy sucks&quot; + &quot;theme developed by webguerrilla&quot;&lt;/a&gt;

The GoDaddy post isn&#039;t returned because &quot;theme developed by webguerrilla&quot; doesn&#039;t exist on the mediabot template. It only exists on the googlebot template. (You can see it highlighted at the bottom of the cached pages that were returned as a match).

Based on that, I think it&#039;s a bit of a stretch to argue that the cache isn&#039;t a representation of what was indexed.</description>
		<content:encoded><![CDATA[<blockquote><p><em>My guess is that the Adsense bot isn&#8217;t grabbing pages to be indexed, just updating the cached version.</em></p></blockquote>
<p>The search results are reflecting the content of the cached page. It would certainly be a bit easier to tell if the mediabot template had words on it that were not included in the googlebot template, (which isn&#8217;t the case) but you can still see that what is indexed and what is cached is the same because the googlebot does have unique words on it.</p>
<p><strong>Example:</strong></p>
<p>Search for <a href="http://www.google.com/search?&#038;q=godaddy+sucks" >godaddy sucks</a>.</p>
<p>The #6 listing is a post that got hit by the mediabot. Notice that the title for the listing is being generated by the first words on the page. That&#8217;s because the mediabot template doesn&#8217;t have any page titles or heading tags.</p>
<p>Now do a search for <a href="http://www.google.com/search?&#038;q=%22why+godaddy+sucks%22" >&#8220;why godaddy sucks&#8221;</a> (Using quotes)</p>
<p>The exact phrase is now bolded in the url, but the excerpt doesn&#8217;t show any other occurences of the exact phrase, despite the fact that the phrase appears in both the title and the heading tag of the page that was served to googlebot.</p>
<p>Now search for <a href="http://www.google.com/search?&#038;q=%22why+godaddy+sucks%22+%22theme+developed+by+webguerrilla%22" >&#8220;why godaddy sucks&#8221; + &#8220;theme developed by webguerrilla&#8221;</a></p>
<p>The GoDaddy post isn&#8217;t returned because &#8220;theme developed by webguerrilla&#8221; doesn&#8217;t exist on the mediabot template. It only exists on the googlebot template. (You can see it highlighted at the bottom of the cached pages that were returned as a match).</p>
<p>Based on that, I think it&#8217;s a bit of a stretch to argue that the cache isn&#8217;t a representation of what was indexed.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Thies</title>
		<link>http://www.3dogmedia.com/adsense-bot-working-overtime/#comment-1220</link>
		<dc:creator>Dan Thies</dc:creator>
		<pubDate>Tue, 18 Apr 2006 20:12:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.gregboser.com/2006/04/14/adsense-bot-working-overtime/#comment-1220</guid>
		<description>Greg,

My question isn&#039;t about what is being cached, it&#039;s about what is being indexed. Caching and indexing are different things. The cached page isn&#039;t necessarily what is being used to return search results.

Caching = storing an HTML page so that users can see how it looked

Indexing = dissecting the page and storing the word occurences in the search engine&#039;s index

You have shown how the Adsense bot is updating the cached version of a web page. What I am wondering is whether that&#039;s what Google is indexing.

You could determine whether or not this is happening by adding a unique word to the page that you deliver to the Adsense bot, then searching for that word to see if it appears on the page that Google has indexed.

So if you added the words &quot;smoking gun&quot; to this post when you deliver it to the Adsense bot (but not the regular Googlebot), you could search Google for:
inurl:overtime site:google.webguerilla.com smoking gun

My guess is that the Adsense bot isn&#039;t grabbing pages to be indexed, just updating the cached version.</description>
		<content:encoded><![CDATA[<p>Greg,</p>
<p>My question isn&#8217;t about what is being cached, it&#8217;s about what is being indexed. Caching and indexing are different things. The cached page isn&#8217;t necessarily what is being used to return search results.</p>
<p>Caching = storing an HTML page so that users can see how it looked</p>
<p>Indexing = dissecting the page and storing the word occurences in the search engine&#8217;s index</p>
<p>You have shown how the Adsense bot is updating the cached version of a web page. What I am wondering is whether that&#8217;s what Google is indexing.</p>
<p>You could determine whether or not this is happening by adding a unique word to the page that you deliver to the Adsense bot, then searching for that word to see if it appears on the page that Google has indexed.</p>
<p>So if you added the words &#8220;smoking gun&#8221; to this post when you deliver it to the Adsense bot (but not the regular Googlebot), you could search Google for:<br />
inurl:overtime site:google.webguerilla.com smoking gun</p>
<p>My guess is that the Adsense bot isn&#8217;t grabbing pages to be indexed, just updating the cached version.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: WebGuerrilla</title>
		<link>http://www.3dogmedia.com/adsense-bot-working-overtime/#comment-1219</link>
		<dc:creator>WebGuerrilla</dc:creator>
		<pubDate>Tue, 18 Apr 2006 17:14:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.gregboser.com/2006/04/14/adsense-bot-working-overtime/#comment-1219</guid>
		<description>If you mean do I have a screen cap of that page when it was originally indexed by Googlebot, no I don&#039;t. But I can tell you that it was originally indexed properly. We serve the pages from this site to Google in a stripped down &quot;lite&quot; template that is quite different than the template for the media bot.

The Googlebot template retains all the page titles and site navigation. The Mediabot template doesn&#039;t. it just presents the content of the actual post, so it&#039;s easy to determine which bot was indexed the page in the cache.

&lt;a href=&quot;http://www.google.com/search?q=cache%3Ahttp%3A//clueless.webguerrilla.com/search-usability/&quot; rel=&quot;nofollow&quot;&gt;Here&#039;s an example&lt;/a&gt; of what a post looks like when it&#039;s serverd to Googlebot</description>
		<content:encoded><![CDATA[<p>If you mean do I have a screen cap of that page when it was originally indexed by Googlebot, no I don&#8217;t. But I can tell you that it was originally indexed properly. We serve the pages from this site to Google in a stripped down &#8220;lite&#8221; template that is quite different than the template for the media bot.</p>
<p>The Googlebot template retains all the page titles and site navigation. The Mediabot template doesn&#8217;t. it just presents the content of the actual post, so it&#8217;s easy to determine which bot was indexed the page in the cache.</p>
<p><a href="http://www.google.com/search?q=cache%3Ahttp%3A//clueless.webguerrilla.com/search-usability/" >Here&#8217;s an example</a> of what a post looks like when it&#8217;s serverd to Googlebot</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Thies</title>
		<link>http://www.3dogmedia.com/adsense-bot-working-overtime/#comment-1218</link>
		<dc:creator>Dan Thies</dc:creator>
		<pubDate>Tue, 18 Apr 2006 16:55:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.gregboser.com/2006/04/14/adsense-bot-working-overtime/#comment-1218</guid>
		<description>&lt;blockquote&gt;Does the ad bot also fetch and respect robots.txt?&lt;/blockquote&gt;
Yes it does, Jeremy. The problem is that you have to let the Adsense bot fetch a page if you want paid ads to display.

Greg, you&#039;ve posted an example where the Adsense bot is putting pages into the cache. Do you have an example of that version actually being indexed? You know, like where you&#039;ve maybe added the words &quot;smoking gun&quot; to the page you serve to the Adsense bot?</description>
		<content:encoded><![CDATA[<blockquote><p>Does the ad bot also fetch and respect robots.txt?</p></blockquote>
<p>Yes it does, Jeremy. The problem is that you have to let the Adsense bot fetch a page if you want paid ads to display.</p>
<p>Greg, you&#8217;ve posted an example where the Adsense bot is putting pages into the cache. Do you have an example of that version actually being indexed? You know, like where you&#8217;ve maybe added the words &#8220;smoking gun&#8221; to the page you serve to the Adsense bot?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeremy Zawodny</title>
		<link>http://www.3dogmedia.com/adsense-bot-working-overtime/#comment-1217</link>
		<dc:creator>Jeremy Zawodny</dc:creator>
		<pubDate>Tue, 18 Apr 2006 15:14:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.gregboser.com/2006/04/14/adsense-bot-working-overtime/#comment-1217</guid>
		<description>Does the ad bot also fetch and respect robots.txt?</description>
		<content:encoded><![CDATA[<p>Does the ad bot also fetch and respect robots.txt?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk
Page Caching using disk (enhanced) (user agent is rejected)

Served from: www.3dogmedia.com @ 2010-07-31 11:09:25 -->