<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Smackdown! &#187; Cuttisms</title>
	<atom:link href="http://smackdown.blogsblogsblogs.com/category/cuttisms/feed/" rel="self" type="application/rss+xml" />
	<link>http://smackdown.blogsblogsblogs.com</link>
	<description>Smackdown!</description>
	<lastBuildDate>Tue, 22 Nov 2011 22:40:24 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>How Matt Cutts Leveraged The Stack Overflow And Hacker News Communities In Redefining The Phrase &#8220;Content Farms&#8221;</title>
		<link>http://smackdown.blogsblogsblogs.com/2011/01/31/how-matt-cutts-leveraged-the-stack-overflow-and-hacker-news-communities-in-redefining-the-phrase-content-farms/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2011/01/31/how-matt-cutts-leveraged-the-stack-overflow-and-hacker-news-communities-in-redefining-the-phrase-content-farms/#comments</comments>
		<pubDate>Mon, 31 Jan 2011 21:52:15 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[spin]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=892</guid>
		<description><![CDATA[A little over a week ago, on the Friday before last, Matt Cutts, the head of Google&#8217;s Web Spam Team, wrote a post on the Official Google Blog titled &#8220;Google search and search engine spam&#8221;. This post, and the upcoming changes it discussed, were most likely in response to a growing trend of dissatisfaction with [...]]]></description>
			<content:encoded><![CDATA[<p>A little over a week ago, on the Friday before last, Matt Cutts, the head of Google&#8217;s Web Spam Team, wrote a post on the Official Google Blog titled <a href="http://googleblog.blogspot.com/2011/01/google-search-and-search-engine-spam.html" target="_blank">&#8220;Google search and search engine spam&#8221;</a>. This post, and the upcoming changes it discussed, were most likely in response to a <a href="http://www.codinghorror.com/blog/2011/01/trouble-in-the-house-of-google.html" totle="Trouble In the House of Google" target="_blank">growing trend of dissatisfaction with Google&#8217;s results</a> that have been cropping up around the blogosphere. In the post Matt talks about how Google feels that things are in fact not as bad as people are saying, and that &#8220;Google&#8217;s search quality is better than it has ever been in terms of relevance, freshness and comprehensiveness.&#8221; He does say that recently, due to increase in both &#8220;size and freshness&#8221; that of course some spam did get indexed, and also states that as the old, tired, run of the mill spam decreased in Google&#8217;s index that Google will now be shifting it&#8217;s focus on to content that just sucks:</p>
<blockquote><p>As &#8220;pure webspam&#8221; has decreased over time, attention has shifted instead to &#8220;content farms,&#8221; which are sites with shallow or low-quality content. <em>- Matt Cutts</em></p></blockquote>
<p>Whoa. This, especially coming from Matt Cutts, is huge. For those who don&#8217;t know, <a href="http://en.wikipedia.org/wiki/Content_farm" target="_blank">&#8220;content farms&#8221;</a> are <span id="more-892"></span>organizations that generate websites composed of large amounts of low cost &#8220;fluff&#8221; or filler content, with little to no regard to quality. The content is generated not based on having information and the desire to share it, but rather in response to queries that might get typed into a search engine, and are built for search spiders rather than human consumption. They include companies like <a href="http://www.seobook.com/demand-medias-ehow-com-using-interesting-expired-domain-redirect-seo-strategy" target="_blank">Demand Media</a>, <a href="http://smackdown.blogsblogsblogs.com/2010/03/08/mahalo-com-meet-the-new-spam-worse-than-the-old-spam/" target="_blank">Mahalo</a>, and Associated Content.</p>
<p>Historically speaking, Matt has pretty much refused to come right out and say that these content farms were indeed spam, despite the fact that they <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66355" target="_blank">clearly violated Google&#8217;s quality guidelines</a>:</p>
<blockquote><p>Doorway pages are typically large sets of poor-quality pages where each page is optimized for a specific keyword or phrase&#8230; Google&#8217;s aim is to give our users the most valuable and relevant search results. Therefore, we frown on practices that are designed to manipulate search engines and deceive users by directing them to sites other than the ones they selected, and that provide content solely for the benefit of search engines. Google may take action on doorway sites and other sites making use of these deceptive practice, including removing these sites from the Google index. <em>- Google Webmaster Tools Help</em></p></blockquote>
<p>Regardless of the very clear wording of their policies, Google has to date not banned any of these content farms for their violations. In fact, quite the opposite &#8211; Matt has in the past even defended these sites, and in Mahalo&#8217;s case at least given warnings to them which he then allowed them to ignore. He alluded to the fact that one of the algorithm updates from last year, Mayday, was supposed to help filter out &#8220;really kind of lower quality&#8221; sites, and many people thought he must be talking about content farms back then, but alas that <a href="http://smackdown.blogsblogsblogs.com/2010/06/11/was-the-google-mayday-update-a-complete-failure-then/" target="_blank">turned out to be a bust</a>. So when he comes right out and says, hey, you&#8217;ve waited long enough, now we&#8217;re going to target content farms for reals, y&#8217;all, then yeah, that&#8217;s a Pretty Big Deal.</p>
<p>Now, Richard Rosenblatt, the CEO of Demand Media, may be may be in denial about his company being a content farm, but that definition has existed for quite some time, and regardless of what you call it low quality content built specifically for search engines is in violation of Google&#8217;s guidelines. However, he still persists in his belief that as long as you can get some people to call it something else, his <a href="http://mediamemo.allthingsd.com/20110127/demand-media-says-its-getting-along-just-fine-with-google-thank-you-very-much/" target="_blank">&#8220;partnership with Google&#8221;</a> will keep them protected regardless of what happens:</p>
<blockquote><p>This is why our partnership with Google makes sense. 1) We help them fill the gaps in their index, where they don’t have quality content. 2) We’re the largest supplier of all video to YouTube, over two billion views and 3) we’re a large AdSense partner. So our relationship is synergistic, and it’s a great partnership. And it’s a partnership that we’re excited to continue to expand. <em>- Richard Rosenblatt, attempting to give Google&#8217;s PR team a heart attack</em></p></blockquote>
<p>I am guessing that Mr. Rosenblatt missed the section in Matt&#8217;s post where he very specifically discussed the fact that <em>no</em> special partnerships would protect the content mills from these changes:</p>
<blockquote><p>One misconception that we’ve seen in the last few weeks is the idea that Google doesn’t take as strong action on spammy content in our index if those sites are serving Google ads. To be crystal clear:</p>
<ul>
<li>Google absolutely takes action on sites that violate our quality guidelines regardless of whether they have ads powered by Google;</li>
<li>Displaying Google ads does not help a site’s rankings in Google; and</li>
<li>Buying Google ads does not increase a site’s rankings in Google’s search results.</li>
</ul>
<p><em> &#8211; Matt Cutts, being crystal clear</em></p></blockquote>
<p>Then Friday rolls around, and <a href="http://www.mattcutts.com/blog/algorithm-change-launched/" target="_blank">Matt announces that these changes already happened earlier in the week</a>. If you didn&#8217;t notice any changes, then that&#8217;s probably because, according to Matt, less than half of a percent of queries would show any perceptible ranking differences. If you didn&#8217;t notice any changes in queries involving content farms, well&#8230; as near as I can tell that is because there weren&#8217;t any. In fact, in his announcement post Matt doesn&#8217;t even use the phrase &#8220;content farms&#8221; at all, and instead only discusses that the net effect of these changes is that in cases where content was scraped, searchers are more likely to see the original content first. He then thanks <a href="http://www.codinghorror.com/blog/" target="_blank">Jeff Atwood</a> (one of the ones who wrote a story discussing Google&#8217;s decline in quality that had a large audience) and <a href="http://stackoverflow.com/" target="_blank">Stack Overflow&#8217;s team</a> (a site that Jeff co-founded) for their feedback. A few people asked about the omission in the comments, but as of yet anyway Matt has not replied to any of them.</p>
<p>As to the results themselves, for the most part I am seeing what I was seeing before, so that &#8220;less than half of a percent&#8221; doesn&#8217;t surprise me. If you search for [<a href="http://www.google.com/search?q=mcdonalds+coupons" target="_blank">mcdonalds coupons</a>] the #1 site is still a Mahalo page that doesn&#8217;t actually have any coupons on it, and very little original content. If you search for [<a href="http://www.google.com/search?q=mcdonalds+free+salad+coupons" target="_blank">mcdonalds free salad coupons</a>] you get a different Mahalo page that does actually have a picture of a coupon on it (good only in Canada, and expired in July 2010, however), and if you search for [<a href="http://www.google.com/search?q=mcdonalds+happy+meal+coupons" target="_blank">mcdonalds happy meal coupons</a>] the second listing is a Mahalo page, again with no coupons on it. These pages are filled with riveting dialog, such as the section labeled &#8220;McDonalds Happy Meal Coupons Coupon Policies,&#8221; which states:</p>
<blockquote><p>The policies for McDonalds Happy Meal coupons may have certain restrictions and these might include not being able to combine discounts or limiting the period of use. Make sure you read and understand the instructions listed on the coupon carefully to ensure that you know when the coupon will become valid and when it will expire as well as what special restrictions apply. Also included in this information will be which product or products the coupon can be used to purchase. Insuring that you understand the coupon policy can help you to avoid any mistakes during the checkout process. <em>- Content Mahalo actually paid for</em></p></blockquote>
<p>Seriously?</p>
<p>It&#8217;s not just Mahalo, of course&#8230; type in [<a href="http://www.google.com/search?q=how+to+reset+your+blackberry" target="_blank">how to reset your blackberry</a>] and you will find ranking just fine a page from eHow that is nothing more than the phrase <a href="http://www.ehow.com/how_4776425_reset-blackberry-removing-battery.html" target="_blank" rel="nofollow">&#8220;hit alt+right-shift+delete&#8221;</a> wrapped in light, fluffy filler content. I also still see queries where the duplicate content outranks the original, such as the copy of a Wikipedia page that ranks #1 for [<a href="http://www.google.com/search?q=elvett+semic" target="_blank">elvett semic</a>]. The changes, whatever they were, truly are barely (if at all) perceptible. The change was so small that one of Matt&#8217;s readers asked, &#8220;I&#8217;m wondering why announce it if you&#8217;ve gotten the feedback and the algorithm update would presumably be of such little consequence that no one would likely notice or comment on it unless you told everyone.&#8221; Indeed, why make such a big deal out of something when almost no one can tell the difference?</p>
<p>To answer that you need to take a look at exactly what it was that did change. When I search in Google now for questions that were asked on Stack Overflow, at least for the queries I checked, I now see SO ranking instead of sites that scrape their content. This is of course how it should be, and the main concern that the people from that community were <s>bitching</s> giving feedback about to Matt. Stack Overflow is, as I mentioned, the site that was co-founded by Jeff Atwood, who is the author of the much quoted post that generated quite a bit of buzz about Google&#8217;s decline in quality. Many of the frequenters of Stack Overflow are also regulars on <a href="http://news.ycombinator.com/" target="_blank">Hacker News</a> which (not so) coincidentally Matt decided to hold a good portion of the discussion about these changes, both before and after they were implemented. While the HN and SO communities in and of themselves might be tiny compared to the web as a whole, the fact is that their voices do carry within the online community. Start buzz there about Google showing quite a bit of improvement and it has a very good chance of spreading, even if the data set demonstrating that is overall quite small. Add to that the fact that Richard Rosenblatt, CEO of Demand Media <em>knows</em> that the changes aren&#8217;t targeted at his company (and when asked if Google had discussed the changes with him, replies &#8220;I can’t comment on that.&#8221;), and then toss in Jason Calacanis&#8217;s ingratiating comments on Matt&#8217;s blog post about the changes going live:</p>
<blockquote><p>It was clear that Mahalo was getting grouped into the &#8220;content farm&#8221; space&#8230; <em>- Jason Calacanis</em></p></blockquote>
<p>No kidding? Really? Past tense there, eh Jason?</p>
<p>So Matt loosely ties the concepts of &#8220;content farms&#8221; and &#8220;scrapers&#8221; together in a blog post on the official Google Blog, and claims that they are taking action against them. He then announces a change that appears to only affect scraper sites, and furthermore only those scraping a specific dissatisfied community, publicly thanks that community for their help, and then doesn&#8217;t mention the phrase &#8220;content farm&#8221; again. Even though the changes were practically non-existent, there is a good chance that the overall impression from those who don&#8217;t look too closely is that action was indeed taken, and that if what were <em>formerly</em> referred to as content farms are still ranking well, then obviously they must be there for a reason.</p>
<p>From a strategic standpoint it&#8217;s actually rather clever. If I were Google and I needed to conceal special relationships I had with companies (especially if I was thinking that the FTC might want to get involved in my business) then I too would probably try very hard to sway the public opinion about the labels attached to the sites those companies owned, and shift the focus to something I could fix without caring about the damage, and then crowd source a tech community to help spread the impression that things were better. Most people probably won&#8217;t even pay enough attention to notice.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><a href="http://www.seobook.com/images/content-farms.gif" target="_blank"><img src="/images/not-content-farms.gif" onmouseup="hl2l(event);" alt="Were not content farms! No! Moo!" border="0" width="500px"></a><br />
(<em>click to view original</em>)</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2011/01/31/how-matt-cutts-leveraged-the-stack-overflow-and-hacker-news-communities-in-redefining-the-phrase-content-farms/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Matt Cutts Criticizes Deceptive Ads, Doesn&#8217;t Realize Google Is The One Serving Them</title>
		<link>http://smackdown.blogsblogsblogs.com/2011/01/30/matt-cutts-criticizes-deceptive-ads-doesnt-realize-google-is-the-one-serving-them/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2011/01/30/matt-cutts-criticizes-deceptive-ads-doesnt-realize-google-is-the-one-serving-them/#comments</comments>
		<pubDate>Sun, 30 Jan 2011 21:00:56 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=894</guid>
		<description><![CDATA[Yesterday over on Daggle.com Danny Sullivan published a post titled, Of Misleading Acai Berry Ads &#038; Fake Editorial Sites. In the article Danny discuses a rising trend of deceptive marketing practices involving fake news sites, the way they rip people off with products they are selling, and the fact that authority sites such as the [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday over on <a href="http://daggle.com" target="_blank">Daggle.com</a> Danny Sullivan published a post titled, <a href="http://daggle.com/misleading-acai-berry-ads-fake-editorial-sites-2435" target="_blank">Of Misleading Acai Berry Ads &#038; Fake Editorial Sites</a>. In the article Danny discuses a rising trend of deceptive marketing practices involving fake news sites, the way they rip people off with products they are selling, and the fact that authority sites such as the <a href="http://www.latimes.com/" target="_blank">LA Times</a> are the ones carrying these ads, lending them some credibility in the public eye. Danny states in the post that the ads showing are being served by Zedo, and that he wishes the ad network should raise it&#8217;s standards and not allow such blatantly misleading advertising:</p>
<blockquote><p>Personally, I’d like to see Zedo up its standards for the type of ads it will accept. This type of junk shouldn’t be allowed. <em>- Danny Sullivan</em></p></blockquote>
<p>He&#8217;s right, too, the ad networks <em>should</em> be policing this type of deception, by all means. Matt Cutts, Google&#8217;s head of the web spam team, agrees. He <a href="http://twitter.com/mattcutts/status/31751730140024832">tweeted about the story</a>, and also<span id="more-894"></span> commented his take on the matter in the post itself:</p>
<blockquote><p>    My favorite part of the disclaimer for those type of sites is &#8220;This website, and any page on the website, is based loosely off a true story, but has been modified in multiple ways including, but not limited to: the story, the photos, and the comments.&#8221;</p>
<p>    Oh, so I can trust the website except for the story, photos, and comments? In other words, the entire website?</p>
<p>    And if you read the disclaimer carefully, most of these sites promise a &#8220;free trial&#8221; with $1.95 in shipping, but actually set your card up with a recurring subscription. The &#8220;one weird old tip&#8221; ad that I clicked from the L.A. Times mentioned this in the fine print: &#8220;If you do not cancel within seven (7) days of the date that you enroll in the Program, we will charge the same card you provided at enrollment the non-refundable one-year membership fee of $149.95&#8243;. Then they also start charging you $12.95 a month. Grr. <em> &#8211; Matt Cutts, on deceptive &#8220;flat belly&#8221; ads</em></p></blockquote>
<p>Grr, indeed. </p>
<p>Danny also mentions in his post about how &#8220;The ad, unlike Google&#8217;s ads, doesn’t report what ad network is delivering them,&#8221; which if they did would be a form of disclosure. And Danny is right&#8230; except for one thing. Danny derived the fact that the ad was being served by Zedo by examining the url. However, if you view the source on the LA Times article and go to the spot on the page where the ad is showing, you don&#8217;t see the Zedo ad network code. The ad itself is being generated by Javascript that is being pulled from yet another ad network:</p>
<p>&nbsp;</p>
<p><img src="/images/latimes-source-doubleclick.png" onmouseup="hl2l(event);" alt="Doubleclick is the real culprit" border="0"></p>
<p>&nbsp;</p>
<p>The actual ad network that the LA Times has a relationship with, and the ones responsible for what ads show on their site, is Doubleclick. And who owns Doubeclick, you might ask? As most of you probably already know, <a href="http://www.google.com/doubleclick/" target="_blank">Google does</a>, since they <a href="http://www.nytimes.com/2007/04/14/technology/14DoubleClick.html" target="_blank">bought them back in 2007 for $3.1 billion</a>. So obviously not all of the ads Google delivers disclose what network they are from.</p>
<p>It gets better. AdSense, Google&#8217;s flagship advertising network, serves what are known as &#8220;contextual ads&#8221;, where in theory the ad targeting is based on the context of the page contents where the ad blocks are placed. Danny uses AdSense on his site, with one of the blocks being at the very top of the page. Due to the various feeds in the sidebar, the content of the article, and the title, &#8220;Acai Berry&#8221; is mentioned 8 times on that same page. Therefore it is only natural, of course, that this is what we see when we look at the ads being served on the top:</p>
<p>&nbsp;</p>
<p><a href="/images/fake-news-ads-daggle2.png" target="_blank"><img src="/images/fake-news-ads-daggle2-sm.png" onmouseup="hl2l(event);" alt="The worlds most resilient bittorrent site." border="0"></a><br />
(<em>click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>Now, can you guess where that ad leads? That&#8217;s right:</p>
<p>&nbsp;</p>
<p><a href="/images/fake-news-site2.png" target="_blank"><img src="/images/fake-news-site2-sm.png" onmouseup="hl2l(event);" alt="The worlds most resilient bittorrent site." border="0"></a><br />
(<em>click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>It&#8217;s a fake news site identical to the one Danny is discussing, with the same text, layout, and even images embedded in the &#8220;story&#8221;, with the only variation being that the one Danny landed on is &#8220;News 7&#8243;, and this one is &#8220;News 8&#8243;. </p>
<p>What makes this story particularly interesting is that recently Matt Cutts <a href="http://searchengineland.com/mr-cutts-goes-to-washington-61234" target="_blank">visited Washington D.C., lobbying the FTC</a> about Google&#8217;s integrity, trying to convince them that they don&#8217;t require government oversight, and how they could be trusted to police themselves. Google also happens to be in a very unique position to help clean up these kinds of abuses. Not only could they pull these ads from their own vast array of properties, and require their third party partners to do the same, but they could also warn publishers who use networks that continue to promote scams that their sites rankings could suffer, in the same way that they have punished websites in the past for what they said was deceptive marketing, in the form of <a href="http://www.mattcutts.com/blog/hidden-links/" target="_blank">undisclosed paid links</a>. Instead, they themselves appear to be participating in the problem, not the solution.</p>
<p>So, Matt, are you willing to back up your testimony to the FTC about Google&#8217;s integrity, and lobby within your own company to help eradicate deceptive marketing from the web? Do you feel that websites that allow deceptive advertising to be shown on their sites should have their trust revoked? </p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2011/01/30/matt-cutts-criticizes-deceptive-ads-doesnt-realize-google-is-the-one-serving-them/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>zOMG! Jason Calacanis Lied Again?? Shocker!</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/06/21/zomg-jason-calacanis-lied-again-shocker/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/06/21/zomg-jason-calacanis-lied-again-shocker/#comments</comments>
		<pubDate>Mon, 21 Jun 2010 12:45:58 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=799</guid>
		<description><![CDATA[Last Thursday, in response to Matt Cutts stating that he needed more than &#8220;arbitrary inurl searches&#8221; to sway him (which was in turn in response to a Hacker News submission about Mahalo and the plethora of keyword rich domains they were apparently building out) I wrote a post explaining in some detail how the latest [...]]]></description>
			<content:encoded><![CDATA[<p>Last Thursday, in response to Matt Cutts stating that he needed more than &#8220;arbitrary inurl searches&#8221; to sway him (which was in turn in response to a <a href="http://news.ycombinator.com/item?id=1433676" target="_blank">Hacker News submission</a> about Mahalo and the plethora of keyword rich domains they were apparently building out) I wrote a post explaining in some detail how the latest <a href="http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/" target="_blank">Mahalo spam is in fact spam</a>. I demonstrated in the post how Jason had developed a linkfarm which was being used as a link source back to Mahalo.com. It wasn&#8217;t just that the individual sites were all linking back to the mother site, which would in fact be normal, but also that the pages were linking back to specific pages within the main site, pages that in many cases had few, if any, links going to them aside from the ones from this linkfarm.</p>
<p>Each time it happens Matt&#8217;s defense of Mahalo spamming Google just gets more perplexing. In this latest round he started by saying that his job was not to have knee jerk reactions, as if Mahalo hadn&#8217;t already established a <a href="http://www.seobook.com/official-mahalo-com-spam-according-googles-internal-spam-documents" target="_blank">pattern of spamming</a> over a long period of time, and that Matt is pretending he hadn&#8217;t already had a talk with Jason and told him that if he didn&#8217;t raise the bar with his site that <a href="http://outspokenmedia.com/internet-marketing-conferences/ask-the-search-engines/" target="_blank">Google would take action</a> on Mahalo. From there it got even weirder &#8211; Matt looked at the linkfarm and basically told me that a) he didn&#8217;t care as long as it wasn&#8217;t passing link juice, and b) he&#8217;s the only one who could tell if that was the case.</p>
<p>I could have sworn that it was if you were caught <em>trying</em> to spam you were penalized, and you couldn&#8217;t get the penalty removed unless you <em>promised not to do it again</em>. Now, where did I get such a crazy and wild idea? Oh yeah, I remember now&#8230; <span id="more-799"></span><em><a href="http://www.mattcutts.com/blog/reinclusion-request-howto/" target="_blank">it was from Matt Cutts</a></em>:</p>
<blockquote><p>Now we come to the heart of things: what goes into a reinclusion request. Fundamentally, Google wants to know two things: 1) that any spam on the site is gone or fixed, and 2) that it’s not going to happen again. &#8211; <em>Matt Cutts on the bare essentials of a reconsideration request</em></p></blockquote>
<p>The reasons Matt gives out for defending Mahalo seem to be getting more and more creative (even if not more believable). Jason&#8217;s, on the other hand, are the same old song and dance he has been spouting since I first <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">called him on his bs</a> and demonstrated that the <a href="http://smackdown.blogsblogsblogs.com/2010/03/10/dear-jason-calacanis-this-isnt-an-absurd-microscope/" target="_blank">vast majority of his site</a> was nothing more than empty, auto-generated pages. On Thursday&#8217;s post, before he started to lose it with his <a href="http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/comment-page-1/#comment-50660" target="_blank">&#8220;fuck you losers, I&#8217;m rich&#8221;</a> tirade, Jason made this statement:</p>
<blockquote><p>We have humans write pages of at least 300 words. We don’t index 99.99% of pages with < 300 (it would have to be something unique), and we police the system to get short pages up to 300 words within 30 days. - <em>Jason Calacanis, 4 days ago</em></p></blockquote>
<p>Orly? Let&#8217;s take a look at those claims, shall we?</p>
<p>The Mahalo coupon pages are about the crappiest pages I have found on the site. When I was doing my initial investigation I stumbled across quite a few of them. My guess is that [{brand} coupon] generates AdSense blocks with a decent eCPM since they are, after all, &#8220;targeted&#8221; pages. None of the Mahalo &#8220;coupon&#8221; pages actually have any coupons, which of course means that the end user is much more likely to click on one of the ads when they land there, and more required clicks does means a poorer user experience. What content these pages do have is fluff text that gives ample opportunity for Mahalo to link back to itself, and have spammy signals that are easy to spot like when there are near-identical versions of the same topic page, usually by doing one page for &#8220;coupons&#8221; and another for &#8220;printable coupons&#8221; (and no, there is nothing to print out on those pages either). Therefore i picked those as where I would look first to point out, yet again, how Jason was simply pulling these claims out of his ass with no supporting truths behind them.</p>
<p>Digging back into my old data, from March 13th, I was able to determine that from the day the site started adding content up until that point in time Mahalo had amassed 2,655 coupon based pages. When I re-scanned and looked this time I found that there was now 16,601 of these pages. That is a huge increase for 3 months, and a ton of content to create uniquely, even if you ditch quality altogether. Mahalo currently only has a grand total of 90,494 of actual pages on that side of things, so that means 18% of the site is made up of &#8220;coupon&#8221; pages &#8211; and by that I mean coupon pages that don&#8217;t actually <em>have</em> any coupons on them.</p>
<p>What&#8217;s more, it actually looks like there is a chance that 9,932 of those pages were added last week, over a <em>3 day period</em>. How the hell do you get writers to create 9,932 pages of even crappy content, all about <em>coupons</em>, in only 3 days?</p>
<p>As I started looking into it I suddenly understood&#8230; they didn&#8217;t just ditch the quality to create those pages, they went ahead and ditched the <em>content</em>, yet again. I checked over 30 pages, and time after time I found what I found was auto-generated pages that were nothing but ads, affiliate links, and scraper feeds.</p>
<p><a href="http://www.mahalo.com/1800pools-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1800pools-coupons</a>:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-1800Pools-coupons.png" target="_blank"><img src="/images/mahalo-1800Pools-coupons-sm.png" onmouseup="hl2l(event);" alt="Mahalo 1800pools (non)coupons" border="0"></a><br />
(<em>click to view full page screenshot</em>)</p>
<p>&nbsp;</p>
<p><a href="http://www.mahalo.com/tigerdirect-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/tigerdirect-coupons</a>:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-tigerdirect-coupons.png" target="_blank"><img src="/images/mahalo-tigerdirect-coupons-sm.png" onmouseup="hl2l(event);" alt="Mahalo TigerDirect (non)coupons" border="0"></a><br />
(<em>click to view full page screenshot</em>)</p>
<p>&nbsp;</p>
<p><a href="http://www.mahalo.com/topnotchcare-com-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/topnotchcare-com-coupons</a>:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-topnotchcare.com-coupons.png" target="_blank"><img src="/images/mahalo-topnotchcare.com-coupons-sm.png" onmouseup="hl2l(event);" alt="Mahalo TopNotchCare (non)coupons" border="0"></a><br />
(<em>click to view full page screenshot</em>)</p>
<p>&nbsp;</p>
<p>Most of the pages I checked had the affiliate links provided by Savings.com, and most linked to the same two questions pages: one discussing the Outback coupons page, and one discussing &#8220;grocery coupons&#8221;&#8230; and in every case neither question had anything to do with what the actual &#8220;coupon&#8221; page was supposedly about:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-topnotchcare.com-coupons-qna-sm.png" onmouseup="hl2l(event);" alt="Mahalo TopNotchCare coupons questions?" border="0"></p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-1800Pools-coupons-qna-sm.png" onmouseup="hl2l(event);" alt="Mahalo 1800pools coupons questions?" border="0"></p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-tigerdirect-coupons-qna-sm.png" onmouseup="hl2l(event);" alt="Mahalo TigerDirect coupons questions?" border="0"></p>
<p>&nbsp;</p>
<p>For the pages that did not have Savings.com affiliate feeds on them it was because they were using as keywords the names of sites that wouldn&#8217;t actually be Savings.com publishers, like <a href="http://www.gbb.org/" target="_blank">GBB.org</a> and <a href="http://rlsforum.net/" target="_blank" rel="_nofollow">RLS Forum</a>. It looks like Jason somehow got his hands on a list of sites that for some reason or another looked like they <em>might</em> have offered some sort of coupon. These were then dumped into the database in the form of pages, and were then checked to see if they matched up with the Savings.com feed. If they did, great, if not that&#8217;s ok too, they still had AdSense on them &#8211; despite the fact that putting AdSense on pages without actual content is a <a href="" target="_blank">direct violation of Google AdSense policies</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/adsense-policies.png" onmouseup="hl2l(event);" alt="Mahalo violates AdSense policies" border="0"></p>
<p>&nbsp;</p>
<p>That&#8217;s ok though, I am sure Jason doesn&#8217;t care that he is risking the bulk of the site&#8217;s revenue stream by violating the terms of the program, since it looks like the AdSense team is giving him just as much of a pass as the spam team is.</p>
<p>In addition to the pages simply being devoid of content, Jason also uses the tactic of creating near-duplicate versions of some of these pages in order to get the most out of the long-tail phrase variations:</p>
<p><a href="http://www.mahalo.com/1and1-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1and1-coupons</a><br />
<a href="http://www.mahalo.com/1and1-internet-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1and1-internet-coupons</a><br />
<a href="http://www.mahalo.com/1and1-web-hosting-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1and1-web-hosting-coupons</a><br />
<a href="http://www.mahalo.com/1and1affiliate-com-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1and1affiliate-com-coupons</a></p>
<p>Let&#8217;s look at Jason&#8217;s statements again&#8230;</p>
<p>&nbsp;</p>
<p><strong>We have humans write pages</strong></p>
<p>&nbsp;</p>
<p>Well, no. You have humans write <em>some</em> pages, but an assload are still auto-generated. In addition to the ones shown here, Google also says that you still have 13,200 pages that you <a href="http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/" target="_blank">scraped from Wikipedia</a> in their index:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-13.2k-wikipedia-pages.png" onmouseup="hl2l(event);" alt="Mahalo Wikipedia scraped pages" border="0"></p>
<p>&nbsp;</p>
<p>Adding the above auto-generated pages in with the Wikipedia ones, that means that at this point an estimated 33% of the Mahalo content pages are scraped or auto-generated, <em>and that&#8217;s just the stuff that&#8217;s easy to find</em>. Yay footprints.</p>
<p>&nbsp;</p>
<p><strong>of at least 300 words</strong></p>
<p>&nbsp;</p>
<p>Again, no, even on the human generated pages that is not always true. Take a look, for instance, at the 1and1 page on Mahalo.com that all 4 of the above coupons reference:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-1and1.png" onmouseup="hl2l(event);" alt="Mahalo 1and1 (very) short page" border="0"></p>
<p>&nbsp;</p>
<p>Including words of 3 letters and less that page still only has 212 words of human generated content on it. I also pointed out last week that some of the Wikipedia scraped pages remained thin, such as the one on &#8220;The Alice B. Toklas Cookbook&#8221;, which has only 261 words on it.</p>
<p>&nbsp;</p>
<p><strong>We don’t index 99.99% of pages with < 300 [words]</strong></p>
<p>&nbsp;</p>
<p>Bullshit. Not one single one of the pages I examined had a &#8220;noindex&#8221; tag on it, or was blocked by robots.txt. In fact, just the opposite &#8211; every single one of them was pushed to Mahalo&#8217;s sitemap, to make it <em>easier</em> for Google to find (and index) them.</p>
<p>&nbsp;</p>
<p><strong>we police the system to get short pages up to 300 words within 30 days</strong></p>
<p>&nbsp;</p>
<p>Again, bullshit. The 1and1 page has been that way since at least March 11th:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-1and1-lastmod.png" onmouseup="hl2l(event);" alt="Mahalo 1and1 page last modified March 11th" border="0"></p>
<p>&nbsp;</p>
<p>And the Alice B. Toklas one since March 12th:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-alice-b-toklas-lastmod.png" onmouseup="hl2l(event);" alt="Mahalo Alice B. Toklas page last modified March 12th" border="0"></p>
<p>&nbsp;</p>
<p>So Jason, please, enough with the bs. Quit claiming stuff that simply is not true, especially when it&#8217;s <em>so</em> damn easy to disprove what you say. I still have no idea why it is that Matt Cutts is choosing to ignore your spam, but to the rest of us it&#8217;s as plain as day. And no, Jason&#8230; going in now and trying to clean it up in no way changes the fact that you spammed in the first place.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/06/21/zomg-jason-calacanis-lied-again-shocker/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
		</item>
		<item>
		<title>Need Help Understanding The Latest Mahalo Spam?</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/#comments</comments>
		<pubDate>Thu, 17 Jun 2010 12:30:38 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=764</guid>
		<description><![CDATA[On Tuesday of this week someone posted the following question to the Hacker News website: How long has Mahalo been using keyword domains like this? The link in the story points to a search in Google, [inurl:tip_guidelines mahalo]. The results of this query show a list of somewhere between 180 and 270 sites (Google doesn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>On Tuesday of this week someone posted the following question to the Hacker News website: <a href="http://news.ycombinator.com/item?id=1433676" target="_blank">How long has Mahalo been using keyword domains like this?</a> The link in the story points to a search in Google, [<a href="http://www.google.com/search?q=inurl:tip_guidelines+mahalo&#038;hl=en&#038;filter=0" target="_blank">inurl:tip_guidelines mahalo</a>]. The results of this query show a list of somewhere between 180 and 270 sites (Google doesn&#8217;t show all of them, just the first 184 or so) all belonging to Mahalo.com, all keyword rich domains, all using the Mahalo Answers platform, and all covering material that Mahalo.com already covers. I am sure most of you are familiar with that fact that Google labels sites that have little or no content and are designed to drive affiliate conversions as <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66361" target="_blank">Thin Affiliate sites</a>:</p>
<blockquote><p>These sites usually have no original content and may be cookie-cutter sites or templates with no unique content. &#8211; <em>Google Webmasters Tools Help, on sites Google does not like</em></p></blockquote>
<p>These sites that Mahalo has started churning out, all that were apparently created just this year, would appear to be the AdSense version of the classic &#8220;thin affiliate&#8221; website.</p>
<p>I showed Matt Cutts the link to the search itself, and asked if he thought that the list of sites<span id="more-764"></span> being returned looked spammy to him. His reply?</p>
<p>&nbsp;</p>
<p><img src="/images/matt-defends-mahalo-spam.png" onmouseup="hl2l(event);" alt="@mvandemar it's not about arbitrary inurl searches that would sway me; it's impact on users (e.g. better/worse diversity) that matters most."></p>
<p>&nbsp;</p>
<p>So, yet again, for some inexplicable reason the head of the Google Web Spam team appears to be defending Jason Calacanis. Despite the <a href="http://www.seobook.com/official-mahalo-com-spam-according-googles-internal-spam-documents" target="_blank">numerous posts</a> that <a href="http://www.seobook.com/black-hat-seo-case-study" target="_blank">clearly demonstrate</a> that <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">Jason Calacanis is spamming Google</a>, Matt is saying that he needs <em>proof</em> that these new sites are spammy. Pretty much anyone else in the industry can tell at a glance what is going on, but Google&#8217;s foremost expert on the subject of spam still needs help seeing it.</p>
<p>That&#8217;s fine. Let&#8217;s go ahead and take a deeper look at what is happening behind the scenes with these sites. Here&#8217;s one of the new sites that deals with cooking, <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Acooking-questions.com" target="_blank">cooking-questions.com</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-off-site-cooking.png" onmouseup="hl2l(event);" alt="The new mahalo cooking site"></p>
<p>&nbsp;</p>
<p>336 pages indexed there. So, was the <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Amahalo.com%2Fanswers+%2Bcooking" target="_blank">&#8220;cooking&#8221; topic on Mahalo.com</a> not covered then&#8230;?</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-on-site-cooking.png" onmouseup="hl2l(event);" alt="The old mahalo cooking site"></p>
<p>&nbsp;</p>
<p>6,680 pages currently on Mahalo Answers, all about cooking. Looking at that list you can tell from the first two listings that Mahalo has one category for &#8220;Cooking Recipes&#8221;, and a completely separate category for &#8220;Cooking <em>and</em> Recipes&#8221;, whereas most non-spam directories, blogs, etc. would simply have picked one or the other. If they are struggling with diversity on the main site, how is it adding the same topic to an entirely new site is going to help? </p>
<p>Just for the record, the new, smaller site also seems to see the need to have both of those nearly identical categories as well:</p>
<p>&nbsp;</p>
<p><img src="/images/cooking-questions.com-cooking-recipes.png" onmouseup="hl2l(event);" alt="Cooking Recipes category"></p>
<p><img src="/images/cooking-questions.com-cooking-and-recipes.png" onmouseup="hl2l(event);" alt="The completely different Cooking *and* Recipes category"></p>
<p>&nbsp;</p>
<p>Also, in case you&#8217;re thinking that maybe it&#8217;s the individual questions themselves on these new sites that are &#8220;diverse&#8221;, one of the questions on the new site is &#8220;how-do-you-know-when-corn-on-the-cob-is-fully-cooked&#8221;. Mahalo.com already has 10 pages on corn on the cob, 3 of which are: &#8220;how-long-do-you-cook-corn-on-the-cob&#8221;, &#8220;how-many-minutes-do-you-think-is-the-perfect-time-to-cook-corn-on-the-cob&#8221;, and &#8220;what-is-the-perfect-amount-of-time-to-cook-corn-on-the-cob&#8221;. Yeah, that&#8217;s diversity for ya.</p>
<p>Let&#8217;s look at some of the other sites, see if maybe that first one was just a fluke. The new Mahalo site for Star Wars, <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Astarwarsanswers.com" target="_blank">starwarsanswers.com</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-off-site-star-wars.png" onmouseup="hl2l(event);" alt="The new mahalo Star Wars site"></p>
<p>&nbsp;</p>
<p>308 results. So, is <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Amahalo.com%2Fanswers+%2B%22star+wars%22" target="_blank">Star Wars not handled on Mahalo.com</a> then?</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-on-site-star-wars.png" onmouseup="hl2l(event);" alt="The Star Wars on Mahalo.com"></p>
<p>&nbsp;</p>
<p>1,200 results, so obviously this isn&#8217;t an example of &#8220;better diversity&#8221; either. Similar results for their Oklahoma City site, <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Awww.oklahomacityanswers.com" target="_blank">oklahomacityanswers.com</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-off-site-ok-city.png" onmouseup="hl2l(event);" alt="Oklahoma City questions?"></p>
<p>&nbsp;</p>
<p>177 pages, and yet again, <a href="http://www.google.com/search?q=site:mahalo.com/answers+%2B"Oklahoma+City"&#038;num=100&#038;hl=en&#038;safe=off" target="_blank">not new subject matter</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-on-site-ok-city.png" onmouseup="hl2l(event);" alt="Nothing new on OK City"></p>
<p>&nbsp;</p>
<p>632 pages. On Oklahoma City questions. How did people manage to come up with that many questions about Oklahoma City on a site that hardly anyone ever actually goes to on purpose?</p>
<p>The answer, of course, is that they didn&#8217;t. On the new site there are exactly <a href="/images/oklahomacityanswers.com-sitemap-question.xml" target="_blank">11 actual questions</a> as of this writing. The other 166 pages are (mostly empty) category pages, member pages, and other fluff that each of these <strong>&#8220;cookie-cutter sites or templates with no unique content&#8221;</strong> come with by default. Since this is all crap content then, why would Calacanis even bother with them? There is little to no interest in many of these subjects&#8230; what game could Jason possibly be playing here?</p>
<p>I know! Let&#8217;s all play:</p>
<p>&nbsp;</p>
<p><img src="/images/linkfarmville.png" onmouseup="hl2l(event);" alt="Lets Play LinkFarmVille!"></p>
<p>&nbsp;</p>
<p>Back in February I discussed how the internal pages on Mahalo.com <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">get almost no natural links</a>. Almost all of their PageRank (and thus ranking power) comes from either employees linking from their blogs or it comes from scrapers. With hundreds of thousands pages to support, however, relying on that kind of sketchy link profile has a good chance of not panning out in the long run. Someone must have pointed this out to Jason, and now these mini-sites are his solution. If you can&#8217;t build enough quality pages for people to want to link to you naturally, build sites and link to yourself. Look at, for instance, the cooking site&#8217;s question on &#8220;What is the most common way to make Angel Food Cake?&#8221;:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-cooking-angel-food-cake.png" onmouseup="hl2l(event);" alt="Angel food cake question"></p>
<p>&nbsp;</p>
<p>Within just the question itself, including the title, there are 14 links, <em>all pointing back to Mahalo.com topic pages</em>. If we examine the link profile for those pages being linked to, we see that these linkfarms that Jason is putting up provide the majority of their link juice. For example, the first page linked to is Malao&#8217;s page on &#8220;cake&#8221;:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-links-to-cake.png" target="_blank"><img src="/images/mahalo-links-to-cake-sm.png" onmouseup="hl2l(event);" alt="Cake links" border="0"></a><br />
(<em>click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>Of the 10 links listed, 4 come from scrapers, one is a url shortener that is actually on a Mahalo.com page, and the other 5 all come from Jason&#8217;s linkfarm: cooking-questions.com, parenting-questions.com, and foodiequestions.com. You can see the same is true throughout when you check the other links in the question, like the ones to the Mahalo pages on &#8220;chocolate&#8221; [<a href="http://search.yahoo.com/search?p=link%3Ahttp%3A%2F%2Fwww.mahalo.com%2Fchocolate+-site%3Amahalo.com" target="_blank">link:http://www.mahalo.com/chocolate -site:mahalo.com</a>] and &#8220;baking&#8221; [<a href="http://search.yahoo.com/search?p=link%3Ahttp%3A%2F%2Fwww.mahalo.com%2Fbaking+-site%3Amahalo.com" target="_blank">link:http://www.mahalo.com/baking -site:mahalo.com</a>].</p>
<p>Some of the pages being linked to don&#8217;t exist any more, since Jason did go in and delete some content in response to a good talking to Matt Cutts gave him a couple of months ago&#8230; but that&#8217;s ok, the links are there just in case they ever decide to build those pages back out.</p>
<p>The embedded links aren&#8217;t the only ones on these sites, either. You also have the sidebar links pointing back to Mahalo:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-cooking-site-sidebar-links.png" onmouseup="hl2l(event);" alt="Sidebar link spam too"></p>
<p>&nbsp;</p>
<p>Of course all of these little sites need some link juice themselves, in order to stay indexed and pick up some long tail rankings, which in turn with garner them some scraper backlink juice of their own. Jason&#8217;s got that covered too. For some of the pages he has redirected old pages from Mahalo.com, which acts as a reciprocal link exchange between Mahalo and the new sites. On iphoneqna.com, he has added a &#8220;Recommended Q&#038;A Communities&#8221; block to the sidebar, which adds in an extra level of interlinking to the equation:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-iphoneqna-sidebar-spam-links.png" onmouseup="hl2l(event);" alt="Sidebar link spam two, too"></p>
<p>&nbsp;</p>
<p>And of course there&#8217;s always the fall back tactic of Jason simply linking to some of these sites from his <a href="http://calacanis.com/2010/04/17/cool-site-for-folks-with-facebook-questions-knowledge-httpbit-ly9npq0n/" target="_blank" rel="nofollow">personal blog</a> and <a href="http://jasoncalacanis.tumblr.com/post/528859313/cool-site-for-folks-with-facebook-questions-knowledge" target="_blank" rel="nofollow">Tumblr account</a> to give them a little extra kick.</p>
<p>Like much of what Jason does, nothing in these tactics is new. Google has directly addressed the non-acceptability of this type of link building for quite a while now:</p>
<blockquote><p>&#8230; some webmasters engage in link exchange schemes and build partner pages exclusively for the sake of cross-linking, disregarding the quality of the links, the sources, and the long-term impact it will have on their sites. This is in violation of Google&#8217;s webmaster guidelines and can negatively impact your site&#8217;s ranking in search results. &#8211; <em><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66356" target="_blank">Google Webmaster Tools Help page on Link schemes</a></em></p></blockquote>
<p>It is spam, pure and simple. There is no added user experience, no diversity, and no reason for all of these sites, <em>including Mahalo.com</em>, to not get banned from Google.</p>
<div><em>Original <a href="http://www.flickr.com/photos/44702485@N04/4530409989/" target="_blank">FarmVille image</a> attribution goes to <a href="http://www.flickr.com/photos/44702485@N04/">tarikgore1</a>.</em></div>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/feed/</wfw:commentRss>
		<slash:comments>80</slash:comments>
		</item>
		<item>
		<title>Dear Matt Cutts, What&#8217;s Your Take On Addon Domains?</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/06/15/dear-matt-cutts-whats-your-take-on-addon-domains/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/06/15/dear-matt-cutts-whats-your-take-on-addon-domains/#comments</comments>
		<pubDate>Tue, 15 Jun 2010 19:58:05 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[nerdiness]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=756</guid>
		<description><![CDATA[Today Matt Cutts answered a question from &#8220;Land Lubber&#8221;, Colorado. Land Lubber asks: What&#8217;s your take on &#8220;addon domains&#8221;? Does Google penalize someone for having one or more addon domains on their main website, (or if they&#8217;re self hosting)? e.g. 2, 5, or 10 all coming from the same IP address, would that be bad? [...]]]></description>
			<content:encoded><![CDATA[<p>Today <a href="" target="_blank">Matt Cutts</a> answered a question from &#8220;Land Lubber&#8221;, Colorado. Land Lubber asks:</p>
<blockquote><p>What&#8217;s your take on &#8220;addon domains&#8221;? Does Google penalize someone for having one or more addon domains on their main website, (or if they&#8217;re self hosting)? e.g. 2, 5, or 10 all coming from the same IP address, would that be bad? &#8211; <em>Land Lubber, CO</em></p></blockquote>
<p>Matt responded with the following:<span id="more-756"></span></p>
<blockquote><p>Land Lubber, I live in a golden castle in the sky, and I have never had to use shared hosting in my life. Furthermore, I never listen to silly seo rumors either, so I had no idea people were even wasting time on such ridiculous ideas. To be honest, I have no idea what an &#8220;addon domain&#8221; is, even though you spelled it out for me as being multiple domains on the same hosting account. Therefore, I am just going to make something up and talk about that instead. &#8211; <em>Matt Cutts&#8230; well, sorta, interpreted</em></p></blockquote>
<p>Here is the actual video:</p>
<p>&nbsp;</p>
<p><object width="500" height="306"><param name="movie" value="http://www.youtube.com/v/RlNPRonDxuU?fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/RlNPRonDxuU?fs=1" type="application/x-shockwave-flash" width="500" height="306" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>&nbsp;</p>
<p>My take? Obviously based on Matt&#8217;s response as long as you are not doing it for spammy purposes Google doesn&#8217;t give a rats ass if you have more than one site on the same account. <em>Hopefully</em> his lack of caring (or knowing) about it clears that up. <img src='http://smackdown.blogsblogsblogs.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><strong>Update:</strong> Here is a link where Matt dispelled the myth that multiple sites all hosted on the same IP would be an inherent issue with Google all the way back in 2006:</p>
<p><a href="http://www.mattcutts.com/blog/myth-busting-virtual-hosts-vs-dedicated-ip-addresses/" target="_blank">Myth busting: virtual hosts vs. dedicated IP addresses</a></p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/06/15/dear-matt-cutts-whats-your-take-on-addon-domains/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Was The Google Mayday Update A Complete Failure Then?</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/06/11/was-the-google-mayday-update-a-complete-failure-then/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/06/11/was-the-google-mayday-update-a-complete-failure-then/#comments</comments>
		<pubDate>Fri, 11 Jun 2010 15:52:35 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=722</guid>
		<description><![CDATA[Earlier this week at SMX Advanced Seattle, during the You&#038;A With Matt Cutts, the topic of the latest Google update, dubbed Mayday by webmaster last month, happened to come up. According to Ryan Jones&#8217; live blogging account of the SMX Keynote the update had nothing to do with the web spam team. It was an [...]]]></description>
			<content:encoded><![CDATA[<p>Earlier this week at <a href="http://searchmarketingexpo.com/advanced/" target="_blank">SMX Advanced Seattle</a>, during the <strong>You&#038;A With Matt Cutts</strong>, the topic of the latest Google update, dubbed <a href="http://searchengineland.com/google-confirms-mayday-update-impacts-long-tail-traffic-43054" target="_blank">Mayday</a> by webmaster last month, happened to come up. According to Ryan Jones&#8217; <a href="http://www.dotcult.com/live-blogging-matt-cutts-you-a" target="_blank">live blogging account of the SMX Keynote</a> the update had nothing to do with the web spam team. It was an algorithmic change that was intended to &#8220;make long tail results more useful&#8221;. Matt made statements in effect telling webmasters who might have been affected by MayDay that they should look at their content and see how usefulness or unique content could be added to those pages. This indicates that the point of the Mayday update was to filter out or penalize results that are <em>not</em> unique content, or that are simply autogenerated results.</p>
<p>Matt made similar statements when he was <a href="http://www.webpronews.com/topnews/2010/06/09/google-mayday-update-designed-to-hit-auto-generated-pages-content-farms" target="_blank">interviewed by WebProNews</a> and the topic came up:<span id="more-722"></span></p>
<blockquote><p>How do I make sure that I am returning the highest quality content, stuff that&#8217;s really useful for users, whether it&#8217;s editorial discretion, unique content user generated content, you know, stuff that&#8217;s not available anywhere else, versus just something that&#8217;s scraped, or duplicate, or really kind of lower quality. &#8211; <em>Matt Cutts, explaining how not to get penalized by the Mayday Update</em></p></blockquote>
<p>During the keynote, in response to Matt&#8217;s explanation of what Mayday was supposed to accomplish Danny Sullivan indicated that he hopes that this update will help filter out results from <a href="http://www.seobook.com/content-mills" target="_blank">content mills</a> like Mahalo. <a href="http://smackdown.blogsblogsblogs.com/2010/03/08/mahalo-com-meet-the-new-spam-worse-than-the-old-spam/" target="_blank">Mahalo</a> certainly sounds exactly like what Matt was describing, as I have written about in the past. </p>
<p>So, did the Mayday update actually accomplish filtering out this &#8220;low quality&#8221; content from the search results? I went back and checked some of <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">Jason Calacanis&#8217; spam pages on Mahalo</a> that I had blogged about in the past. Unsurprisingly enough, most of the pages I checked were still ranking just fine in Google. What was slightly unexpected, however, was what the listing looked like for one of them, [<a href="http://www.google.com/search?q=need+for+speed+walkthrough" target="_blank">need for speed walkthrough</a>]:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-need-for-speed-stub-listing-sm.png" onmouseup="hl2l(event);" alt="Need for speed walkthrough stub listing"></p>
<p>&nbsp;</p>
<p>Notice the lack of a snippet for that listing in the search results? That is because due to my earlier write-ups about Mahalo and Google, in an attempt to <a href="http://smackdown.blogsblogsblogs.com/2010/03/08/jason-calacanis-makes-matt-cutts-a-liar/" target="_blank">keep up appearances with Matt Cutts</a>, Jason had the team move a bunch of the pages that I wrote about into a directory named /stub, and then <a href="http://www.mahalo.com/robots.txt" target="_blank">blocked that directory via robots.txt</a>. When Google encounters blocked content that has enough link juice it lists those pages, like you see here, as url-only. What Google usually doesn&#8217;t do, however, is actually rank those pages well in the search results.</p>
<p>Thinking that maybe it was a fluke, and that perhaps that particular listing just had some extra ranking power because I had blogged about it before and therefore it gained a few extra links, I delved further. This is just a sampling of what I found:</p>
<p>[<a href="http://www.google.com/search?&#038;q=fallen+angel+walkthrough" target="_blank">fallen angel walkthrough</a>] (#4)</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-fallen-angel-sm.png" onmouseup="hl2l(event);" alt="fallen angel walkthrough stub listing" border="0"></p>
<p>&nbsp;</p>
<p>[<a href="http://www.google.com/search?q=jack+keane+walkthrough" target="_blank">jack keane walkthrough</a>] (#7)</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-jack-keane-sm.png" onmouseup="hl2l(event);" alt="jack keane walkthrough stub listing" border="0"></p>
<p>&nbsp;</p>
<p>[<a href="http://www.google.com/search?q=how+to+plan+thanksgiving+dinner" target="_blank">how to plan thanksgiving dinner</a>] (#7)</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-thanksgiving-dinner-sm.png" onmouseup="hl2l(event);" alt="how to plan thanksgiving dinner stub listing" border="0"></p>
<p>&nbsp;</p>
<p>[<a href="http://www.google.com/search?q=allheart+coupons&#038;num=10&#038;start=10" target="_blank">allheart coupons</a>] (#11)</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-allheart-coupons-sm.png" onmouseup="hl2l(event);" alt="allheart coupons" border="0"></p>
<p>&nbsp;</p>
<p>[<a href="http://www.google.com/search?q=adult+friend+finder+coupons" target="_blank">adult friend finder coupons</a>] (#1)</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-adult-friend-finder-sm.png" onmouseup="hl2l(event);" alt="adult friend finder coupons" border="0"></p>
<p>&nbsp;</p>
<div class="standout">
<strong>Let&#8217;s look at that, especially the last one.</strong></p>
<p>This is a page with 0 (as in none, nil, nadda, zilch) spiderable content, yet Google has deemed it worthy of ranking it #1, above <em>every other page in the index</em> that matches that phrase. Every.Single.One. We&#8217;re given the company line telling webmasters that in order to succeed in ranking in Google one must focus on &#8220;quality&#8221; and &#8220;unique&#8221; content, yet Google decides to give Mahalo a golden ticket for pages they can&#8217;t even <em>see</em>? </p>
<p><strong>Wtf?</strong>
</div>
<p>&nbsp;</p>
<p>Hm. Maybe the key is that if you want your duplicate, low quality, spammy content ranked then all you have to do is block it with robots.txt&#8230;?</p>
<p>Obviously that is not the case, and anyone who understands at all about how these things work will recognize that statement as ludicrous&#8230; but just to be sure, let&#8217;s look at some of the non-blocked content Mahalo pages are currently ranking for:</p>
<p>[<a href="http://www.google.com/search?q=julmust" target="_blank">julmust</a>] (#7)</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-julmust-sm.png" onmouseup="hl2l(event);" alt="Mahalo julmust scraped content" border="0"></p>
<p>&nbsp;</p>
<p>[<a href="http://www.google.com/search?q=The+Alice+B.+Toklas+Cookbook&#038;num=10&#038;start=10" target="_blank">The Alice B. Toklas Cookbook</a>] (#11)</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-alice-b-toklas-sm.png" onmouseup="hl2l(event);" alt="Mahalo Alice B Toklas scraped content" border="0"></p>
<p>&nbsp;</p>
<p>These Mahalo pages being returned are both examples of the many pages on Mahalo.com that are nothing more than <a href="http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/" target="_blank">content scraped directly from Wikipedia</a>:</p>
<p><a href="http://en.wikipedia.org/wiki/Julmust" target="_blank">http://en.wikipedia.org/wiki/Julmust</a><br />
<a href="http://en.wikipedia.org/wiki/The_Alice_B._Toklas_Cookbook">http://en.wikipedia.org/wiki/The_Alice_B._Toklas_Cookbook</a></p>
<p>Note also that Wikipedia identifies some of their content as being &#8220;stubs&#8221; (the Alice B Toklas Cookbook page has a mere 261 words of content, including numbers and &#8220;a&#8221;, &#8220;and&#8221;, and &#8220;the&#8221;), but Mahalo is fine with presenting that exact same content as non-stub for whatever reason.</p>
<p>So, Danny, sorry&#8230; it looks as if Google did not achieve what it reportedly wanted to do with this latest update. It looks like both the low quality and completely duplicate (or even non-existant) content on Mahalo.com continues to rank. Maybe next time.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/06/11/was-the-google-mayday-update-a-complete-failure-then/feed/</wfw:commentRss>
		<slash:comments>31</slash:comments>
		</item>
		<item>
		<title>The Mahalo Paid Link Evidence Trail</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/03/13/the-mahalo-paid-link-evidence-trail/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/03/13/the-mahalo-paid-link-evidence-trail/#comments</comments>
		<pubDate>Sat, 13 Mar 2010 22:53:45 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=636</guid>
		<description><![CDATA[Matt Cutts asked me in a Sphinn comment the following question relating the my post on paid links on Mahalo.com: &#160; &#160; Let&#8217;s ignore, for the moment anyways, that for absolutely any other website on the internet with the evidence that I presented the process would most likely entail the webmaster proving their innocence rather [...]]]></description>
			<content:encoded><![CDATA[<p>Matt Cutts asked me in a <a href="http://sphinn.com/story/144419/#75691" target="_blank">Sphinn comment</a> the following question relating the my post on <a href="http://smackdown.blogsblogsblogs.com/2010/03/12/jason-calacanis-screw-you-google-now-ill-sell-links-too/" target="_blank">paid links on Mahalo.com</a>:<span id="more-636"></span></p>
<p>&nbsp;</p>
<p><img src="/images/matt-cutts-paid-link-question.png" onmouseup="hl2l(event);" alt="mvandemar, the site you mention is whatsyourconundrum.com, but that site is hosted on ns1.mahalo.com (the same nameserver as Mahalo). That makes it a cross-link, but what's the evidence that it's a paid link?" border="0"></p>
<p>&nbsp;</p>
<p>Let&#8217;s ignore, for the moment anyways, that for absolutely any other website on the internet with the evidence that I presented the process would most likely entail the webmaster proving their innocence rather than the person reporting the paid link proving that they are guilty, and that they would have to do so <em>after</em> they actually got banned. Before I get started, keep in mind&#8230; whatsyourconundrum.com is <em>not</em> Conundrum Wine&#8217;s main website. Their actual website is <a href="http://conundrumwine.com/" target="_blank">http://conundrumwine.com/</a>. The site that is being linked to from Mahalo is a marketing device, a link or PageRank funnel, something that acts as an intermediary link bait, or simply to increase brand recognition. This is actually a great way to help increase your exposure without having to put gimmicky items on your company&#8217;s professional website. That main website is <em>not</em> hosted on Mahalo.com&#8217;s servers. </p>
<p>Now, Matt is correct. The whatsyourconundrum.com website is indeed hosted on Mahalo servers. Without even looking at the IP address, we know by looking at the <a href="http://whois.domaintools.com/whatsyourconundrum.com" target="_blank">whois record</a> the dns servers are ns1.mahalo.com and ns2.mahalo.com:</p>
<p>&nbsp;</p>
<p><img src="/images/whatsyourconundrum-whois.png" onmouseup="hl2l(event);" alt="whatsyourconundrum.com whois" border="0"></p>
<p>&nbsp;</p>
<p>The reason, I am guessing, is that apparently whatsyourconundrum.com is powered by a white label version of Mahalo&#8217;s Answers. What we also see in the whois record is that the domain is owned by Camus Vinyards, and that their administrative email is domainadmin@caymus.com. Following that hint, we see that the caymus.com domain is owned by Caymus Vinyards:</p>
<p>&nbsp;</p>
<p><img src="/images/caymus-whois.png" onmouseup="hl2l(event);" alt="caymus.com whois" border="0"></p>
<p>&nbsp;</p>
<p>Caymus.com is in turn being hosted on ewinerysolutions.com servers. Ewinerysolutions.com is being hosted <a href="http://whois.domaintools.com/ewinerysolutions.com" target="_blank">on their own servers</a>, so the dns trail stops there. If we go to the <a href="http://www.ewinerysolutions.com/" target="_blank">eWinery Solutions</a> website, we can see that what they do is offer marketing services to wineries:</p>
<blockquote><p>The most profitable and fastest-growing segment of the wine industry today is the consumer-direct channel. eWinery Solutions offers you the freedom to market your wines easily and effectively by creating a one-on-one dialogue with your best customers.</p></blockquote>
<p>Jot on over to their <a href="http://www.ewinerysolutions.com/portfolio" target="_blank">Portfolio page</a>, and we see that Conundrum Wine is in fact a client of eWinery Solutions:</p>
<p>&nbsp;</p>
<p><img src="/images/ewinery-clients.png" onmouseup="hl2l(event);" alt="Conundrum is an eWinery client" border="0"></p>
<p>&nbsp;</p>
<p>Now, does eWinery offer their services to Conundrum out of the goodness of their heart? Somehow I doubt it. Did Jason agree to lease and host the white-label version of Mahalo Answers to them because of all of the warm fuzzies he knew it would give him? Of course not. In fact, he even refers to this deal as being Mahalo&#8217;s first &#8220;client&#8221;:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-wine-client.png" onmouseup="hl2l(event);" alt="Mahalos first client?" border="0"></p>
<p>&nbsp;</p>
<p>Whether the header link was part of the original deal negotiated to set up whatsyourconundrum.com, or a separate cash transaction, the fact remains that this was not a merit based link. It has it&#8217;s roots in a commercial transaction that most likely occurred between someone at Mahalo and the people doing Conundrum&#8217;s marketing. What I also can&#8217;t say is whether or not the clean sitewide links pointing to whatsyourconundrum.com on Jason&#8217;s private blog was part of the deal or not&#8230; but there are 5 of those links there as well, under the heading of &#8220;Daily Reads&#8221;:</p>
<p>&nbsp;</p>
<p><img src="/images/calacanis.com-paid-links.png" onmouseup="hl2l(event);" alt="Calacanis paid links?" border="0"></p>
<p>&nbsp;</p>
<p>Hopefully Matt won&#8217;t ask me to get all ninja and obtain a copy of the actual invoice&#8230; that might prove a little problematic for me. <img src='http://smackdown.blogsblogsblogs.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/03/13/the-mahalo-paid-link-evidence-trail/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Jason Calacanis: Screw You Google, Now I&#8217;ll Sell Links Too</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/03/12/jason-calacanis-screw-you-google-now-ill-sell-links-too/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/03/12/jason-calacanis-screw-you-google-now-ill-sell-links-too/#comments</comments>
		<pubDate>Fri, 12 Mar 2010 13:45:43 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=605</guid>
		<description><![CDATA[By now Google has to be getting more than a little embarrassed about the behavior of Mr. Jason Calacanis and his site, Mahalo.com. Aaron Wall did a very well written piece explaining how Mahalo Makes Black Look White and the spammy techniques they were employing. This isn&#8217;t the first time Aaron has blogged about Mahalo [...]]]></description>
			<content:encoded><![CDATA[<p>By now Google has to be getting more than a little embarrassed about the behavior of Mr. Jason Calacanis and his site, Mahalo.com. Aaron Wall did a very well written piece explaining how <a href="http://www.seobook.com/black-hat-seo-case-study" target="_blank">Mahalo Makes Black Look White</a> and the spammy techniques they were employing. This isn&#8217;t the first time Aaron has <a href="http://www.seobook.com/mark-cubans-mahalo-wants-your-blood-and-gets-it-too" target="_blank">blogged about Mahalo</a> either, and talked about exactly how <a href="http://www.seobook.com/why-mahalo-and-other-content-scrapers-render-googles-spam-team-flaccid">this makes Google look bad</a>. For those who might not know, I <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">have also</a> <a href="http://smackdown.blogsblogsblogs.com/2010/03/08/jason-calacanis-makes-matt-cutts-a-liar/" target="_blank">been blogging</a> <a href="http://smackdown.blogsblogsblogs.com/2010/03/08/mahalo-com-meet-the-new-spam-worse-than-the-old-spam/" target="_blank">about this</a> <a href="http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/" target="_blank">recently</a>.</p>
<p>While Google will <a href="http://groups.google.com/groups/search?hl=en&#038;ie=UTF-8&#038;q=banned+google" target="_blank">ban smaller websites</a> from their search results or from AdSense on a whim, usually it takes heavier coverage<span id="more-605"></span> for bigger players to get hit. Like, for instance, when <a href="http://blogoscoped.com/archive/2006-02-01-n31.html" target="_blank">Google Blogoscoped outed BMW</a> for spammy doorway pages. The story spread relatively fast, and Google wound up banning BMW for a short period of time. So when someone has a &#8220;special&#8221; relationship with Google, as Jason appears to have, and keeps getting second (and third, and fourth, and fifth&#8230;) chances to clean up their act, yet continues to snub their nose in Google&#8217;s general direction, it makes one wonder. Google has to be at least somewhat concerned that someone in the mainstream media will eventually notice and start to ask why someone like Jason would continually be allowed to get away with this stuff. Considering the unfairness and lack of impartiality of letting a site like Mahalo slide while punishing so many smaller sites for lesser offenses, my guess is that Google doesn&#8217;t actually want to discuss their reasons behind ignoring it. And so, each time Google does nothing, Jason decides to push things a little more.</p>
<p>This time it looks like Jason has decided to go ahead and violate the rules <a href="http://www.mattcutts.com/blog/selling-links-that-pass-pagerank/" target="_blank">closest to Matt Cutts heart</a>. While the layout I am showing here will change with time, since the header contains rotating articles, currently if you go to Mahalo at the top of every page (on the non-Answers side, anyways) you will see the following block of stories that Mahalo is highlighting (usually this area contains trending or hot news items):</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-ad-in-header.png" target="_blank"><img src="/images/mahalo-ad-in-header-sm.png" onmouseup="hl2l(event);" alt="Conundrum?" border="0"></a><br />
(<em>Click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>While technically speaking just by looking at them there is nothing to distinguish one of those &#8220;featured stories&#8221; from another, the one that doesn&#8217;t actually belong is the third one in, with the caption &#8220;Best Pickup Line Ever?&#8221;. The reason that one is different from all the rest is simple&#8230; it&#8217;s not a Mahalo featured story at all, and has nothing to do with anything going on in the news. It&#8217;s an ad. It is a paid link that Mahalo.com sold, one that leads to a site built to <a href="http://www.whatsyourconundrum.com/love-and-relationships/best-pickup-line-ever" target="_blank" rel="nofollow">market a wine company</a>. There is nothing visual to distinguish or disclose that <em>as</em> an ad, and if we view the source of the page&#8230;</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-ad-in-header-source.png" target="_blank"><img src="/images/mahalo-ad-in-header-source-sm.png" onmouseup="hl2l(event);" alt="Wheres the nofollow...?" border="0"></a><br />
(<em>Click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>&#8230; we can see that there is nothing <em>machine readable</em> (ie. nofollow attribute) to distinguish it as an ad, either.</p>
<p>Matt Cutts has been very, very clear on his take on sites that sell links that pass PageRank, or ones that don&#8217;t disclose that they are in fact ads: they are spamming. No if, ands, or buts about it, they deserve to get punished. In fact, he has even gone to far to state that in his view undisclosed paid links <a href="http://www.mattcutts.com/blog/hidden-links/" target="_blank">violate FTC guidelines</a>.</p>
<p>So, Matt, recently you put out a <a href="http://www.mattcutts.com/blog/calling-for-link-spam-reports/" target="_blank">call for link spam reports</a>, including &#8220;paid links that pass PageRank&#8221;. Really, though, is there any point in reporting Mahalo to you? Are you going to actually take action, or, like you have done with Jason&#8217;s spam in the past, will you continue to simply look the other way? Any other site would be faced with a penalize/ban first, make nice nice with Google later. Hell, with the BMW site you penalized them <em>after</em> they cleaned it up, just to make an example. I get the strangest feeling, though, that won&#8217;t happen with Mahalo&#8230;</p>
<p>Can you at least say <em>something</em> about this issue&#8230;?</p>
<p><strong>Update:</strong> in response to a comment below and a question posed by Matt Cutts about what makes me believe that this is indeed a paid link:</p>
<p><a href="http://smackdown.blogsblogsblogs.com/2010/03/13/the-mahalo-paid-link-evidence-trail/">The Mahalo Paid Link Evidence Trail</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/03/12/jason-calacanis-screw-you-google-now-ill-sell-links-too/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Jason Calacanis&#8217; Backup Plan For Replacing Content: Steal It From Wikipedia</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/#comments</comments>
		<pubDate>Thu, 11 Mar 2010 18:11:08 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=590</guid>
		<description><![CDATA[/sigh Ok Jason, we get it, you&#8217;re desperate. But stealing content from Wikipedia in order to replace what you deleted? Come on! I am flipping through Mahalo.com today, just seeing if you&#8217;re keeping your word or not, when all of a sudden I notice this huge amount of pages with odd names that somehow I [...]]]></description>
			<content:encoded><![CDATA[<p>/sigh</p>
<p>Ok Jason, we get it, you&#8217;re desperate. But stealing content from Wikipedia in order to replace what you deleted? Come on!</p>
<p>I am flipping through Mahalo.com today, just seeing if<span id="more-590"></span> you&#8217;re keeping your word or not, when all of a sudden I notice this <em>huge</em> amount of pages with odd names that somehow I missed before:</p>
<p><a href="http://www.mahalo.com/cgs-20625" target="_blank" rel="nofollow">http://www.mahalo.com/cgs-20625</a><br />
<a href="http://www.mahalo.com/cgs-9896" target="_blank" rel="nofollow">http://www.mahalo.com/cgs-9896</a><br />
<a href="http://www.mahalo.com/cp-154-526" target="_blank" rel="nofollow">http://www.mahalo.com/cp-154-526</a><br />
<a href="http://www.mahalo.com/daa-1097" target="_blank" rel="nofollow">http://www.mahalo.com/daa-1097</a><br />
<a href="http://www.mahalo.com/daa-1106" target="_blank" rel="nofollow">http://www.mahalo.com/daa-1106</a></p>
<p>These are all nothing more than content stolen from Wikipedia. Your version:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-cgs-20625.png" onmouseup="hl2l(event);" alt="Mahalo CGS-20625" border="0"></p>
<p>&nbsp;</p>
<p><a href="http://en.wikipedia.org/wiki/CGS-20625" target="_blank">Wikipedia&#8217;s version</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/wikipedia-cgs-20625.png" onmouseup="hl2l(event);" alt="Wikipedia CGS-20625" border="0"></p>
<p>&nbsp;</p>
<p>You even hyperlinked the same internal linking scheme to the same topics Wikipedia does, regardless of whether or not those pages exist on Mahalo.com. How the hell can you claim this is original content when it is nothing more than cut and paste? I mean, wtf, you just claimed that <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">Wikipedia is nothing more than a free for all</a> a little under 3 weeks ago&#8230; does the content somehow take on some magical value after you scrape it and host in on your servers, trying to pass it off as something one of your users wrote? THIS is the &#8220;<a href="http://twitter.com/Jason/status/10187602318" target="_blank" rel="nofollow">our users build it</a>&#8221; content that you were referring to? </p>
<p>&nbsp;</p>
<p><img src="/images/jason-calacanis-our-users-build-it.png" onmouseup="hl2l(event);" alt="Our users build it" border="0"></p>
<p>&nbsp;</p>
<p>News flash, Jason&#8230; your users and Wikipedia&#8217;s users are <em>not</em> the same people. You lumped Squidoo into the same category back then as well. If I look close enough, will I find content stolen from them too?</p>
<p>Now, to be fair, maybe this content existed all along but was much, much less noticeable when you had all of those <em>other</em> pages of fluff in there, but now that you have <a href="http://smackdown.blogsblogsblogs.com/2010/03/10/dear-jason-calacanis-this-isnt-an-absurd-microscope/" target="_blank">deleted 78% of that side of Mahalo</a>, these scraped pages are practically impossible to miss. None of the pages I looked at were actually indexed in Google, but it looks like <a href="" target="_blank">at least 27,900</a> of them currently are:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-wikipedia-pages-indexed.png" target="_blank"><img src="/images/mahalo-wikipedia-pages-indexed-sm.png" onmouseup="hl2l(event);" alt="27,900 indexed pages scraped from Wikipedia on Mahalo" border="0"></a><br />
(<em>Click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>Considering that Google only has a portion of those indexed, and that at last count there were only 128,324 pages left on that side of your site, that means that at minimum over 21% (and in all likelihood 30% &#8211; 40%) of the remaining pages on Mahalo.com are these scraped ones. Is that really what you want on your &#8220;Human Powered Search Engine&#8221;&#8230;?</p>
<p>There are 2 major differences between the original content and your version. 1) The original content cites the sources directly there on the page, whereas Mahalo does not, and 2) Mahalo is using each and every one of these scraped pages to automatically create 2 (and sometimes 3) additional contentless pages under the guise of questions being asked anonymously, questions that no one ever actually asked, but that are there solely for the purpose of bolstering your indexed page count in Google. The first question asked of every drug is <a href="http://www.mahalo.com/answers/health/what-are-the-side-effects-of-cgs-20625" target="_blank" rel="nofollow">What are the side effects of {insert drug}?</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-answers-cgs-20625.png" onmouseup="hl2l(event);" alt="What are the side effects of CGS-20625" border="0"></p>
<p>&nbsp;</p>
<p>And the second is always <a href="http://www.mahalo.com/answers/health/where-can-i-get-cgs-20625" target="_blank" rel="nofollow">Where can I get {insert drug}</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-answers-cgs-20625-b.png" onmouseup="hl2l(event);" alt="Where can I get CGS-20625" border="0"></p>
<p>&nbsp;</p>
<p>I also saw a &#8220;Who makes {insert drug}&#8221; question here and there as well. These are empty questions, asked by a bot, that for the most part will never get answered (or even looked at) and were never intended to. Three plus free pages (two of them <em>completely</em> devoid of content) for the price of one scraped page. Jason, seriously, do you really think you are slick doing this?</p>
<p>By the way, a huge number of those pages are less than 100 words in length, yet none of them have the noindex tag. Regardless of of what the length is, however, it&#8217;s not what you are claiming Mahalo.com <em>is</em> Jason, it&#8217;s more fluff. Wikipedia already gives us those articles. You add nothing to the interwebs by copying them. In all seriousness you should just go through and delete them all.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
		</item>
		<item>
		<title>Dear Jason Calacanis: This Isn&#8217;t An &#8220;Absurd Microscope&#8221;</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/03/10/dear-jason-calacanis-this-isnt-an-absurd-microscope/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/03/10/dear-jason-calacanis-this-isnt-an-absurd-microscope/#comments</comments>
		<pubDate>Wed, 10 Mar 2010 13:43:25 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[nerdiness]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=578</guid>
		<description><![CDATA[Jason Calacanis replied to my post from yesterday. In it he discusses how he is indeed deleting many of the spammy pages that I had pointed out. Some, like the duplicate content doorway pages, he continues to defend. Either way, progress is being made. However, he still kinda kills it by tossing in at the [...]]]></description>
			<content:encoded><![CDATA[<p>Jason Calacanis <a href="http://smackdown.blogsblogsblogs.com/2010/03/08/jason-calacanis-makes-matt-cutts-a-liar/comment-page-1/#comment-37066" target="_blank">replied</a> to my post from yesterday. In it he discusses how he is indeed deleting many of the spammy pages that I had pointed out. Some, like the duplicate content doorway pages, he continues to defend. Either way, progress is being made.</p>
<p>However, he still kinda kills it by tossing in at the end about how this whole scrutiny on his site is &#8220;absurd&#8221;, and anyone who calls him on it is being &#8220;vicious&#8221;:<span id="more-578"></span></p>
<blockquote><p>At the end of the day the absurd microscope we&#8217;re being put under by the SEO community is actually making our product better, and yes even improving the SEO of our best quality pages. For that, I thank you guys.</p>
<p>That being said, you really don&#8217;t have be so vicious about it. &#8211; <em>Jason Calacanis</em></p></blockquote>
<p>Jason, there is nothing &#8220;absurd&#8221; about this shit at all. Ya know, that whole Thou Dost Protest Too Much line? That applies here. </p>
<p>You&#8217;re sitting there calling this &#8220;vicious&#8221; and yet continuing to lie about the situation. You appear to be forgetting that I am not some moron who doesn&#8217;t have the facts. You made a statement over on HN and are making a similar statement here trying to implicate that <em>other people</em> put all of those short pages on your site. This isn&#8217;t the case, and for ffs, it was explained on <strong>Mahalo itself</strong> how they got there in the first place. Those pages that I blogged about, and that you are now saying are being noindexed (and which should <em>all</em> be getting removed, not just noindexed) were generated by a bot, one which ran on your site, one that turned people&#8217;s searches into static pages, which did so if and only if the page generated clicks, and which then <em>added those pages to your sitemap so Google could more easily find them</em>. This bot was once named &#8220;searchclick&#8221;, and you actual went out of your way to rename it to &#8220;stub&#8221; in order to make it sound less conspicuous. Wtf Jason, do you really think there are many things more indicative of awareness of the wrongdoing than going out of your way to actually <em>cover up evidence</em>&#8230;?</p>
<p>YOU are the spammer, Jason, you who had people modify whatever original software Mahalo was based on to behave like this. This is not some &#8220;consipiricy of the SEO&#8217;s&#8221; designed to get back at you for anything. You were spamming, acting like a dick when people called you on it, and then on top of that you are getting preferential treatment from Google during this whole thing to boot. Tons of people are getting banned from Google, or AdSense, or both, every day, and they don&#8217;t get to talk to a Google rep. They get a form letter and little to no information about what they did wrong (if anything). <em>You</em> get a personal talking to from the head of the Google spam team no less, and <em>you&#8217;re</em> response is to whine about how all the &#8220;trolls&#8221; are being mean to you? Give me a break, Jason.</p>
<p>Matt Cutts himself told you that if you didn&#8217;t clean things up that Google might take action. Matt is the head of the Web Spam team. You seriously want us to believe that you thought he was talking about something other than spam when he said that..?</p>
<p>To put things in perspective, not counting the Mahalo Answers side of things (which I haven&#8217;t even started to look at) you started out with 12 xml sitemaps each containing up to 50,000 pages in them. The total number of pages was 598,661. After the trimming down, you now have 3 xml sitemaps, with a total of 128,324 pages left. That means that you just whacked 78.6% of the main site, <em>and you still have empty pages in the sitemap</em>. Just looking by hand, it looks like between the remaining empty pages you told me you were going to eventually remove from the sitemap and the fluff that is in there, only about 1 out of 3 of the remaining pages should actually still exist. I pointed one example out on Twitter, but here&#8217;s another set so you can again see what I am talking about when I say fluff:</p>
<p><a href="http://www.mahalo.com/jay-z-brooklyn-go-hard" target="_blank" rel="nofollow">http://www.mahalo.com/jay-z-brooklyn-go-hard</a><br />
<a href="http://www.mahalo.com/jay-z-brooklyn-go-hard-video" target="_blank" rel="nofollow">http://www.mahalo.com/jay-z-brooklyn-go-hard-video</a><br />
<a href="http://www.mahalo.com/jay-z-brooklyn-we-go-hard" target="_blank" rel="nofollow">http://www.mahalo.com/jay-z-brooklyn-we-go-hard</a></p>
<p>Same topic with slight rewordings, none with any substantial amounts of content, those could easily be (and should be) a single page. Now, I understand that it takes time to get to all of those pages, but still, the numbers are kind of huge.</p>
<p>If this helps you get a better mental image of why this shit is not an &#8220;absurd microscope&#8221;, here are some cool infographics for you to look at (these are to scale, by the way, not some made up ratios):</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-shrink-infographic.png" onmouseup="hl2l(event);" alt="Mahalo shrinking site infographic" border="0"></p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-shrink-infographic2.png" onmouseup="hl2l(event);" alt="Mahalo shrinking site infographic #2" border="0"></p>
<p>&nbsp;</p>
<p>We <em>know</em> those pages didn&#8217;t add up to a ton of traffic, Jason, you keep telling us. As I said before it&#8217;s not about that, it&#8217;s about all the free PageRank they were pulling in. We&#8217;ll see where things sit after the cleanup actually gets finished and Google gets around to respidering what is and what isn&#8217;t left. My guess? You will be surprised how large an impact this &#8220;small portion&#8221; of your site was actually having.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/03/10/dear-jason-calacanis-this-isnt-an-absurd-microscope/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

