<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Smackdown! &#187; scams</title>
	<atom:link href="http://smackdown.blogsblogsblogs.com/category/scams/feed/" rel="self" type="application/rss+xml" />
	<link>http://smackdown.blogsblogsblogs.com</link>
	<description>Smackdown!</description>
	<lastBuildDate>Tue, 22 Nov 2011 22:40:24 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Google Says &#8220;Fuck It&#8221; For The Christmas Season, Removes The Ability To Report AdSense Violations</title>
		<link>http://smackdown.blogsblogsblogs.com/2011/11/22/google-says-fuck-it-for-the-christmas-season-removes-the-ability-to-report-adsense-violations/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2011/11/22/google-says-fuck-it-for-the-christmas-season-removes-the-ability-to-report-adsense-violations/#comments</comments>
		<pubDate>Tue, 22 Nov 2011 20:57:49 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=1053</guid>
		<description><![CDATA[It has to be tough policing a program like AdSense. It must be exceptionally difficult during the holiday season, when the payoff to running scams grows so much more. It is so tough, in fact, that this year as the holiday shopping season grows near, with Black Friday just a few short days away, that [...]]]></description>
			<content:encoded><![CDATA[<div style="float:right; margin: 4px;"><img src="http://smackdown.blogsblogsblogs.com/images/googlecanthearyou.png" onmouseup="hl2l(event);" alt="Google Cant Hear You!"></div>
<p> It has to be tough policing a program like AdSense. It must be exceptionally difficult during the holiday season, when the payoff to running scams grows so much more. It is so tough, in fact, that this year as the holiday shopping season grows near, with Black Friday just a few short days away, that apparently Google has finally decided to say &#8220;fuck it&#8221;, make it easier on themselves, just remove the ability for anyone to report any violations of the program whatsoever, and allow the scammers to have a field day in the mean time.</p>
<p>While Google may want to give the impression to their stockholders and the public that they have both the search engine spam and advertising program cheaters fully under control, the truth is that they rely quite a bit on reports from the community and consumers for both spam and AdSense violations. For any spam that they find, Google asks <span id="more-1053"></span>people to submit a <a href="https://www.google.com/webmasters/tools/spamreport?hl=en" target="_blank">Google spam report</a>. At this point they require that someone log in before actually filing the report itself. This makes sense, since it helps prevent people erroneously filing large amount of spam reports against their competitors. For the AdSense violations they supply a separate form that does not require a log in, titled simply <a href="http://www.google.com/adsense/support/bin/topic.py?hl=en&#038;topic=1190500&#038;ctx=as2&#038;rd=1" target="_blank">Reporting a Violation &#8211; AdSense Help</a>. Usually I don&#8217;t run into offending sites with AdSense on them that fill me with enough of a sense of civic duty where I feel compelled to actually fill out a report, but I happened to land on one such today that actually tricked me into clicking on an ad in such a way that it really did annoy me. The page I landed on was <a href="http://www.bigsiteofamazingfacts.com/how-much-does-the-earth-weigh" target="_blank" rel="nofolow">BigSiteofAmazingFacts How Much Does The Earth Weigh</a> (yes, I was distracted by trivial shit again, don&#8217;t judge me), and in the right sidebar there was what appeared to be an embedded Youtube Video from Family Guy:</p>
<p>&nbsp;</p>
<p><img src="http://smackdown.blogsblogsblogs.com/images/howmuchdoestheearthweigh.png" onmouseup="hl2l(event);" alt="I see a video"></p>
<p>&nbsp;</p>
<p>Still distracted (of course) I clicked Play on the video, only instead of playing it suddenly brought me to a site trying to sell me bras. So, thinking I must have <em>missed</em> the rather large video in the sidebar when I tried to click on it, I hit the back button&#8230; and noticed that suddenly the video was gone altogether, and where before I had seen 2 AdSense blocks and a video, now there were 3 AdSense blocks instead:</p>
<p>&nbsp;</p>
<p><img src="http://smackdown.blogsblogsblogs.com/images/howmuchdoestheearthweigh2.png" onmouseup="hl2l(event);" alt="What video?"></p>
<p>&nbsp;</p>
<p>I hit refresh a few times but the video didn&#8217;t return. At that point I realized that it was actually a scam, so I cleared my cookies for that domain, hit refresh again, and viola, the &#8220;video&#8221; reappeared once again. At this point I was sufficiently irked that I actually decided I was going to report this asshole. It&#8217;s bad enough that a site with crap content like this is ranking #1 (the weight of the Earth is increasing each year from salt from the ocean spray? Seriously, wtf?), while people with content that is just fine are getting penalized supposedly from the Panda fallout. To add in that the guy who owns the site is ripping off advertisers as well just makes it so much worse. So, I headed on over to the AdSense Violation report to be a good citizen&#8230; and I was greeted by this:</p>
<p>&nbsp;</p>
<p><img src="http://smackdown.blogsblogsblogs.com/images/adsense-violation-report-missing.png" onmouseup="hl2l(event);" alt="What AdSense violation report?"></p>
<p>&nbsp;</p>
<p>An essentially blank page, with only a header, navigation, and a box asking me to tell AdSense how they can improve. Go figure.</p>
<p>From a financial perspective it does make sense for Google to make reporting AdSense violators more difficult, especially during the holidays. People who run scams like this actually generate Google money through the AdSense program, a program which currently has <a href="http://musictechpolicy.wordpress.com/2011/09/27/will-google-adsense-submit-the-power-of-google-to-voluntary-oversight/" target="_blank">absolutely no oversight</a>. It is exactly this lack of oversight that means that Google is the only one who knows how much, if any, of the advertising dollars are credited back to the advertisers once these scams are revealed. Hiding the violations report means that much fewer sites will be reported, more scams will be able to run for longer periods of time, and more money will wind up in Google&#8217;s pockets.</p>
<p>Is this profit motive really the reason that the report form is missing? If you ask Google I am sure they would say &#8220;of course not, we&#8217;re Google, you can trust us&#8221;. And since everything with Google is proprietary &#8220;behind closed doors&#8221; trade secrets with them, there is no way to know exactly how many violation reports suddenly went missing that apparently no one has noticed yet. My hunch though is that with something like this, as online shopping hits the holiday rush, the lack of reports that are coming in at the moment is actually too big for them not to have noticed by now, and them not fixing it for this long must be at least in some part intentional on their end.</p>
<p><strong>Update</strong>: As Jen from <a href="http://www.jensense.com/" target="_blank">JenSense.com</a> pointed out in the comments, there is another newer page available where you can actually file the report <a href="http://www.google.com/adsense/support/as/bin/static.py?page=ts.cs&#038;ts=1190500" target="_blank">located here</a>. However, I am not sure that makes it any better, and may in fact make it worse. I wound up on the empty page by actually going to Google and searching for [<a href="http://www.google.com/search?q=report+adsense+violation&#038;num=10" target="_blank">report adsense violation</a>]. The page that Jen provided is in the list, but it is down under the blank page that I found, another unhelpful blank page, and underneath a list of discussion of other people looking for the form. This begs the question&#8230; why did Google leave an otherwise empty page behind with just enough text (ie. header and title) and all of the old link juice there to outrank the &#8220;real&#8221; form? If they redesigned the site, then why not 301 redirect the old form(s) to the new one? It&#8217;s not like they don&#8217;t know how search engines work, ya know?</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2011/11/22/google-says-fuck-it-for-the-christmas-season-removes-the-ability-to-report-adsense-violations/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Matt Cutts Criticizes Deceptive Ads, Doesn&#8217;t Realize Google Is The One Serving Them</title>
		<link>http://smackdown.blogsblogsblogs.com/2011/01/30/matt-cutts-criticizes-deceptive-ads-doesnt-realize-google-is-the-one-serving-them/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2011/01/30/matt-cutts-criticizes-deceptive-ads-doesnt-realize-google-is-the-one-serving-them/#comments</comments>
		<pubDate>Sun, 30 Jan 2011 21:00:56 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=894</guid>
		<description><![CDATA[Yesterday over on Daggle.com Danny Sullivan published a post titled, Of Misleading Acai Berry Ads &#038; Fake Editorial Sites. In the article Danny discuses a rising trend of deceptive marketing practices involving fake news sites, the way they rip people off with products they are selling, and the fact that authority sites such as the [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday over on <a href="http://daggle.com" target="_blank">Daggle.com</a> Danny Sullivan published a post titled, <a href="http://daggle.com/misleading-acai-berry-ads-fake-editorial-sites-2435" target="_blank">Of Misleading Acai Berry Ads &#038; Fake Editorial Sites</a>. In the article Danny discuses a rising trend of deceptive marketing practices involving fake news sites, the way they rip people off with products they are selling, and the fact that authority sites such as the <a href="http://www.latimes.com/" target="_blank">LA Times</a> are the ones carrying these ads, lending them some credibility in the public eye. Danny states in the post that the ads showing are being served by Zedo, and that he wishes the ad network should raise it&#8217;s standards and not allow such blatantly misleading advertising:</p>
<blockquote><p>Personally, I’d like to see Zedo up its standards for the type of ads it will accept. This type of junk shouldn’t be allowed. <em>- Danny Sullivan</em></p></blockquote>
<p>He&#8217;s right, too, the ad networks <em>should</em> be policing this type of deception, by all means. Matt Cutts, Google&#8217;s head of the web spam team, agrees. He <a href="http://twitter.com/mattcutts/status/31751730140024832">tweeted about the story</a>, and also<span id="more-894"></span> commented his take on the matter in the post itself:</p>
<blockquote><p>    My favorite part of the disclaimer for those type of sites is &#8220;This website, and any page on the website, is based loosely off a true story, but has been modified in multiple ways including, but not limited to: the story, the photos, and the comments.&#8221;</p>
<p>    Oh, so I can trust the website except for the story, photos, and comments? In other words, the entire website?</p>
<p>    And if you read the disclaimer carefully, most of these sites promise a &#8220;free trial&#8221; with $1.95 in shipping, but actually set your card up with a recurring subscription. The &#8220;one weird old tip&#8221; ad that I clicked from the L.A. Times mentioned this in the fine print: &#8220;If you do not cancel within seven (7) days of the date that you enroll in the Program, we will charge the same card you provided at enrollment the non-refundable one-year membership fee of $149.95&#8243;. Then they also start charging you $12.95 a month. Grr. <em> &#8211; Matt Cutts, on deceptive &#8220;flat belly&#8221; ads</em></p></blockquote>
<p>Grr, indeed. </p>
<p>Danny also mentions in his post about how &#8220;The ad, unlike Google&#8217;s ads, doesn’t report what ad network is delivering them,&#8221; which if they did would be a form of disclosure. And Danny is right&#8230; except for one thing. Danny derived the fact that the ad was being served by Zedo by examining the url. However, if you view the source on the LA Times article and go to the spot on the page where the ad is showing, you don&#8217;t see the Zedo ad network code. The ad itself is being generated by Javascript that is being pulled from yet another ad network:</p>
<p>&nbsp;</p>
<p><img src="/images/latimes-source-doubleclick.png" onmouseup="hl2l(event);" alt="Doubleclick is the real culprit" border="0"></p>
<p>&nbsp;</p>
<p>The actual ad network that the LA Times has a relationship with, and the ones responsible for what ads show on their site, is Doubleclick. And who owns Doubeclick, you might ask? As most of you probably already know, <a href="http://www.google.com/doubleclick/" target="_blank">Google does</a>, since they <a href="http://www.nytimes.com/2007/04/14/technology/14DoubleClick.html" target="_blank">bought them back in 2007 for $3.1 billion</a>. So obviously not all of the ads Google delivers disclose what network they are from.</p>
<p>It gets better. AdSense, Google&#8217;s flagship advertising network, serves what are known as &#8220;contextual ads&#8221;, where in theory the ad targeting is based on the context of the page contents where the ad blocks are placed. Danny uses AdSense on his site, with one of the blocks being at the very top of the page. Due to the various feeds in the sidebar, the content of the article, and the title, &#8220;Acai Berry&#8221; is mentioned 8 times on that same page. Therefore it is only natural, of course, that this is what we see when we look at the ads being served on the top:</p>
<p>&nbsp;</p>
<p><a href="/images/fake-news-ads-daggle2.png" target="_blank"><img src="/images/fake-news-ads-daggle2-sm.png" onmouseup="hl2l(event);" alt="The worlds most resilient bittorrent site." border="0"></a><br />
(<em>click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>Now, can you guess where that ad leads? That&#8217;s right:</p>
<p>&nbsp;</p>
<p><a href="/images/fake-news-site2.png" target="_blank"><img src="/images/fake-news-site2-sm.png" onmouseup="hl2l(event);" alt="The worlds most resilient bittorrent site." border="0"></a><br />
(<em>click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>It&#8217;s a fake news site identical to the one Danny is discussing, with the same text, layout, and even images embedded in the &#8220;story&#8221;, with the only variation being that the one Danny landed on is &#8220;News 7&#8243;, and this one is &#8220;News 8&#8243;. </p>
<p>What makes this story particularly interesting is that recently Matt Cutts <a href="http://searchengineland.com/mr-cutts-goes-to-washington-61234" target="_blank">visited Washington D.C., lobbying the FTC</a> about Google&#8217;s integrity, trying to convince them that they don&#8217;t require government oversight, and how they could be trusted to police themselves. Google also happens to be in a very unique position to help clean up these kinds of abuses. Not only could they pull these ads from their own vast array of properties, and require their third party partners to do the same, but they could also warn publishers who use networks that continue to promote scams that their sites rankings could suffer, in the same way that they have punished websites in the past for what they said was deceptive marketing, in the form of <a href="http://www.mattcutts.com/blog/hidden-links/" target="_blank">undisclosed paid links</a>. Instead, they themselves appear to be participating in the problem, not the solution.</p>
<p>So, Matt, are you willing to back up your testimony to the FTC about Google&#8217;s integrity, and lobby within your own company to help eradicate deceptive marketing from the web? Do you feel that websites that allow deceptive advertising to be shown on their sites should have their trust revoked? </p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2011/01/30/matt-cutts-criticizes-deceptive-ads-doesnt-realize-google-is-the-one-serving-them/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>*Proof* That The New SEOmoz Tool Is At Least Half Accurate</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/09/09/proof-that-the-new-seomoz-tools-is-at-least-half-accurate/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/09/09/proof-that-the-new-seomoz-tools-is-at-least-half-accurate/#comments</comments>
		<pubDate>Thu, 09 Sep 2010 12:06:24 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[bad research]]></category>
		<category><![CDATA[coding]]></category>
		<category><![CDATA[nerdiness]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=828</guid>
		<description><![CDATA[There has been quite of bit of controversy over the past few days arising from the new LDA based tool recently released by SEOmoz. While there may have been some very well thought out, compelling arguments against giving this tool any credit whatsoever, I have to tell you that in my opinion no argument, no [...]]]></description>
			<content:encoded><![CDATA[<p>There has been quite of bit of <a href="http://www.seangolliher.com/2010/seo/185/" target="_blank">controversy</a> over the past few days arising from the <a href="http://www.huomah.com/Search-Engines/Algorithm-Matters/Google-Rankings-and-LDA.html" target="_blank">new LDA based tool</a> recently released by SEOmoz. While there may have been some very <a href="http://andybeard.eu/3122/seomoz-lda-tool.html" target="_blank">well thought out, compelling</a> arguments against giving this tool any credit whatsoever, I have to tell you that in my opinion no argument, no matter how well worded, is going to win over a good old fashioned demonstration.</p>
<p>I am a big one for testing, and test this tool I did. Now, I know, I may have voiced some opinions in the past as to my doubt of the sincerity of <a href="http://smackdown.blogsblogsblogs.com/2007/08/06/rand-fishkin-the-troll-defense/" target="_blank">Rand Fishkin</a> and the folks who run things over at <a href="http://smackdown.blogsblogsblogs.com/2007/09/21/why-seomoz-needs-a-for-entertainment-purposes-only-disclaimer/" target="_blank">SEOmoz</a>, but regardless of what I said before, for me seeing is definitely believing. I plugged both the url for the post introducing the tool itself, along with the phrase <span id="more-828"></span>[<a href="http://www.seomoz.org/blog/lda-and-googles-rankings-well-correlated" target="_blank">made up statistical bullshit</a>], into the tool&#8217;s interface, and sure as hell this is what the tool showed me:</p>
<p>&nbsp;</p>
<p><a href="/images/seomoz-tool-half-correct.png" target="_blank"><img src="/images/seomoz-tool-half-correct-sm.png" onmouseup="hl2l(event);" alt="Well, the tool was half right..." border="0"></a><br />
(<em>click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>I mean, c&#8217;mon now&#8230; those words weren&#8217;t used anywhere in the article, yet this tool was able to accurately determine that at least half* of everything that Rand said was relevant to that phrase?</p>
<p>I don&#8217;t know about you, but <em>I&#8217;m</em> convinced. <img src='http://smackdown.blogsblogsblogs.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> </p>
<p>&nbsp;</p>
<div><em>* and yes, I know, the tool guessed numbers that were way low compared to the actual quantity of bullshit in the article, but seriously&#8230; you know as well as I do that any tool that can automatically detect even trace amounts of bullshit in a post is going to be a game changer. <img src='http://smackdown.blogsblogsblogs.com/wp-includes/images/smilies/icon_razz.gif' alt=':P' class='wp-smiley' /> </em></div>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/09/09/proof-that-the-new-seomoz-tools-is-at-least-half-accurate/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>I Finally Figured Out Who Jason Calacanis Reminds Me Of</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/08/17/i-finally-figured-out-who-jason-calacanis-reminds-me-of/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/08/17/i-finally-figured-out-who-jason-calacanis-reminds-me-of/#comments</comments>
		<pubDate>Tue, 17 Aug 2010 13:32:16 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[nerdiness]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=811</guid>
		<description><![CDATA[In all of the discussions and posts about Jason Calacanis, whenever and wherever he replied to people calling him on his bullshit, it always had the same tired familiar ring to it. His statements have that tone that all scam artists and con men have utilized throughout the ages, professing their innocence despite the preponderance [...]]]></description>
			<content:encoded><![CDATA[<p>In all of the discussions and posts about <a href="http://smackdown.blogsblogsblogs.com/2010/06/21/zomg-jason-calacanis-lied-again-shocker/" target="_blank">Jason Calacanis</a>, whenever and wherever he replied to people calling him on his bullshit, it always had the same tired familiar ring to it. His statements have that tone that all scam artists and con men have utilized throughout the ages, professing their innocence despite the preponderance of evidence against them. Finally it hit me who it is that he sounds like when he is trying to defend his spammy sites&#8230; <span id="more-811"></span></p>
<p><img src="/images/mahalo-nixon-sized2.jpg" border="0" alt="Jason -I am not a spammer- Calacanis" onmouseup="hl2l(event);"></p>
<p>Thanks to James Cook of the <a href="http://toonrefugee.com/toonblog/" target="_blank">TOONrefugee Cartoon Blog</a> for the cartoon of Jason as Nixon. The image is copyrighted and may not be reproduced in any format without his permission.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/08/17/i-finally-figured-out-who-jason-calacanis-reminds-me-of/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>zOMG! Jason Calacanis Lied Again?? Shocker!</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/06/21/zomg-jason-calacanis-lied-again-shocker/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/06/21/zomg-jason-calacanis-lied-again-shocker/#comments</comments>
		<pubDate>Mon, 21 Jun 2010 12:45:58 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=799</guid>
		<description><![CDATA[Last Thursday, in response to Matt Cutts stating that he needed more than &#8220;arbitrary inurl searches&#8221; to sway him (which was in turn in response to a Hacker News submission about Mahalo and the plethora of keyword rich domains they were apparently building out) I wrote a post explaining in some detail how the latest [...]]]></description>
			<content:encoded><![CDATA[<p>Last Thursday, in response to Matt Cutts stating that he needed more than &#8220;arbitrary inurl searches&#8221; to sway him (which was in turn in response to a <a href="http://news.ycombinator.com/item?id=1433676" target="_blank">Hacker News submission</a> about Mahalo and the plethora of keyword rich domains they were apparently building out) I wrote a post explaining in some detail how the latest <a href="http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/" target="_blank">Mahalo spam is in fact spam</a>. I demonstrated in the post how Jason had developed a linkfarm which was being used as a link source back to Mahalo.com. It wasn&#8217;t just that the individual sites were all linking back to the mother site, which would in fact be normal, but also that the pages were linking back to specific pages within the main site, pages that in many cases had few, if any, links going to them aside from the ones from this linkfarm.</p>
<p>Each time it happens Matt&#8217;s defense of Mahalo spamming Google just gets more perplexing. In this latest round he started by saying that his job was not to have knee jerk reactions, as if Mahalo hadn&#8217;t already established a <a href="http://www.seobook.com/official-mahalo-com-spam-according-googles-internal-spam-documents" target="_blank">pattern of spamming</a> over a long period of time, and that Matt is pretending he hadn&#8217;t already had a talk with Jason and told him that if he didn&#8217;t raise the bar with his site that <a href="http://outspokenmedia.com/internet-marketing-conferences/ask-the-search-engines/" target="_blank">Google would take action</a> on Mahalo. From there it got even weirder &#8211; Matt looked at the linkfarm and basically told me that a) he didn&#8217;t care as long as it wasn&#8217;t passing link juice, and b) he&#8217;s the only one who could tell if that was the case.</p>
<p>I could have sworn that it was if you were caught <em>trying</em> to spam you were penalized, and you couldn&#8217;t get the penalty removed unless you <em>promised not to do it again</em>. Now, where did I get such a crazy and wild idea? Oh yeah, I remember now&#8230; <span id="more-799"></span><em><a href="http://www.mattcutts.com/blog/reinclusion-request-howto/" target="_blank">it was from Matt Cutts</a></em>:</p>
<blockquote><p>Now we come to the heart of things: what goes into a reinclusion request. Fundamentally, Google wants to know two things: 1) that any spam on the site is gone or fixed, and 2) that it’s not going to happen again. &#8211; <em>Matt Cutts on the bare essentials of a reconsideration request</em></p></blockquote>
<p>The reasons Matt gives out for defending Mahalo seem to be getting more and more creative (even if not more believable). Jason&#8217;s, on the other hand, are the same old song and dance he has been spouting since I first <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">called him on his bs</a> and demonstrated that the <a href="http://smackdown.blogsblogsblogs.com/2010/03/10/dear-jason-calacanis-this-isnt-an-absurd-microscope/" target="_blank">vast majority of his site</a> was nothing more than empty, auto-generated pages. On Thursday&#8217;s post, before he started to lose it with his <a href="http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/comment-page-1/#comment-50660" target="_blank">&#8220;fuck you losers, I&#8217;m rich&#8221;</a> tirade, Jason made this statement:</p>
<blockquote><p>We have humans write pages of at least 300 words. We don’t index 99.99% of pages with < 300 (it would have to be something unique), and we police the system to get short pages up to 300 words within 30 days. - <em>Jason Calacanis, 4 days ago</em></p></blockquote>
<p>Orly? Let&#8217;s take a look at those claims, shall we?</p>
<p>The Mahalo coupon pages are about the crappiest pages I have found on the site. When I was doing my initial investigation I stumbled across quite a few of them. My guess is that [{brand} coupon] generates AdSense blocks with a decent eCPM since they are, after all, &#8220;targeted&#8221; pages. None of the Mahalo &#8220;coupon&#8221; pages actually have any coupons, which of course means that the end user is much more likely to click on one of the ads when they land there, and more required clicks does means a poorer user experience. What content these pages do have is fluff text that gives ample opportunity for Mahalo to link back to itself, and have spammy signals that are easy to spot like when there are near-identical versions of the same topic page, usually by doing one page for &#8220;coupons&#8221; and another for &#8220;printable coupons&#8221; (and no, there is nothing to print out on those pages either). Therefore i picked those as where I would look first to point out, yet again, how Jason was simply pulling these claims out of his ass with no supporting truths behind them.</p>
<p>Digging back into my old data, from March 13th, I was able to determine that from the day the site started adding content up until that point in time Mahalo had amassed 2,655 coupon based pages. When I re-scanned and looked this time I found that there was now 16,601 of these pages. That is a huge increase for 3 months, and a ton of content to create uniquely, even if you ditch quality altogether. Mahalo currently only has a grand total of 90,494 of actual pages on that side of things, so that means 18% of the site is made up of &#8220;coupon&#8221; pages &#8211; and by that I mean coupon pages that don&#8217;t actually <em>have</em> any coupons on them.</p>
<p>What&#8217;s more, it actually looks like there is a chance that 9,932 of those pages were added last week, over a <em>3 day period</em>. How the hell do you get writers to create 9,932 pages of even crappy content, all about <em>coupons</em>, in only 3 days?</p>
<p>As I started looking into it I suddenly understood&#8230; they didn&#8217;t just ditch the quality to create those pages, they went ahead and ditched the <em>content</em>, yet again. I checked over 30 pages, and time after time I found what I found was auto-generated pages that were nothing but ads, affiliate links, and scraper feeds.</p>
<p><a href="http://www.mahalo.com/1800pools-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1800pools-coupons</a>:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-1800Pools-coupons.png" target="_blank"><img src="/images/mahalo-1800Pools-coupons-sm.png" onmouseup="hl2l(event);" alt="Mahalo 1800pools (non)coupons" border="0"></a><br />
(<em>click to view full page screenshot</em>)</p>
<p>&nbsp;</p>
<p><a href="http://www.mahalo.com/tigerdirect-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/tigerdirect-coupons</a>:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-tigerdirect-coupons.png" target="_blank"><img src="/images/mahalo-tigerdirect-coupons-sm.png" onmouseup="hl2l(event);" alt="Mahalo TigerDirect (non)coupons" border="0"></a><br />
(<em>click to view full page screenshot</em>)</p>
<p>&nbsp;</p>
<p><a href="http://www.mahalo.com/topnotchcare-com-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/topnotchcare-com-coupons</a>:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-topnotchcare.com-coupons.png" target="_blank"><img src="/images/mahalo-topnotchcare.com-coupons-sm.png" onmouseup="hl2l(event);" alt="Mahalo TopNotchCare (non)coupons" border="0"></a><br />
(<em>click to view full page screenshot</em>)</p>
<p>&nbsp;</p>
<p>Most of the pages I checked had the affiliate links provided by Savings.com, and most linked to the same two questions pages: one discussing the Outback coupons page, and one discussing &#8220;grocery coupons&#8221;&#8230; and in every case neither question had anything to do with what the actual &#8220;coupon&#8221; page was supposedly about:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-topnotchcare.com-coupons-qna-sm.png" onmouseup="hl2l(event);" alt="Mahalo TopNotchCare coupons questions?" border="0"></p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-1800Pools-coupons-qna-sm.png" onmouseup="hl2l(event);" alt="Mahalo 1800pools coupons questions?" border="0"></p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-tigerdirect-coupons-qna-sm.png" onmouseup="hl2l(event);" alt="Mahalo TigerDirect coupons questions?" border="0"></p>
<p>&nbsp;</p>
<p>For the pages that did not have Savings.com affiliate feeds on them it was because they were using as keywords the names of sites that wouldn&#8217;t actually be Savings.com publishers, like <a href="http://www.gbb.org/" target="_blank">GBB.org</a> and <a href="http://rlsforum.net/" target="_blank" rel="_nofollow">RLS Forum</a>. It looks like Jason somehow got his hands on a list of sites that for some reason or another looked like they <em>might</em> have offered some sort of coupon. These were then dumped into the database in the form of pages, and were then checked to see if they matched up with the Savings.com feed. If they did, great, if not that&#8217;s ok too, they still had AdSense on them &#8211; despite the fact that putting AdSense on pages without actual content is a <a href="" target="_blank">direct violation of Google AdSense policies</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/adsense-policies.png" onmouseup="hl2l(event);" alt="Mahalo violates AdSense policies" border="0"></p>
<p>&nbsp;</p>
<p>That&#8217;s ok though, I am sure Jason doesn&#8217;t care that he is risking the bulk of the site&#8217;s revenue stream by violating the terms of the program, since it looks like the AdSense team is giving him just as much of a pass as the spam team is.</p>
<p>In addition to the pages simply being devoid of content, Jason also uses the tactic of creating near-duplicate versions of some of these pages in order to get the most out of the long-tail phrase variations:</p>
<p><a href="http://www.mahalo.com/1and1-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1and1-coupons</a><br />
<a href="http://www.mahalo.com/1and1-internet-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1and1-internet-coupons</a><br />
<a href="http://www.mahalo.com/1and1-web-hosting-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1and1-web-hosting-coupons</a><br />
<a href="http://www.mahalo.com/1and1affiliate-com-coupons" target="_blank" rel="nofollow">http://www.mahalo.com/1and1affiliate-com-coupons</a></p>
<p>Let&#8217;s look at Jason&#8217;s statements again&#8230;</p>
<p>&nbsp;</p>
<p><strong>We have humans write pages</strong></p>
<p>&nbsp;</p>
<p>Well, no. You have humans write <em>some</em> pages, but an assload are still auto-generated. In addition to the ones shown here, Google also says that you still have 13,200 pages that you <a href="http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/" target="_blank">scraped from Wikipedia</a> in their index:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-13.2k-wikipedia-pages.png" onmouseup="hl2l(event);" alt="Mahalo Wikipedia scraped pages" border="0"></p>
<p>&nbsp;</p>
<p>Adding the above auto-generated pages in with the Wikipedia ones, that means that at this point an estimated 33% of the Mahalo content pages are scraped or auto-generated, <em>and that&#8217;s just the stuff that&#8217;s easy to find</em>. Yay footprints.</p>
<p>&nbsp;</p>
<p><strong>of at least 300 words</strong></p>
<p>&nbsp;</p>
<p>Again, no, even on the human generated pages that is not always true. Take a look, for instance, at the 1and1 page on Mahalo.com that all 4 of the above coupons reference:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-1and1.png" onmouseup="hl2l(event);" alt="Mahalo 1and1 (very) short page" border="0"></p>
<p>&nbsp;</p>
<p>Including words of 3 letters and less that page still only has 212 words of human generated content on it. I also pointed out last week that some of the Wikipedia scraped pages remained thin, such as the one on &#8220;The Alice B. Toklas Cookbook&#8221;, which has only 261 words on it.</p>
<p>&nbsp;</p>
<p><strong>We don’t index 99.99% of pages with < 300 [words]</strong></p>
<p>&nbsp;</p>
<p>Bullshit. Not one single one of the pages I examined had a &#8220;noindex&#8221; tag on it, or was blocked by robots.txt. In fact, just the opposite &#8211; every single one of them was pushed to Mahalo&#8217;s sitemap, to make it <em>easier</em> for Google to find (and index) them.</p>
<p>&nbsp;</p>
<p><strong>we police the system to get short pages up to 300 words within 30 days</strong></p>
<p>&nbsp;</p>
<p>Again, bullshit. The 1and1 page has been that way since at least March 11th:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-1and1-lastmod.png" onmouseup="hl2l(event);" alt="Mahalo 1and1 page last modified March 11th" border="0"></p>
<p>&nbsp;</p>
<p>And the Alice B. Toklas one since March 12th:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-alice-b-toklas-lastmod.png" onmouseup="hl2l(event);" alt="Mahalo Alice B. Toklas page last modified March 12th" border="0"></p>
<p>&nbsp;</p>
<p>So Jason, please, enough with the bs. Quit claiming stuff that simply is not true, especially when it&#8217;s <em>so</em> damn easy to disprove what you say. I still have no idea why it is that Matt Cutts is choosing to ignore your spam, but to the rest of us it&#8217;s as plain as day. And no, Jason&#8230; going in now and trying to clean it up in no way changes the fact that you spammed in the first place.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/06/21/zomg-jason-calacanis-lied-again-shocker/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
		</item>
		<item>
		<title>Need Help Understanding The Latest Mahalo Spam?</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/#comments</comments>
		<pubDate>Thu, 17 Jun 2010 12:30:38 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=764</guid>
		<description><![CDATA[On Tuesday of this week someone posted the following question to the Hacker News website: How long has Mahalo been using keyword domains like this? The link in the story points to a search in Google, [inurl:tip_guidelines mahalo]. The results of this query show a list of somewhere between 180 and 270 sites (Google doesn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>On Tuesday of this week someone posted the following question to the Hacker News website: <a href="http://news.ycombinator.com/item?id=1433676" target="_blank">How long has Mahalo been using keyword domains like this?</a> The link in the story points to a search in Google, [<a href="http://www.google.com/search?q=inurl:tip_guidelines+mahalo&#038;hl=en&#038;filter=0" target="_blank">inurl:tip_guidelines mahalo</a>]. The results of this query show a list of somewhere between 180 and 270 sites (Google doesn&#8217;t show all of them, just the first 184 or so) all belonging to Mahalo.com, all keyword rich domains, all using the Mahalo Answers platform, and all covering material that Mahalo.com already covers. I am sure most of you are familiar with that fact that Google labels sites that have little or no content and are designed to drive affiliate conversions as <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66361" target="_blank">Thin Affiliate sites</a>:</p>
<blockquote><p>These sites usually have no original content and may be cookie-cutter sites or templates with no unique content. &#8211; <em>Google Webmasters Tools Help, on sites Google does not like</em></p></blockquote>
<p>These sites that Mahalo has started churning out, all that were apparently created just this year, would appear to be the AdSense version of the classic &#8220;thin affiliate&#8221; website.</p>
<p>I showed Matt Cutts the link to the search itself, and asked if he thought that the list of sites<span id="more-764"></span> being returned looked spammy to him. His reply?</p>
<p>&nbsp;</p>
<p><img src="/images/matt-defends-mahalo-spam.png" onmouseup="hl2l(event);" alt="@mvandemar it's not about arbitrary inurl searches that would sway me; it's impact on users (e.g. better/worse diversity) that matters most."></p>
<p>&nbsp;</p>
<p>So, yet again, for some inexplicable reason the head of the Google Web Spam team appears to be defending Jason Calacanis. Despite the <a href="http://www.seobook.com/official-mahalo-com-spam-according-googles-internal-spam-documents" target="_blank">numerous posts</a> that <a href="http://www.seobook.com/black-hat-seo-case-study" target="_blank">clearly demonstrate</a> that <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">Jason Calacanis is spamming Google</a>, Matt is saying that he needs <em>proof</em> that these new sites are spammy. Pretty much anyone else in the industry can tell at a glance what is going on, but Google&#8217;s foremost expert on the subject of spam still needs help seeing it.</p>
<p>That&#8217;s fine. Let&#8217;s go ahead and take a deeper look at what is happening behind the scenes with these sites. Here&#8217;s one of the new sites that deals with cooking, <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Acooking-questions.com" target="_blank">cooking-questions.com</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-off-site-cooking.png" onmouseup="hl2l(event);" alt="The new mahalo cooking site"></p>
<p>&nbsp;</p>
<p>336 pages indexed there. So, was the <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Amahalo.com%2Fanswers+%2Bcooking" target="_blank">&#8220;cooking&#8221; topic on Mahalo.com</a> not covered then&#8230;?</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-on-site-cooking.png" onmouseup="hl2l(event);" alt="The old mahalo cooking site"></p>
<p>&nbsp;</p>
<p>6,680 pages currently on Mahalo Answers, all about cooking. Looking at that list you can tell from the first two listings that Mahalo has one category for &#8220;Cooking Recipes&#8221;, and a completely separate category for &#8220;Cooking <em>and</em> Recipes&#8221;, whereas most non-spam directories, blogs, etc. would simply have picked one or the other. If they are struggling with diversity on the main site, how is it adding the same topic to an entirely new site is going to help? </p>
<p>Just for the record, the new, smaller site also seems to see the need to have both of those nearly identical categories as well:</p>
<p>&nbsp;</p>
<p><img src="/images/cooking-questions.com-cooking-recipes.png" onmouseup="hl2l(event);" alt="Cooking Recipes category"></p>
<p><img src="/images/cooking-questions.com-cooking-and-recipes.png" onmouseup="hl2l(event);" alt="The completely different Cooking *and* Recipes category"></p>
<p>&nbsp;</p>
<p>Also, in case you&#8217;re thinking that maybe it&#8217;s the individual questions themselves on these new sites that are &#8220;diverse&#8221;, one of the questions on the new site is &#8220;how-do-you-know-when-corn-on-the-cob-is-fully-cooked&#8221;. Mahalo.com already has 10 pages on corn on the cob, 3 of which are: &#8220;how-long-do-you-cook-corn-on-the-cob&#8221;, &#8220;how-many-minutes-do-you-think-is-the-perfect-time-to-cook-corn-on-the-cob&#8221;, and &#8220;what-is-the-perfect-amount-of-time-to-cook-corn-on-the-cob&#8221;. Yeah, that&#8217;s diversity for ya.</p>
<p>Let&#8217;s look at some of the other sites, see if maybe that first one was just a fluke. The new Mahalo site for Star Wars, <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Astarwarsanswers.com" target="_blank">starwarsanswers.com</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-off-site-star-wars.png" onmouseup="hl2l(event);" alt="The new mahalo Star Wars site"></p>
<p>&nbsp;</p>
<p>308 results. So, is <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Amahalo.com%2Fanswers+%2B%22star+wars%22" target="_blank">Star Wars not handled on Mahalo.com</a> then?</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-on-site-star-wars.png" onmouseup="hl2l(event);" alt="The Star Wars on Mahalo.com"></p>
<p>&nbsp;</p>
<p>1,200 results, so obviously this isn&#8217;t an example of &#8220;better diversity&#8221; either. Similar results for their Oklahoma City site, <a href="http://www.google.com/search?num=100&#038;hl=en&#038;safe=off&#038;q=site%3Awww.oklahomacityanswers.com" target="_blank">oklahomacityanswers.com</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-off-site-ok-city.png" onmouseup="hl2l(event);" alt="Oklahoma City questions?"></p>
<p>&nbsp;</p>
<p>177 pages, and yet again, <a href="http://www.google.com/search?q=site:mahalo.com/answers+%2B"Oklahoma+City"&#038;num=100&#038;hl=en&#038;safe=off" target="_blank">not new subject matter</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-on-site-ok-city.png" onmouseup="hl2l(event);" alt="Nothing new on OK City"></p>
<p>&nbsp;</p>
<p>632 pages. On Oklahoma City questions. How did people manage to come up with that many questions about Oklahoma City on a site that hardly anyone ever actually goes to on purpose?</p>
<p>The answer, of course, is that they didn&#8217;t. On the new site there are exactly <a href="/images/oklahomacityanswers.com-sitemap-question.xml" target="_blank">11 actual questions</a> as of this writing. The other 166 pages are (mostly empty) category pages, member pages, and other fluff that each of these <strong>&#8220;cookie-cutter sites or templates with no unique content&#8221;</strong> come with by default. Since this is all crap content then, why would Calacanis even bother with them? There is little to no interest in many of these subjects&#8230; what game could Jason possibly be playing here?</p>
<p>I know! Let&#8217;s all play:</p>
<p>&nbsp;</p>
<p><img src="/images/linkfarmville.png" onmouseup="hl2l(event);" alt="Lets Play LinkFarmVille!"></p>
<p>&nbsp;</p>
<p>Back in February I discussed how the internal pages on Mahalo.com <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">get almost no natural links</a>. Almost all of their PageRank (and thus ranking power) comes from either employees linking from their blogs or it comes from scrapers. With hundreds of thousands pages to support, however, relying on that kind of sketchy link profile has a good chance of not panning out in the long run. Someone must have pointed this out to Jason, and now these mini-sites are his solution. If you can&#8217;t build enough quality pages for people to want to link to you naturally, build sites and link to yourself. Look at, for instance, the cooking site&#8217;s question on &#8220;What is the most common way to make Angel Food Cake?&#8221;:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-cooking-angel-food-cake.png" onmouseup="hl2l(event);" alt="Angel food cake question"></p>
<p>&nbsp;</p>
<p>Within just the question itself, including the title, there are 14 links, <em>all pointing back to Mahalo.com topic pages</em>. If we examine the link profile for those pages being linked to, we see that these linkfarms that Jason is putting up provide the majority of their link juice. For example, the first page linked to is Malao&#8217;s page on &#8220;cake&#8221;:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-links-to-cake.png" target="_blank"><img src="/images/mahalo-links-to-cake-sm.png" onmouseup="hl2l(event);" alt="Cake links" border="0"></a><br />
(<em>click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>Of the 10 links listed, 4 come from scrapers, one is a url shortener that is actually on a Mahalo.com page, and the other 5 all come from Jason&#8217;s linkfarm: cooking-questions.com, parenting-questions.com, and foodiequestions.com. You can see the same is true throughout when you check the other links in the question, like the ones to the Mahalo pages on &#8220;chocolate&#8221; [<a href="http://search.yahoo.com/search?p=link%3Ahttp%3A%2F%2Fwww.mahalo.com%2Fchocolate+-site%3Amahalo.com" target="_blank">link:http://www.mahalo.com/chocolate -site:mahalo.com</a>] and &#8220;baking&#8221; [<a href="http://search.yahoo.com/search?p=link%3Ahttp%3A%2F%2Fwww.mahalo.com%2Fbaking+-site%3Amahalo.com" target="_blank">link:http://www.mahalo.com/baking -site:mahalo.com</a>].</p>
<p>Some of the pages being linked to don&#8217;t exist any more, since Jason did go in and delete some content in response to a good talking to Matt Cutts gave him a couple of months ago&#8230; but that&#8217;s ok, the links are there just in case they ever decide to build those pages back out.</p>
<p>The embedded links aren&#8217;t the only ones on these sites, either. You also have the sidebar links pointing back to Mahalo:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-cooking-site-sidebar-links.png" onmouseup="hl2l(event);" alt="Sidebar link spam too"></p>
<p>&nbsp;</p>
<p>Of course all of these little sites need some link juice themselves, in order to stay indexed and pick up some long tail rankings, which in turn with garner them some scraper backlink juice of their own. Jason&#8217;s got that covered too. For some of the pages he has redirected old pages from Mahalo.com, which acts as a reciprocal link exchange between Mahalo and the new sites. On iphoneqna.com, he has added a &#8220;Recommended Q&#038;A Communities&#8221; block to the sidebar, which adds in an extra level of interlinking to the equation:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-iphoneqna-sidebar-spam-links.png" onmouseup="hl2l(event);" alt="Sidebar link spam two, too"></p>
<p>&nbsp;</p>
<p>And of course there&#8217;s always the fall back tactic of Jason simply linking to some of these sites from his <a href="http://calacanis.com/2010/04/17/cool-site-for-folks-with-facebook-questions-knowledge-httpbit-ly9npq0n/" target="_blank" rel="nofollow">personal blog</a> and <a href="http://jasoncalacanis.tumblr.com/post/528859313/cool-site-for-folks-with-facebook-questions-knowledge" target="_blank" rel="nofollow">Tumblr account</a> to give them a little extra kick.</p>
<p>Like much of what Jason does, nothing in these tactics is new. Google has directly addressed the non-acceptability of this type of link building for quite a while now:</p>
<blockquote><p>&#8230; some webmasters engage in link exchange schemes and build partner pages exclusively for the sake of cross-linking, disregarding the quality of the links, the sources, and the long-term impact it will have on their sites. This is in violation of Google&#8217;s webmaster guidelines and can negatively impact your site&#8217;s ranking in search results. &#8211; <em><a href="http://www.google.com/support/webmasters/bin/answer.py?answer=66356" target="_blank">Google Webmaster Tools Help page on Link schemes</a></em></p></blockquote>
<p>It is spam, pure and simple. There is no added user experience, no diversity, and no reason for all of these sites, <em>including Mahalo.com</em>, to not get banned from Google.</p>
<div><em>Original <a href="http://www.flickr.com/photos/44702485@N04/4530409989/" target="_blank">FarmVille image</a> attribution goes to <a href="http://www.flickr.com/photos/44702485@N04/">tarikgore1</a>.</em></div>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/06/17/need-help-understanding-the-latest-mahalo-spam/feed/</wfw:commentRss>
		<slash:comments>80</slash:comments>
		</item>
		<item>
		<title>WordPress Hacking, Matt Mullenweg, And Some Screwed Up Priorities</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/06/01/wordpress-hacking-matt-mullenweg-and-some-screwed-up-priorities/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/06/01/wordpress-hacking-matt-mullenweg-and-some-screwed-up-priorities/#comments</comments>
		<pubDate>Tue, 01 Jun 2010 12:45:42 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[bad research]]></category>
		<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[priorities]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[Wordpress]]></category>
		<category><![CDATA[WTF]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=694</guid>
		<description><![CDATA[I clean WordPress installations for people who have been hacked. I can help fix non-Wordpress sites as well, but since often times the way people find me is through the guide I wrote on how to fix WordPress after you&#8217;ve been hacked it turns out that&#8217;s what they need me to do for them a [...]]]></description>
			<content:encoded><![CDATA[<p>I clean WordPress installations for people who have been hacked. I can help fix non-Wordpress sites as well, but since often times the way people find me is through the guide I wrote on <a href="http://smackdown.blogsblogsblogs.com/2008/06/24/how-to-completely-clean-your-hacked-wordpress-installation/" target="_blank">how to fix WordPress</a> after you&#8217;ve been hacked it turns out that&#8217;s what they need me to do for them a fair bit of the time. I have a process that I go through, and a specific set of things that I look for on every WordPress installation that I work on to make sure that it is indeed hacked, and to determine how bad the damage is. Different intrusions can leave various symptoms and clues as to how the hacker got in, and knowing this can be helpful in diagnosing the situation.</p>
<p>One of the hacks that has been around for a few years<span id="more-694"></span> has the symptoms of having an index.php in the root installation that has the following code in it:</p>

<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>16
17
18
19
20
21
</pre></td><td class="code"><pre class="php" style="font-family:monospace;"><span style="color: #009933; font-style: italic;">/** Loads the WordPress Environment and Template */</span>
<span style="color: #b1b100;">if</span> <span style="color: #009900;">&#40;</span><span style="color: #990000;">isset</span><span style="color: #009900;">&#40;</span><span style="color: #000088;">$_GET</span><span style="color: #009900;">&#91;</span><span style="color: #0000ff;">'license'</span><span style="color: #009900;">&#93;</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
	<span style="color: #339933;">@</span><span style="color: #b1b100;">include</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">'http://wordpress.net.in/license.txt'</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
<span style="color: #009900;">&#125;</span> <span style="color: #b1b100;">else</span> <span style="color: #009900;">&#123;</span>
	<span style="color: #b1b100;">require</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">'./wp-blog-header.php'</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
<span style="color: #009900;">&#125;</span></pre></td></tr></table></div>

<p>The index.php found in a clean installation of WordPress does not have an IF statement in it, and the section that is actually delivering the hack is the statement telling the page to include() license.txt located on wordpress.net.in. If you try to view that page in a browser what you currently would see is version 3 of the <a href="http://www.gnu.org/licenses/gpl.html" target="_blank">GNU General Public License</a>. However, if the file is called as an include(), it instead delivers code that acts as a back door and allows, I believe, the injection of an erroneous administrator into the WordPress installation. From there pretty much anything can be done.</p>
<p>The wordpress.net.in domain itself, which is being used to deliver this hack, was originally registered back in April 2007, supposedly to some guy in Massachusetts:</p>
<p><img src="/images/wordpress.net.in-a.png" onmouseup="hl2l(event);" alt="Original registration information"></p>
<p>A little over 3 years later, after at least 31 changes in domain registration information, the domain is still supposedly registered to some guy in  Massachusetts, although not to the same person:</p>
<p><img src="/images/wordpress.net.in-d.png" onmouseup="hl2l(event);" alt="Current registration information"></p>
<p>The domain has been used for these hacking activities the entire time it has existed. There has never been a legitimate site residing on it.</p>
<p>By checking the IP address of where the site is now, it appears to be hosted by a firm operating under the name Extended Host Inc, which according to <a href="http://whois.domaintools.com/extendedhost.com" target="_blank">their whois information</a> is located in Canada. However, they don&#8217;t seem to actually have a website where someone could get hosting services, and their IP is <a href="http://www.db.ripe.net/whois?form_type=simple&#038;full_query_string=&#038;searchtext=194.110.161.180&#038;do_search=Search" target="_blank">located over in Amsterdam</a>. This is what <a href="http://www.spamhaus.org/sbl/sbl.lasso?query=SBL60306" target="_blank">Spamhaus had to say</a> about Extended Host:</p>
<p><img src="/images/spamhaus-extended-host.png" onmouseup="hl2l(event);" alt="Extended Host - spam, scam, cybercrime hosting"></p>
<p>While none of this actually tells us anything about who the real owner of the domain is, what it does tell us is that there is very little that can be done about it. The hosting company is a scam, so there is no one to contact to have them take the website down. Even if the current bandwidth provider did decide to take action against them they could simply move to a new provider. There are plenty of hacker and spammer friendly hosts out there to choose from. The sad truth is that there is little that honest concerned netizens such as you or I can do to take a website like this offline. It is a shame, too, because taking the website down would mean that the hack it is being used for, across however many thousands of WordPress installations out there that are infected, would no longer be effective. It would nullify the damage, even for those blog owners who do not know that they are hacked.</p>
<p>No, there really isn&#8217;t much that you or I could do about that site&#8230; but there is actually someone who could do something, if they so wanted. You see, the domain in question, wordpress.net.in, consists entirely of the WordPress trademark, a trademark owned by Automattic Inc, the company founded by one <a href="http://ma.tt/" target="_blank">Matt Mullenweg</a>, the original creator of WordPress. According to their website they are quite aware of the fact that <a href="http://wordpress.org/about/domains/" target="_blank">using WordPress in a domain is a trademark violation</a>, and trademark violations are pretty much the one thing that allows one person to legitimately take a domain from another person without their consent. According to the <a href="http://www.icann.org/en/udrp/udrp-policy-24oct99.htm" target="_blank">ICANN Domain Name Dispute Resolution Policy</a> there are 3 conditions that must be met for this to happen:</p>
<ul>
<li style="list-style: none;">(i) your domain name is identical or confusingly similar to a trademark or service mark in which the complainant has rights; and</li>
<li style="list-style: none;">(ii) you have no rights or legitimate interests in respect of the domain name; and</li>
<li style="list-style: none;">(iii) your domain name has been registered and is being used in bad faith.</li>
</ul>
<p>In this case 1 is a no-brainer&#8230; the domain name is an exact match for the trademark in question. I am pretty sure that unless the owner of the domain name turns out to be one of the other founders of Automattic number 2 will pass the test without question as well. As for requirement 3, I don&#8217;t think you could really get more &#8220;bad faith&#8221; than deliberately using the domain name to hack other websites. If Matt actually cared he would have no problem wrestling control of that domain name from whoever it is that actually owns it, and shutting it down altogether, and yet he has done nothing about it for over <em>three years</em> now. Apparently Matt is so obsessively concerned with his <a href="http://tomuse.com/matt-mullenweg-automattic-wordpress-themes-plugins-developer/" target="_blank">crusade against non-GPL WordPress plugin and theme developers</a> that he doesn&#8217;t have the time or energy to go after someone using his trademark to hack software he wrote.  In his campaign against the evils of non-GPL he has even gone so far as to start <a href="http://wordcamphowto.wordpress.com/2010/05/19/fyi-im-taking-over-as-central-liaison/" target="_blank">banning people from speaking at or sponsoring Wordcamp events</a> if they are &#8220;non-GPL-compliant&#8221; (a determination, btw, which is solely made by Matt and company, with apparently no procedure in place for appeals):</p>
<blockquote><p>They are welcome to attend, but WordCamps may not have non-GPL-compliant people as organizers, sponsors, or speakers. Events that want to move forward and include such individuals in these roles may need to use a name other than WordCamp if the appropriate adjustments can&#8217;t be made. &#8211; <em>Jane Wells, WordCamp &#8220;central liaison&#8221;</em></p></blockquote>
<p>That&#8217;s right&#8230; if you wish to put the effort into organizing an event that promotes WordPress in your community, and you take the time to raise the money yourself to do so, but you happen to be a person who directly sells premium themes, then you damn well better not use their trademarked name for the event. If you want to spread viruses, hack servers, and promote spam, however&#8230; hell, feel free to use their core trademark in your domain name. It&#8217;s not like they are going to actually do anything about it.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/06/01/wordpress-hacking-matt-mullenweg-and-some-screwed-up-priorities/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>The Mahalo Paid Link Evidence Trail</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/03/13/the-mahalo-paid-link-evidence-trail/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/03/13/the-mahalo-paid-link-evidence-trail/#comments</comments>
		<pubDate>Sat, 13 Mar 2010 22:53:45 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=636</guid>
		<description><![CDATA[Matt Cutts asked me in a Sphinn comment the following question relating the my post on paid links on Mahalo.com: &#160; &#160; Let&#8217;s ignore, for the moment anyways, that for absolutely any other website on the internet with the evidence that I presented the process would most likely entail the webmaster proving their innocence rather [...]]]></description>
			<content:encoded><![CDATA[<p>Matt Cutts asked me in a <a href="http://sphinn.com/story/144419/#75691" target="_blank">Sphinn comment</a> the following question relating the my post on <a href="http://smackdown.blogsblogsblogs.com/2010/03/12/jason-calacanis-screw-you-google-now-ill-sell-links-too/" target="_blank">paid links on Mahalo.com</a>:<span id="more-636"></span></p>
<p>&nbsp;</p>
<p><img src="/images/matt-cutts-paid-link-question.png" onmouseup="hl2l(event);" alt="mvandemar, the site you mention is whatsyourconundrum.com, but that site is hosted on ns1.mahalo.com (the same nameserver as Mahalo). That makes it a cross-link, but what's the evidence that it's a paid link?" border="0"></p>
<p>&nbsp;</p>
<p>Let&#8217;s ignore, for the moment anyways, that for absolutely any other website on the internet with the evidence that I presented the process would most likely entail the webmaster proving their innocence rather than the person reporting the paid link proving that they are guilty, and that they would have to do so <em>after</em> they actually got banned. Before I get started, keep in mind&#8230; whatsyourconundrum.com is <em>not</em> Conundrum Wine&#8217;s main website. Their actual website is <a href="http://conundrumwine.com/" target="_blank">http://conundrumwine.com/</a>. The site that is being linked to from Mahalo is a marketing device, a link or PageRank funnel, something that acts as an intermediary link bait, or simply to increase brand recognition. This is actually a great way to help increase your exposure without having to put gimmicky items on your company&#8217;s professional website. That main website is <em>not</em> hosted on Mahalo.com&#8217;s servers. </p>
<p>Now, Matt is correct. The whatsyourconundrum.com website is indeed hosted on Mahalo servers. Without even looking at the IP address, we know by looking at the <a href="http://whois.domaintools.com/whatsyourconundrum.com" target="_blank">whois record</a> the dns servers are ns1.mahalo.com and ns2.mahalo.com:</p>
<p>&nbsp;</p>
<p><img src="/images/whatsyourconundrum-whois.png" onmouseup="hl2l(event);" alt="whatsyourconundrum.com whois" border="0"></p>
<p>&nbsp;</p>
<p>The reason, I am guessing, is that apparently whatsyourconundrum.com is powered by a white label version of Mahalo&#8217;s Answers. What we also see in the whois record is that the domain is owned by Camus Vinyards, and that their administrative email is domainadmin@caymus.com. Following that hint, we see that the caymus.com domain is owned by Caymus Vinyards:</p>
<p>&nbsp;</p>
<p><img src="/images/caymus-whois.png" onmouseup="hl2l(event);" alt="caymus.com whois" border="0"></p>
<p>&nbsp;</p>
<p>Caymus.com is in turn being hosted on ewinerysolutions.com servers. Ewinerysolutions.com is being hosted <a href="http://whois.domaintools.com/ewinerysolutions.com" target="_blank">on their own servers</a>, so the dns trail stops there. If we go to the <a href="http://www.ewinerysolutions.com/" target="_blank">eWinery Solutions</a> website, we can see that what they do is offer marketing services to wineries:</p>
<blockquote><p>The most profitable and fastest-growing segment of the wine industry today is the consumer-direct channel. eWinery Solutions offers you the freedom to market your wines easily and effectively by creating a one-on-one dialogue with your best customers.</p></blockquote>
<p>Jot on over to their <a href="http://www.ewinerysolutions.com/portfolio" target="_blank">Portfolio page</a>, and we see that Conundrum Wine is in fact a client of eWinery Solutions:</p>
<p>&nbsp;</p>
<p><img src="/images/ewinery-clients.png" onmouseup="hl2l(event);" alt="Conundrum is an eWinery client" border="0"></p>
<p>&nbsp;</p>
<p>Now, does eWinery offer their services to Conundrum out of the goodness of their heart? Somehow I doubt it. Did Jason agree to lease and host the white-label version of Mahalo Answers to them because of all of the warm fuzzies he knew it would give him? Of course not. In fact, he even refers to this deal as being Mahalo&#8217;s first &#8220;client&#8221;:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-wine-client.png" onmouseup="hl2l(event);" alt="Mahalos first client?" border="0"></p>
<p>&nbsp;</p>
<p>Whether the header link was part of the original deal negotiated to set up whatsyourconundrum.com, or a separate cash transaction, the fact remains that this was not a merit based link. It has it&#8217;s roots in a commercial transaction that most likely occurred between someone at Mahalo and the people doing Conundrum&#8217;s marketing. What I also can&#8217;t say is whether or not the clean sitewide links pointing to whatsyourconundrum.com on Jason&#8217;s private blog was part of the deal or not&#8230; but there are 5 of those links there as well, under the heading of &#8220;Daily Reads&#8221;:</p>
<p>&nbsp;</p>
<p><img src="/images/calacanis.com-paid-links.png" onmouseup="hl2l(event);" alt="Calacanis paid links?" border="0"></p>
<p>&nbsp;</p>
<p>Hopefully Matt won&#8217;t ask me to get all ninja and obtain a copy of the actual invoice&#8230; that might prove a little problematic for me. <img src='http://smackdown.blogsblogsblogs.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/03/13/the-mahalo-paid-link-evidence-trail/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Jason Calacanis: Screw You Google, Now I&#8217;ll Sell Links Too</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/03/12/jason-calacanis-screw-you-google-now-ill-sell-links-too/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/03/12/jason-calacanis-screw-you-google-now-ill-sell-links-too/#comments</comments>
		<pubDate>Fri, 12 Mar 2010 13:45:43 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=605</guid>
		<description><![CDATA[By now Google has to be getting more than a little embarrassed about the behavior of Mr. Jason Calacanis and his site, Mahalo.com. Aaron Wall did a very well written piece explaining how Mahalo Makes Black Look White and the spammy techniques they were employing. This isn&#8217;t the first time Aaron has blogged about Mahalo [...]]]></description>
			<content:encoded><![CDATA[<p>By now Google has to be getting more than a little embarrassed about the behavior of Mr. Jason Calacanis and his site, Mahalo.com. Aaron Wall did a very well written piece explaining how <a href="http://www.seobook.com/black-hat-seo-case-study" target="_blank">Mahalo Makes Black Look White</a> and the spammy techniques they were employing. This isn&#8217;t the first time Aaron has <a href="http://www.seobook.com/mark-cubans-mahalo-wants-your-blood-and-gets-it-too" target="_blank">blogged about Mahalo</a> either, and talked about exactly how <a href="http://www.seobook.com/why-mahalo-and-other-content-scrapers-render-googles-spam-team-flaccid">this makes Google look bad</a>. For those who might not know, I <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">have also</a> <a href="http://smackdown.blogsblogsblogs.com/2010/03/08/jason-calacanis-makes-matt-cutts-a-liar/" target="_blank">been blogging</a> <a href="http://smackdown.blogsblogsblogs.com/2010/03/08/mahalo-com-meet-the-new-spam-worse-than-the-old-spam/" target="_blank">about this</a> <a href="http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/" target="_blank">recently</a>.</p>
<p>While Google will <a href="http://groups.google.com/groups/search?hl=en&#038;ie=UTF-8&#038;q=banned+google" target="_blank">ban smaller websites</a> from their search results or from AdSense on a whim, usually it takes heavier coverage<span id="more-605"></span> for bigger players to get hit. Like, for instance, when <a href="http://blogoscoped.com/archive/2006-02-01-n31.html" target="_blank">Google Blogoscoped outed BMW</a> for spammy doorway pages. The story spread relatively fast, and Google wound up banning BMW for a short period of time. So when someone has a &#8220;special&#8221; relationship with Google, as Jason appears to have, and keeps getting second (and third, and fourth, and fifth&#8230;) chances to clean up their act, yet continues to snub their nose in Google&#8217;s general direction, it makes one wonder. Google has to be at least somewhat concerned that someone in the mainstream media will eventually notice and start to ask why someone like Jason would continually be allowed to get away with this stuff. Considering the unfairness and lack of impartiality of letting a site like Mahalo slide while punishing so many smaller sites for lesser offenses, my guess is that Google doesn&#8217;t actually want to discuss their reasons behind ignoring it. And so, each time Google does nothing, Jason decides to push things a little more.</p>
<p>This time it looks like Jason has decided to go ahead and violate the rules <a href="http://www.mattcutts.com/blog/selling-links-that-pass-pagerank/" target="_blank">closest to Matt Cutts heart</a>. While the layout I am showing here will change with time, since the header contains rotating articles, currently if you go to Mahalo at the top of every page (on the non-Answers side, anyways) you will see the following block of stories that Mahalo is highlighting (usually this area contains trending or hot news items):</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-ad-in-header.png" target="_blank"><img src="/images/mahalo-ad-in-header-sm.png" onmouseup="hl2l(event);" alt="Conundrum?" border="0"></a><br />
(<em>Click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>While technically speaking just by looking at them there is nothing to distinguish one of those &#8220;featured stories&#8221; from another, the one that doesn&#8217;t actually belong is the third one in, with the caption &#8220;Best Pickup Line Ever?&#8221;. The reason that one is different from all the rest is simple&#8230; it&#8217;s not a Mahalo featured story at all, and has nothing to do with anything going on in the news. It&#8217;s an ad. It is a paid link that Mahalo.com sold, one that leads to a site built to <a href="http://www.whatsyourconundrum.com/love-and-relationships/best-pickup-line-ever" target="_blank" rel="nofollow">market a wine company</a>. There is nothing visual to distinguish or disclose that <em>as</em> an ad, and if we view the source of the page&#8230;</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-ad-in-header-source.png" target="_blank"><img src="/images/mahalo-ad-in-header-source-sm.png" onmouseup="hl2l(event);" alt="Wheres the nofollow...?" border="0"></a><br />
(<em>Click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>&#8230; we can see that there is nothing <em>machine readable</em> (ie. nofollow attribute) to distinguish it as an ad, either.</p>
<p>Matt Cutts has been very, very clear on his take on sites that sell links that pass PageRank, or ones that don&#8217;t disclose that they are in fact ads: they are spamming. No if, ands, or buts about it, they deserve to get punished. In fact, he has even gone to far to state that in his view undisclosed paid links <a href="http://www.mattcutts.com/blog/hidden-links/" target="_blank">violate FTC guidelines</a>.</p>
<p>So, Matt, recently you put out a <a href="http://www.mattcutts.com/blog/calling-for-link-spam-reports/" target="_blank">call for link spam reports</a>, including &#8220;paid links that pass PageRank&#8221;. Really, though, is there any point in reporting Mahalo to you? Are you going to actually take action, or, like you have done with Jason&#8217;s spam in the past, will you continue to simply look the other way? Any other site would be faced with a penalize/ban first, make nice nice with Google later. Hell, with the BMW site you penalized them <em>after</em> they cleaned it up, just to make an example. I get the strangest feeling, though, that won&#8217;t happen with Mahalo&#8230;</p>
<p>Can you at least say <em>something</em> about this issue&#8230;?</p>
<p><strong>Update:</strong> in response to a comment below and a question posed by Matt Cutts about what makes me believe that this is indeed a paid link:</p>
<p><a href="http://smackdown.blogsblogsblogs.com/2010/03/13/the-mahalo-paid-link-evidence-trail/">The Mahalo Paid Link Evidence Trail</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/03/12/jason-calacanis-screw-you-google-now-ill-sell-links-too/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Jason Calacanis&#8217; Backup Plan For Replacing Content: Steal It From Wikipedia</title>
		<link>http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/</link>
		<comments>http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/#comments</comments>
		<pubDate>Thu, 11 Mar 2010 18:11:08 +0000</pubDate>
		<dc:creator>Michael VanDeMar</dc:creator>
				<category><![CDATA[blogthropology]]></category>
		<category><![CDATA[Cuttisms]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[lackofmeds]]></category>
		<category><![CDATA[scams]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Social Media]]></category>

		<guid isPermaLink="false">http://smackdown.blogsblogsblogs.com/?p=590</guid>
		<description><![CDATA[/sigh Ok Jason, we get it, you&#8217;re desperate. But stealing content from Wikipedia in order to replace what you deleted? Come on! I am flipping through Mahalo.com today, just seeing if you&#8217;re keeping your word or not, when all of a sudden I notice this huge amount of pages with odd names that somehow I [...]]]></description>
			<content:encoded><![CDATA[<p>/sigh</p>
<p>Ok Jason, we get it, you&#8217;re desperate. But stealing content from Wikipedia in order to replace what you deleted? Come on!</p>
<p>I am flipping through Mahalo.com today, just seeing if<span id="more-590"></span> you&#8217;re keeping your word or not, when all of a sudden I notice this <em>huge</em> amount of pages with odd names that somehow I missed before:</p>
<p><a href="http://www.mahalo.com/cgs-20625" target="_blank" rel="nofollow">http://www.mahalo.com/cgs-20625</a><br />
<a href="http://www.mahalo.com/cgs-9896" target="_blank" rel="nofollow">http://www.mahalo.com/cgs-9896</a><br />
<a href="http://www.mahalo.com/cp-154-526" target="_blank" rel="nofollow">http://www.mahalo.com/cp-154-526</a><br />
<a href="http://www.mahalo.com/daa-1097" target="_blank" rel="nofollow">http://www.mahalo.com/daa-1097</a><br />
<a href="http://www.mahalo.com/daa-1106" target="_blank" rel="nofollow">http://www.mahalo.com/daa-1106</a></p>
<p>These are all nothing more than content stolen from Wikipedia. Your version:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-cgs-20625.png" onmouseup="hl2l(event);" alt="Mahalo CGS-20625" border="0"></p>
<p>&nbsp;</p>
<p><a href="http://en.wikipedia.org/wiki/CGS-20625" target="_blank">Wikipedia&#8217;s version</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/wikipedia-cgs-20625.png" onmouseup="hl2l(event);" alt="Wikipedia CGS-20625" border="0"></p>
<p>&nbsp;</p>
<p>You even hyperlinked the same internal linking scheme to the same topics Wikipedia does, regardless of whether or not those pages exist on Mahalo.com. How the hell can you claim this is original content when it is nothing more than cut and paste? I mean, wtf, you just claimed that <a href="http://smackdown.blogsblogsblogs.com/2010/02/22/apparently-jason-calacanis-knows-hes-spamming-he-just-thinks-its-no-big-deal/" target="_blank">Wikipedia is nothing more than a free for all</a> a little under 3 weeks ago&#8230; does the content somehow take on some magical value after you scrape it and host in on your servers, trying to pass it off as something one of your users wrote? THIS is the &#8220;<a href="http://twitter.com/Jason/status/10187602318" target="_blank" rel="nofollow">our users build it</a>&#8221; content that you were referring to? </p>
<p>&nbsp;</p>
<p><img src="/images/jason-calacanis-our-users-build-it.png" onmouseup="hl2l(event);" alt="Our users build it" border="0"></p>
<p>&nbsp;</p>
<p>News flash, Jason&#8230; your users and Wikipedia&#8217;s users are <em>not</em> the same people. You lumped Squidoo into the same category back then as well. If I look close enough, will I find content stolen from them too?</p>
<p>Now, to be fair, maybe this content existed all along but was much, much less noticeable when you had all of those <em>other</em> pages of fluff in there, but now that you have <a href="http://smackdown.blogsblogsblogs.com/2010/03/10/dear-jason-calacanis-this-isnt-an-absurd-microscope/" target="_blank">deleted 78% of that side of Mahalo</a>, these scraped pages are practically impossible to miss. None of the pages I looked at were actually indexed in Google, but it looks like <a href="" target="_blank">at least 27,900</a> of them currently are:</p>
<p>&nbsp;</p>
<p><a href="/images/mahalo-wikipedia-pages-indexed.png" target="_blank"><img src="/images/mahalo-wikipedia-pages-indexed-sm.png" onmouseup="hl2l(event);" alt="27,900 indexed pages scraped from Wikipedia on Mahalo" border="0"></a><br />
(<em>Click to enlarge</em>)</p>
<p>&nbsp;</p>
<p>Considering that Google only has a portion of those indexed, and that at last count there were only 128,324 pages left on that side of your site, that means that at minimum over 21% (and in all likelihood 30% &#8211; 40%) of the remaining pages on Mahalo.com are these scraped ones. Is that really what you want on your &#8220;Human Powered Search Engine&#8221;&#8230;?</p>
<p>There are 2 major differences between the original content and your version. 1) The original content cites the sources directly there on the page, whereas Mahalo does not, and 2) Mahalo is using each and every one of these scraped pages to automatically create 2 (and sometimes 3) additional contentless pages under the guise of questions being asked anonymously, questions that no one ever actually asked, but that are there solely for the purpose of bolstering your indexed page count in Google. The first question asked of every drug is <a href="http://www.mahalo.com/answers/health/what-are-the-side-effects-of-cgs-20625" target="_blank" rel="nofollow">What are the side effects of {insert drug}?</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-answers-cgs-20625.png" onmouseup="hl2l(event);" alt="What are the side effects of CGS-20625" border="0"></p>
<p>&nbsp;</p>
<p>And the second is always <a href="http://www.mahalo.com/answers/health/where-can-i-get-cgs-20625" target="_blank" rel="nofollow">Where can I get {insert drug}</a>:</p>
<p>&nbsp;</p>
<p><img src="/images/mahalo-answers-cgs-20625-b.png" onmouseup="hl2l(event);" alt="Where can I get CGS-20625" border="0"></p>
<p>&nbsp;</p>
<p>I also saw a &#8220;Who makes {insert drug}&#8221; question here and there as well. These are empty questions, asked by a bot, that for the most part will never get answered (or even looked at) and were never intended to. Three plus free pages (two of them <em>completely</em> devoid of content) for the price of one scraped page. Jason, seriously, do you really think you are slick doing this?</p>
<p>By the way, a huge number of those pages are less than 100 words in length, yet none of them have the noindex tag. Regardless of of what the length is, however, it&#8217;s not what you are claiming Mahalo.com <em>is</em> Jason, it&#8217;s more fluff. Wikipedia already gives us those articles. You add nothing to the interwebs by copying them. In all seriousness you should just go through and delete them all.</p>
]]></content:encoded>
			<wfw:commentRss>http://smackdown.blogsblogsblogs.com/2010/03/11/jason-calacanis-backup-plan-for-replacing-content-steal-it-from-wikipedia/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
		</item>
	</channel>
</rss>

