A 10 Minute Trick to Reduce your Blog Spam
|
| |
![]() | |
I was recently involved in a heated discussion on link spam and the like, and I found myself actually giving a few suggestions(if a bit sarcastically) on how to block blog spam, and I thought I’d share one with you guys, as it’s one that I haven’t done myself. It won’t do much to stop blackhats who already have your blog URL, but hey, it’ll do it a lot to stop future ones from finding you. This also helps prevent splogs from duping your content.
What are Footprints?
For you new readers, footprints are static pieces of text that show up on a piece of software. Where most content is dynamic in most software(wordpress, openld, phpld, etc), there is almost always some hardcoded phrase that shows up in most all of them. Searching for this phrase brings back a list of sites with the software installed.
How Your Blog Gets Found for Link Spam
Search engines, how else? All scripts have footprints, so we just look for a specific software set. Most of this post will be going through the footprints Wordpress leaves, and eliminating them. I’m sure I’ll miss a few(and perhaps not want to mention a few), so use your own imagination and judgement.
But XMCP, Why Are You Helping These People?! It Hurts Blackhats!
I will smack the first whiney blackhat that comes to me about this post. Ok. These are the people that track down your sites, and submit your comments on clean proxies as spam. You should not WANT to link spam them, and you’re an idiot if you do.
The Types of Footprints We’re After:
Within any software installation, a few words or phrases remain constant. We’ll break this down into 3 categories of footprints
- Exact Match Footprints - Phrases that show up almost 100% uniquely on sites running the given set of software. “Powered By XYZ” for example
- Combination Footprints - Single words or small phrases that all show up on a lot of different sites. When combining these words together though, and searching for them all at once, the pool of sites is greatly narrowed down, offering us a fairly targeted selection
- Filename Footprints - Some scripts have filenames that exist almost exclusively in sites running that particular piece of software. Combining these with a combination footprint is also quite common.
Dealing with Your Exact Footprints
Exact footprints are the easiest to handle, but they’re everywhere. And they all need to be eliminated. “Powered By Wordpress”? Yeah. Kill it. I don’t think Wordpress is having any problems ranking for their own software title name.
Now, they’re not all that simple. That “About Me” page you never edited? Change it. The affiliate program that was built into your web cart software but never setup? All the default phrases are footprints (Example). Run through your files, and just delete them. No harm done. Having backlinks to people that designed your theme are much lower risk, but still, you might want to kill it, depending on what the licensing agreement says, and if you care.
Make sure to look carefully. Things like “You can use these tags:” wordpress can be an issue.
Dealing With You Combination Footprints
Combination footprints are a tricky beast, but thankfully not as common. Swap out the 1 or 2 most common ones, and move on. Inserting spaces can even help. For example, a search for blogroll returns 32 times as many results as “Blog Roll”. Common category names, headings, and sections are all suspect here.
Dealing With Your Filename Footprints
Wordpress isn’t as bad as many with these. But still, start cleaning. Any filename that is somewhat unique to the software, open up the code and start swapping it out. Bear in mind as well, that included in this category are any arguments/query strings that come AFTER the filename. They’re all searchable, and are actually normally more unique than the filename itself?
An Example: Some of SlightlyShadySEO’s Footprints:
All text highlighted in red are exact footprints, everything in blue is a combination footprint
Yes, I’m FULLY aware Archives is spelled wrong. No, I’m not changing it. Consider it a footprint I don’t have, that you probably do!
With that, I’m off to finish eliminating a few of my own footprints. I don’t get much link spam, so probably just the exact ones for now ![]()
I’m not going to work to hard at it though. This entry actually contains several footprints in the content…so I’m kinda screwed. Haha.
Enjoy!
-XMCP





December 28th, 2007 at 6:16 pm
Hey dude - if you want to spell archives wrong, but not seem to you could always hide it:
Archivies
css:
. killFootprint { display: hidden; }
(I haven’t tested it)
But I guess you probably know that
December 29th, 2007 at 2:55 am
Yo, this has nothing to do with your post but since I dont know your contact info, I’ll post it here:
http://jpipes.com/presentations/mysql_perf_tuning.pdf
Some nice reading material IMO.
December 29th, 2007 at 3:28 am
@SellerGirl
It would definitely work, and nice to see ya commenting!
Yeah, that would work. But it got mentioned so many times in the early days of this blog(I think I deleted some comments, but some are still there), and it became a matter of principle.
After this entry, I actually bolded the extra “I”
Thanks for the tip though
December 29th, 2007 at 7:59 am
Oh, dear - that didn’t work, I didn’t realise.
Obviously the markup should be:
Archiv[span class=”killFootprint “]i[/span]es
Maybe that will show…
But I guess you got the idea.
I can appreciate the stubborness of principle
Doing good work with the blog - thanks for replying
December 29th, 2007 at 8:48 am
I am using akismet for this, and it works just great!
December 29th, 2007 at 7:03 pm
@Tomasz
Ironic, because your comment got blocked by akismet
July 8th, 2008 at 10:27 pm
[…] Ads, manual evaluation is de rigueur. The computer calculations that discount paid links seek to footprint paid links by looking for indicative nearby text such as “Sponsors,” “Ads” […]
July 8th, 2008 at 10:30 pm
[…] why annoy someone who’ll give you reputation management issues (like 1 and 1’s issues)? Silly RankRanker (aka Rank Ranker aka webmaster@rankranker.com). What do you guys think about […]
February 18th, 2009 at 5:06 am
[…] I also need to give credit where it’s due - my friend XMCP introduced me to footprinting with his post A 10 Minute Trick to Cut Blog Spam (by Eliminating Footprints). […]