• Home
  • About
  • Piqq.us Invite Feed
  • Links
  • RSS CULT
  • A 10 Minute Trick to Reduce your Blog Spam

    Add to Mixx!

    I was recently involved in a heated discussion on link spam and the like, and I found myself actually giving a few suggestions(if a bit sarcastically) on how to block blog spam, and I thought I’d share one with you guys, as it’s one that I haven’t done myself. It won’t do much to stop blackhats who already have your blog URL, but hey, it’ll do it a lot to stop future ones from finding you. This also helps prevent splogs from duping your content.

    What are Footprints?
    For you new readers, footprints are static pieces of text that show up on a piece of software. Where most content is dynamic in most software(wordpress, openld, phpld, etc), there is almost always some hardcoded phrase that shows up in most all of them. Searching for this phrase brings back a list of sites with the software installed.

    How Your Blog Gets Found for Link Spam
    Search engines, how else? All scripts have footprints, so we just look for a specific software set. Most of this post will be going through the footprints Wordpress leaves, and eliminating them. I’m sure I’ll miss a few(and perhaps not want to mention a few), so use your own imagination and judgement.

    But XMCP, Why Are You Helping These People?! It Hurts Blackhats!
    I will smack the first whiney blackhat that comes to me about this post. Ok. These are the people that track down your sites, and submit your comments on clean proxies as spam. You should not WANT to link spam them, and you’re an idiot if you do.
    The Types of Footprints We’re After:
    Within any software installation, a few words or phrases remain constant. We’ll break this down into 3 categories of footprints

    1. Exact Match Footprints - Phrases that show up almost 100% uniquely on sites running the given set of software. “Powered By XYZ” for example
    2. Combination Footprints - Single words or small phrases that all show up on a lot of different sites. When combining these words together though, and searching for them all at once, the pool of sites is greatly narrowed down, offering us a fairly targeted selection
    3. Filename Footprints - Some scripts have filenames that exist almost exclusively in sites running that particular piece of software. Combining these with a combination footprint is also quite common.

    Dealing with Your Exact Footprints
    Exact footprints are the easiest to handle, but they’re everywhere. And they all need to be eliminated. “Powered By Wordpress”? Yeah. Kill it. I don’t think Wordpress is having any problems ranking for their own software title name.
    Now, they’re not all that simple. That “About Me” page you never edited? Change it. The affiliate program that was built into your web cart software but never setup? All the default phrases are footprints (Example). Run through your files, and just delete them. No harm done. Having backlinks to people that designed your theme are much lower risk, but still, you might want to kill it, depending on what the licensing agreement says, and if you care.
    Make sure to look carefully. Things like “You can use these tags:” wordpress can be an issue.

    Dealing With You Combination Footprints
    Combination footprints are a tricky beast, but thankfully not as common. Swap out the 1 or 2 most common ones, and move on. Inserting spaces can even help. For example, a search for blogroll returns 32 times as many results as “Blog Roll”. Common category names, headings, and sections are all suspect here.

    Dealing With Your Filename Footprints
    Wordpress isn’t as bad as many with these. But still, start cleaning. Any filename that is somewhat unique to the software, open up the code and start swapping it out. Bear in mind as well, that included in this category are any arguments/query strings that come AFTER the filename. They’re all searchable, and are actually normally more unique than the filename itself?

    An Example: Some of SlightlyShadySEO’s Footprints:
    All text highlighted in red are exact footprints, everything in blue is a combination footprint
    SlightlyShadySEO’s wordpress footprintsYes, I’m FULLY aware Archives is spelled wrong. No, I’m not changing it. Consider it a footprint I don’t have, that you probably do!

    With that, I’m off to finish eliminating a few of my own footprints. I don’t get much link spam, so probably just the exact ones for now :-)
    I’m not going to work to hard at it though. This entry actually contains several footprints in the content…so I’m kinda screwed. Haha.

    Enjoy!
    -XMCP

    Share and Enjoy(You know you want to): These icons link to social bookmarking sites where readers can share and discover new web pages.
    • Technorati
    • StumbleUpon
    • Reddit
    • PlugIM
    • Blue Dot
    • Bumpzee
    • Simpy
    • Netscape
    • del.icio.us
    • blogmarks
    • Spurl
    • Furl
    • Fark
    • TailRank
    • BlinkList
    • NewsVine

    8 Responses to “A 10 Minute Trick to Reduce your Blog Spam”

    1. Seller Girl says:

      Hey dude - if you want to spell archives wrong, but not seem to you could always hide it:
      Archivies
      css:
      . killFootprint { display: hidden; }
      (I haven’t tested it)

      But I guess you probably know that :)

    2. imagenesis says:

      Yo, this has nothing to do with your post but since I dont know your contact info, I’ll post it here:

      http://jpipes.com/presentations/mysql_perf_tuning.pdf

      Some nice reading material IMO.

    3. admin says:

      @SellerGirl
      Yeah, that would work. But it got mentioned so many times in the early days of this blog(I think I deleted some comments, but some are still there), and it became a matter of principle.
      After this entry, I actually bolded the extra “I”
      Thanks for the tip though ;) It would definitely work, and nice to see ya commenting!

    4. Seller Girl says:

      Oh, dear - that didn’t work, I didn’t realise.
      Obviously the markup should be:
      Archiv[span class=”killFootprint “]i[/span]es
      Maybe that will show…
      But I guess you got the idea. :)
      I can appreciate the stubborness of principle :)

      Doing good work with the blog - thanks for replying :)

    5. Tomasz Mazur says:

      I am using akismet for this, and it works just great!

    6. admin says:

      @Tomasz
      Ironic, because your comment got blocked by akismet ;)

    7. Why Google is Broken | SEO ROI Services. Need an SEO Rockstar? Contact Us. Prices $3K-$10K/mo says:

      […] Ads, manual evaluation is de rigueur. The computer calculations that discount paid links seek to footprint paid links by looking for indicative nearby text such as “Sponsors,” “Ads” […]

    8. RankRanker (Rank Ranker “Free SEO System and Link Exchange With Extra Earning”) SPAM | SEO ROI Services. Need an SEO Rockstar? Contact Us. Prices $3K-$10K/mo says:

      […] why annoy someone who’ll give you reputation management issues (like 1 and 1’s issues)? Silly RankRanker (aka Rank Ranker aka webmaster@rankranker.com). What do you guys think about […]

    Leave a Reply

    XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <code> <em> <i> <strike> <strong>

    Marketing & SEO Blogs - Blog Top Sites
    © Slightly Shady SEO, All Rights Reserved. Scrape me, and I will eat your soul.