Scrapebox Footprints

Scrapebox has many uses from harvesting proxies, mass-commenting on blogs and other platforms, checking for live links, mass trackbacks, rss, and more; one of the powerful ways to utilize Scrapebox is to get better familiarized with the awesomeness of Scrapebox Footprints. Though it may seem confusing at first, looking through scrapebox footprints are one of the main reasons why I use scrapebox on a daily basis.

By utilizing scrapebox footprints, you can find .edu blogs, mediawiki wiki’s, .gov forums, and pretty much anything you put your mind to; if a website, blog, or any other platform has some kind of a footprint that is visible on all default packages (often the case), there are ways of harvesting those sites for use with other programs, bots or macros. There are countless of different combinations of scrapebox footprints that you can utilize to benefit your job as an Internet Marketer, for pretty much any backlinking situation you can think of.

ThinkPaid’s Super Custom Scrapebox Footprints List: Want One?

Now that you have the basics down, would you like a large harvested list of custom footprints to make your life easier? Of course you do! You can download the file below!

 

Utilizing Scrapebox Footprints are often overlooked, but you can find many golden nuggets by effectively using Scrapebox. If you wan’t a basic introduction on How to use Scrapebox, feel free to check out the guide here at Thinkpaid.

Lets start with Basic Scrapebox Footprints:

Lets first start off with one of the most basic type of Scrapebox Footprints, and that is for the use with different blog platforms. If you want to filter for blog platforms, in the 1st corner, select custom footprint (like shown in the picture below):

Scrapebox Footprints: Blog Platforms

“powered by wordpress” | “Powered by MovableType” | “Powered by BlogEngine.net” | “powered by b2evolution”

What this will do is it will search for blogs that are only WordPress, MovableType, BlogEngine, or b2evolution. This is great as these are the most common ones you would be commenting on with Scrapebox; these are one of the Scrapebox footprints that will allow you to just comment on those blog platforms, without messing about and accidentally finding non-blog sites in search results. The | separator between each quoted footprint is an OR operand. What it basically means is that it will need to find one of the four different types of platforms for it to successfully be harvested.

Scrapebox Footprints

This is where you would enter the Scrapebox Footprints

Due to an unfortunate loss, the linkdomain function in Yahoo no longer works, so many features and footprints like finding a competitors backlinks, or even filtering it down to finding a competitors .edu or .gov backlinks is no longer available; the reason for this is because Yahoo! has recently phased out Yahoo Site Explorer, leaving a bit of a void in some of the more common and powerful footprints that were once possible. Fear not, there are plenty more great Scrapebox Footprints out there, and once you understand the basics you can go out and find some of your own.

If you want to be more specific on trying to find default-layout blogs that often don’t have things like akismet set up (or any other type of anti-spam plugin), then you may be able to set up something like the following set of scrapebox footprints. These could be used individually.

“post a comment” -”comments closed”
“leave a comment” -”comments closed”
“leave a reply” -”comments closed”
“post a reply” -”comments closed”
“leave your reply” -”comments closed”
“leave a response” -”comments closed”
“leave your response” -”comments closed”
-”comments closed”

If you’ve tried this and managed to get some results: congratulations, you’ve used one of your first scrapebox footprints. These footprints are crucial when using scrapebox efficiently, especially when coupled with keywords, or other data. For example, if you wanted to find blogs based on moveable type but wanted to use custom footprints to give you more control over things, you can use the following custom footprints:

“Powered by Movable Type”
“Powered by Movable Type” “Post A Comment”
“Powered by Movable Type” “Leave A Comment”
“Powered by Movable Type” “Name” “Email Address” “URL”
“Powered by Movable Type” “Comments (You may use HTML tags for style)”
“Powered by Movable Type Pro”
“Powered by Movable Type 5.02″
“You may use HTML tags for style” “Movable Type”

Scrapebox Footprints: .Edu/.Gov blogs on WordPress

This will show you how powerful some of the Scrapebox Footprints can be. You can use one of the following to find .edu or .gov blogs that use the wordpress platform. The “powered by wordpress” bit is a very common footprint that is left on most wordpress blogs that have the default template + other templates that like to link back to wordpress (which is also somewhat common).

site:.edu “Powered by wordpress”
site:.gov “Powered by wordpress”

The fun doesn’t stop with wordpress, as there are tons of available blog platforms out there. The majority of them have some kind of default footprint that links back to their website and/or is shown as a badge that has the platforms name on it, even if its in the footer. From there, you can find loads of sites; the majority of people do not realize how easy it is to leave manual (but powerful) comments on blogs with low OBL’s that are current, especially if you have a useful comment.

Scrapebox Footprints: Forum Platforms

While may not be as useful for the core operations of Scrapebox, you could be in search for loads of forums out there that have their own default footprints. Do they exist? Of course! Thats how those big sites can get backlinks and recognition to increase their “social wealth” and viral awareness. Keep in mind these are only a small selection of forum footprints.

“Powered by BlogEngine”
“Powered by Blogsmith”
“powered by Typepad”
“powered by scoop”
“Powered by PHPbb”
“Powered by vBulletin”
“Powered by SMF”
“powered by Simple Machines”
“powered by punBB”
“powered by expressionengine” “yourkeyword”
“Powered by Tagbox”
“Powered by DRBGuestbook”
“powered by bbPressPowered”
“Powered by PunBB” register.php
“powered by vbulletinpowered”

You get the idea. What is more interesting is you can use the site:.edu footprint coupled with the forum footprints to find your very own .edu/.gov forums using your very own footprint. That is only the beginning of what Scrapebox can do, and this is sadly overlooked by many internet marketers. This method is still a viable way of finding genuine .edu/.gov links that are more powerful than the ones that you can spam. 1 high quality .edu link is better than 20 crappy/spammy .edu links, any day and any time.

Hopefully this gives you a small idea of what Scrapebox is capable of and how powerful such a small piece of software can be. If you haven’t grabbed it, you should consider it. If you do not wish to spend $97 on it, you can always use the Scrapebox Discount available on Thinkpaid, which can literally cut the price in half for you.

Scrapebox Footprints: Article Directories

Lets say you have Article Marketing Robot and wish to scrape yourself a new list of directories to use. There are various directory platforms that work, each of which that have their own structure and footprints. These are one of the scrapebox footprints that you can use for ArticleMS.

ArticleMS + inurl:/articles/
ArticleMS + inurl:/art/
ArticleMS + inurl:/category/
ArticleMS + inurl:/articlems/
ArticleMS + inurl:/artms/

You can use these separately, or you can use the | operator (which acts like “OR” for those with programming knowledge) which can be used like near the beginning of the post (first example). This is a neat way to scrape yourself a new list of Article Directories which could get you a nice collection of new and indexed article directories (which is rather important).

Another quick set of scrapebox footprints for article directories is found directly below. Keep in mind that this is only a small list.

“Powered By: Article Friendly”
inurl:/submitart.php
“View My Bio & Articles”
“Powered by Article Dashboard”
“Excellent Articles in our Database from”
“Use of our service is protected by our”
inurl:/index.php?page=mostpopulararticles
“Powered by ArticleMS”
“Powered by WordPress · Using Article Directory plugin”
“registered authors in our article directory.”
“Proudly powered by WordPress”
inurl:/submission-policy/
inurl:/wp-login.php?action=register

Learning Scrapebox Footprints makes you more Efficient with the Program

With Scrapebox, it is still a very powerful and useful tool for any Internet Marketer. Due to Google Panda, you cannot scrapebox spam 15,000 blogs + link them to your money site and expect big things to happen. This is no longer the case and will most likely get your sandboxed (though some exceptions apply, and there are those that believe that once you get out of the sandbox you will be stronger than before – I do not agree nor disagree with that ;) ).

Scrapebox, like Article Marketing Robot, is a program that is only as good as the person who uses it. There are countless guides out there and I intend to write a rather in-depth guide (with videos) in the next few days or so (perhaps maybe a week). I’ll go over every aspect of Scrapebox, in order to help you out with learning how to use Scrapebox. For now, this will conclude this little scrapebox footprints tutorial, but if you have any questions or footprint requests, then show me some love and comment below!

Comments

  1. Lars says

    I just ended up finishing this post. That is a huge list of scrapebox footprints!

    Just by looking at a few, I’ve come up with some rather nice combinations :)

  2. says

    It is getting harder and harder to scrape using these now common footprints. Is it possible Google and Yahoo are starting to limit searches using these footprints?

    • says

      I wouldn’t necessarily say that Google or Yahoo are trying to limit searches with these scrapebox footprints…

      It’s more so websites and blogs posting massive lists which are diluting the results of what it used to be (which is why I put mine in a file).

      You have to be a bit more creative (especially when it comes to finding inurl matches).

      For example, I look for register pages that have specific footprints. It helps get rid of all the crud.

  3. says

    Very useful post. I just bought Scrapebox recently and I’m still learning how to harvest URLs to comment on manually and additional article directories for my AMR. Your post helped and gave me some ideas. Thanks!

Leave a Reply