Dec 28 2006

Google-Mining

Look at you, you came back! We knew you just couldn't keep away for long. Why not make visiting us easy by subscribing to our RSS feed (or the audio RSS feed). Stick around and be sure to speak up and post a comment or two!

Google’s databases are estimated to contain over 20 billion objects. This includes everything from web pages to text files, pdf’s, spreadsheets, etc. Everything that has been found by Googlebot, while traversing the Internet Link Graph.

As you can imagine there are all sorts of treasures hidden within this massive data pile. The trick is formulating effective queries that will deliver the goods. I call this Google-Mining and below you will find some of my best tools for striking gold.

A Word of Caution: As well-intentioned as Googlebot is, sometimes it indexes information that was not intended for public perusal. Please respect other’s privacy and digital property! Don’t save or even view any data that is obviously intended to be private. This includes media objects, personal files, business data, etc.

Finding Applications

Use the following search syntax to locate applications:

“parent directory ” /appz/ -xxx -html -htm -php -shtml -opendivx -md5 -md5sums

inurl:[manufacturer ie.Sun] filetype:iso

Finding MP3’s

Use the following search syntax to locate MP3’s:

“parent directory ” MP3 -xxx -html -htm -php -shtml -opendivx -md5 -md5sums

?intitle:index.of? mp3

Finding Games

Use the following search syntax to locate games:

“parent directory ” Gamez -xxx -html -htm -php -shtml -opendivx -md5 -md5sums

Finding Movies

Use the following search syntax to locate movies:

“parent directory “Xvid -xxx -html -htm -php -shtml -opendivx -md5 -md5sums

Post comments RSS feed Like this post? Subscribe to the RSS feed and get lots more!

One Response to “Google-Mining”

  1. mining news says:

    Great post google mining is the gold mining of this generation.

Leave a Reply

Ja, ich möchte bei Kommentaren benachrichtigt werden!