Google MD5 Hash Search Engine
I came across an interesting combination of blackhat SEO and "knowledge belongs to the people" hacker attitude. It’s about storing unique MD5 hashes in the title of numerous pages spidered by Google . You may call it an implementation of an hash search engine using Google.
Unlike other implementations, the aim here is to get Google to store the word and associated hash. We do this by putting them into the title where it will always be stored by Google’s spider. Dynamically generating them means they’re only there when Google’s spider wants them.
If I read it right they present different content to humans vs. search engines, isn’t this a cloaking blackhat SEO technique?
Anyway it’s a nice PoC of the ubiquity of Google search, but I still think that GData’s free online MD5 cracker kicks ass with it’s 168,678,430 unique entries.
Thank you for reading this post. You can now Read Comments (5) or Leave A Trackback.
Print This Post
Post Info
This entry was posted on Friday, June 22nd, 2007 . Tagged with:You can follow any responses to this entry through the Comments Feed. You can Leave A Comment, or A Trackback.
Previous Post: Acunetix Web Vulnerability Scanner 5 Review »
Next Post: New Whitelist Based Squid Redirector – White Trash »
Read More
Related Reading:- Animated Presentation on Sony PSN Hack
- ArcSight Tip #1 – arcsight managersetup notification test
- I’m a CISSP
- Operation:Payback or Social Vendetta is Here
- I got owned by Malware Destructor 2011 Virus
- New Downtime Cost Calculator by Storagepipe.com. What if ?
- Securing Your Network from Web Threats
- My Twitter Notes on 2010-07-25
- New NetWitness Visualize : Welcome To The Future!
- My Twitter Notes on 2010-07-18




June 23rd, 2007 21:01
It’s a rather interesting concept: leaving storage and indexing up to google, instead of having to handle it all yourself. So all you do is generate the hashes as google requests the page. The only problem with it is, I don’t know:
1) how far Google is willing to crawl: a search for “site:www.nth-dimension.org.uk” reveals some 7K results for the domain name.
2) Poking around in the URL, it looks like they only have about 2400 words (change the startline value)
3) There’s one that is more methodical: http://reverse.me.uk/, but a search for that domain reveals a measily 49 results… so somehow Google will stop early if it all looks the same? I’m not sure.
4) I’ve tried to make my own, just for the fun of it. My blog is searchable by google, so hopefully there’ll be more than 49 results… we’ll see? Here’s the hash generator: http://darwin.servehttp.com/cgi-bin/hash.pl
June 24th, 2007 05:18
No not at all, everyone sees the same content. The point here is that Google doesn’t always cache the body of the web page, but will always cache the title. Hence, when we present results for any given word, we present them in both areas. The ideas is that should we ever feel like closing the PoC, Google will continue to remember
.
October 16th, 2007 21:00
but how will you binefit from storing them into google
October 17th, 2007 11:41
@photoshop
It helps a lot because even if the site goes offline, google cache will still have the hash
July 14th, 2008 16:25
but how will you binefit from storing them into google
It helps a lot because even if the site goes offline, google cache will still have the hash
Actually, the benefit is the storage and the ability to search hundreds of gigabytes, even terabytes of result data from the index.
And I don’t like the idea to disturb one of the best internet service with a great team behind.