Pantek Library
Hosting Provided By
CybrHost
High Speed Hosting

RE: Possible DOS against search engines?

From: Rob Shein <shoten(at)starpower.net>
Date: Mon Feb 03 2003 - 18:45:00 EST


I see a few problems here. Problems are listed below each concept, for clarity, and assume a decent webcrawler.

>
> 1. You create a generator for fake web pages, whose purpose

I doubt this would make even a slight dent in things. Seeing as how webcrawlers already walk the entire internet, with its various languages, enormous expanse, and endless misspellings, I think anything you could create would end up being a drop in the bucket.

>
> 2. You place that generator somewhere and submit the URL to
 

But they don't crawl indefinitely. What do they do if they hit two sites that link to each other? They notice this, and move on.

> 4. Upon adding the gathered words to the search engine's
 

But who would search on them?

> - craft fake words so that they attack a specific hash

Do you need help?X

This would be noticed by the search engine long before it became a real problem, and it would be addressed. This is how they deal with many things, including people who try to influence their ranking using various means.  

> - craft fake words so that they disbalance a b-tree

It is my belief that, again, they will notice the impact on their database and quickly address the issue. What about a bit of code that states that if more then 5% of the words in a page are unique in the database, that that page is dropped?

> If the above-mentioned things are feasible, then one can even

No, but I'd notice an abrupt lack of space on my web server. And the sudden oddly-named URLS in my logs. And the corresponding oddly-named pages in my site. And if I didn't notice, my hosting provider would.

> Please note that the setup described differs from the
Received on Mon Feb 3 18:54:26 2003

This archive was generated by hypermail 2.1.8 : Wed Aug 23 2006 - 14:07:37 EDT


Contact Us  Legal Notices  Order Services Online 
Pantek Home  Privacy Policy  IT news  Site Map  Pantek Library