Disallowing bots (was Re: Man, they'll try anything to hack your system...)

Bill McGonigle bill at bfccomputing.com
Fri Jan 27 11:58:00 EST 2006


On Jan 27, 2006, at 10:41, Larry Cook wrote:

> How do you keep out the bad ones, the ones that ignore robots.txt?

The bad ones usually _read_ robots.txt to figure out where the "juicy 
stuff" is.

So you can do:

  Disallow: /robottrap.html

And then have something tail your access log and instantly iptables 
anything that accesses /robottrap.html.

-Bill

-----
Bill McGonigle, Owner           Work: 603.448.4440
BFC Computing, LLC              Home: 603.448.1668
bill at bfccomputing.com           Cell: 603.252.2606
http://www.bfccomputing.com/    Page: 603.442.1833
Blog: http://blog.bfccomputing.com/
VCard: http://bfccomputing.com/vcard/bill.vcf




More information about the gnhlug-discuss mailing list