Feb 2005 Issue of Linux Magazine?

Christopher Schmidt crschmidt at crschmidt.net
Mon Mar 7 20:05:01 EST 2005


From: Christopher Schmidt <crschmidt at crschmidt.net>
To: discuss at gnhlug.org
Subject: Re: Feb 2005 Issue of Linux Magazine?
Date: Mon, 7 Mar 2005 20:08:43 -0500
X-FOAF: http://crschmidt.net/foaf.rdf


On Mon, Mar 07, 2005 at 11:30:49AM -0500, David Berube wrote:
> Hello,
> 
> It's entitled "Ruby Web Spiders - Build your own spider for automated
> web searches", and you can get it at:
> 
> http://www.linux-magazine.com/issue/51/Ruby_Web_Spiders.pdf
> 
> It talks about using Ruby to spider the web - in particular, using Ruby 
> to spider a Livejournal weblog, and how the techniques can be adapted. 
> Tonight I'll be speaking on a similar topic at CentraLUG: creating 
> custom webservers in Ruby. You might find that interesting.

For the record, using the techniques discussed in that article is a great
way to get your IP address banned from LiveJournal.

For the proper way to get information out of LiveJournal, you should use
the "bot friendly" interfaces the site provides, as discussed on
http://www.livejournal.com/bots/ .
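As a rough illustration of what "bot friendly" access tends to mean in practice, here is a minimal sketch in Python. It assumes, without quoting the actual policy, that a site wants two things: a User-Agent that says who is running the bot and how to reach them, and a modest request rate. The names build_request, Throttle, and fetch, the bot name, and the interval are all made up for this example; check the bots page for the real rules.

```python
import time
import urllib.request


def build_request(url, contact_email, bot_name="example-bot/0.1"):
    """Build a request whose User-Agent says who is running the bot."""
    # Assumption: the site wants a User-Agent that identifies the bot
    # and gives its operator's contact address.
    user_agent = f"{bot_name} (contact: {contact_email})"
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


class Throttle:
    """Enforce a minimum delay between successive requests."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        # min_interval is in seconds; the right value is site-specific.
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Sleep just long enough to respect min_interval, then record the time."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


def fetch(url, contact_email, throttle):
    """Fetch one URL politely and return the response body as bytes."""
    throttle.wait()
    request = build_request(url, contact_email)
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read()
```

Something like fetch(feed_url, "you@example.com", Throttle(5.0)) would then be called once per feed, where feed_url is a syndication feed or other documented interface rather than the rendered HTML pages.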

(Sorry. As a longtime LiveJournal hacker, before I got told to fuck off,
I'm a bit more knowledgeable about the site than most, and this kind of
thing is a pretty common problem that people don't realize exists.)

-- 
Christopher Schmidt

