HTML scraping in python

Paul Lussier p.lussier at comcast.net
Thu Jun 11 10:20:13 EDT 2009


Paul Lussier <p.lussier at comcast.net> writes:

> I stumbled up BeautifulSoup and am now trying to get that and the
> mechanize module installed.

Okay, I've got that installed.  I've figured out enough BS to get me a
single row of the table into a list comprised of elements like:
'<td>data</td>'

Now I just need to figure out how to strip the html off of the data.
I could do it by writing a regexp, I suppose, but I'm hoping there's a
method which already does this.

-- 
Seeya,
Paul


More information about the gnhlug-discuss mailing list