Wanted: OSS to monitor changes to websites or diff HTML files
Larry Cook
lcook at sybase.com
Fri Feb 4 15:21:01 EST 2005
Drew,
> Something that converts HTML to text, like this?
>
> http://www.icewalkers.com/Linux/Software/51170/html2txt.html
Thanks for the pointer. Unfortunately it doesn't do any better of a job than
my simple Perl filter, and it even appears to have a bug where it prints
letters in the first column twice. But there were also two other HTML to Text
converters. One only did a little better than my script, but the other one
actually does some parsing, so it appears to get rid of some the M$ crap.
Maybe I'll be able to enhance it to get rid of it all.
Thanks,
Larry
More information about the gnhlug-discuss
mailing list