web site change monitoring tool recommendations?

Larry Cook lcook at sybase.com
Wed Sep 13 12:16:00 EDT 2006


Hi Kevin,

> ${extended_family_member} isn't a hard-core computer person.

Does ${extended_family_member} run Linux?

Attached are my scripts I use to monitor some small websites.  I wrote 
these because I wanted to monitor a complete website, not just a few 
existing pages.  They are extremely primitive.

Disadvantages:

1) Copies the website so you don't want to use this on anything large.

2) The output for differences is from diff, so a bit confusing for 
non-geeks.

Larry
-------------- next part --------------
A non-text attachment was scrubbed...
Name: mindit.sh
Type: application/x-sh
Size: 717 bytes
Desc: not available
Url : http://mail.gnhlug.org/mailman/private/gnhlug-discuss/attachments/20060913/2ea959bc/mindit.sh
-------------- next part --------------
A non-text attachment was scrubbed...
Name: htmldiff.sh
Type: application/x-sh
Size: 752 bytes
Desc: not available
Url : http://mail.gnhlug.org/mailman/private/gnhlug-discuss/attachments/20060913/2ea959bc/htmldiff.sh
-------------- next part --------------
#!/usr/local/bin/perl

#
# This script will strip all HTML tags from the input.
# It also removes any extra white space.  The result
# is a fairly readable text file suitable for diff'ing.
#

# Read stdin into a single string
while ($line = <>) {
    $_ .= $line;
}

s,\s+, ,g;	# Remove extra white space
s,\s*<,\n<,g;	# Remove white space and insert newline before HTML tag
s,>\s*,>\n,g;	# Remove white space and insert newline after HTML tag
s,;\s*,;\n,g;	# Remove white space and insert newline after semicolons

s,<[^>]*>\s*,,g; # Now get ride of HTML tags

# Write it to stdout
print;		


More information about the gnhlug-discuss mailing list