linkchecker bugs

Ian Smith-Heisters ismithhe at marlboro.edu
Thu Apr 27 16:14:42 UTC 2006


The linkchecker package says bugs should be reported here, so here goes:

1) Uncommenting any of the ignoreN= lines in the default linkcheckerrc 
results in the following cryptic error:

Traceback (most recent call last):
   File "/usr/bin/linkchecker", line 511, in ?
     config.read()
   File "/usr/lib/python2.4/site-packages/linkcheck/configuration.py", line 243, in read
     self.read_config(cfiles)
   File "/usr/lib/python2.4/site-packages/linkcheck/configuration.py", line 262, in read_config
     self.read_filtering_config(cfgparser)
   File "/usr/lib/python2.4/site-packages/linkcheck/configuration.py", line 437, in read_filtering_config
     linkcheck.log.error(
TypeError: error() takes at least 2 arguments (1 given)

Changing line 436 in configuration.py from
                 if len(ctuple)!=2:
to
                 if len(ctuple)!=1:
makes the error go away, and the software appears to behave as expected. 
Sorry, no patch since it's so simple, and I have no idea if this "fix" 
has other ramifications.
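
For what it's worth, the traceback suggests the length check is only the 
trigger: the TypeError itself comes from the error-reporting call on line 
437, which appears to pass one argument to a function that needs at least 
two. Here is a minimal sketch of that failure mode. The helper signature 
error(logname, msg, *args) and both report_* functions are hypothetical 
stand-ins, not taken from the linkcheck source:

```python
def error(logname, msg, *args):
    """Hypothetical logging helper: logger name first, then the message.

    This is NOT the real linkcheck.log.error, only an assumed shape that
    would produce the same kind of TypeError as in the traceback.
    """
    return "[%s] %s" % (logname, msg % args)


def report_bad_entry(entry):
    # Buggy call shape: only one positional argument is passed, so Python
    # raises TypeError before any useful message can be produced.
    return error("invalid entry %r" % entry)


def report_bad_entry_fixed(entry):
    # Passing a logger name plus the message lets the real problem surface.
    return error("config", "invalid entry %r", entry)
```

If the real call has this shape, fixing its arguments would turn the 
cryptic traceback into a readable message about the config entry, whether 
or not the length check also needs changing.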

2) The IGNORE WARNINGS list from "$ linkchecker -h" is incomplete. 
According to "grep -R tag= /usr/lib/python2.4/site-packages/linkcheck/*", 
the complete list of ignore tags is:

file-missing-slash
file-system-path
ftp-missing-slash
http-robots-denied
http-no-anchor-support
http-moved-permanent
http-wrong-redirect
http-empty-content
http-cookie-store-error
http-decompress-error
http-unsupported-encoding
ignore-url
mail-no-addresses
mail-no-mx-host
mail-unverified-address
mail-no-connection
nntp-no-server
nttp-no-newsgroup
nttp-busy
url-effective-url
url-unicode-domain
url-unnormed
url-error-getting-content
url-anchor-not-found
url-warnregex-found
url-content-too-large

While I haven't gone through them all, http-robots-denied is definitely 
missing from the help output.
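
The rest of the comparison could be automated instead of checked by hand. 
A rough sketch, assuming the tags appear in the source as tag="name" or 
tag='name' (the regex, collect_tags and missing_from_help are my own 
illustrative names, not part of linkchecker):

```python
import os
import re

# Assumption: warning tags are written as tag="some-name" or tag='some-name'.
TAG_RE = re.compile(r"""tag\s*=\s*["']([a-z0-9-]+)["']""")


def collect_tags(root):
    """Return the sorted, de-duplicated tags found in .py files under root."""
    tags = set()
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            if not name.endswith(".py"):
                continue
            path = os.path.join(dirpath, name)
            with open(path, encoding="utf-8", errors="replace") as fh:
                tags.update(TAG_RE.findall(fh.read()))
    return sorted(tags)


def missing_from_help(tags, help_text):
    """Return the tags that do not occur anywhere in the help text."""
    return [tag for tag in tags if tag not in help_text]
```

Running collect_tags on /usr/lib/python2.4/site-packages/linkcheck and 
passing the saved output of "linkchecker -h" as help_text should list the 
undocumented tags. The check is a plain substring match, so a tag that is 
a prefix of another documented tag would be missed.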

3) The default config file, linkcheckerrc, uses something like 
url-not-normed as an ignore flag. The correct flag would seem to be 
url-unnormed.

That's it. This package seems to suit me, but it seems a little old and 
crufty. Can anyone recommend a better package for doing automated link 
checking (and perhaps even accessibility checks)? I saw ht://check, but 
I didn't like that it stored everything in MySQL; I'd rather just get a 
nightly email of errors. And it hadn't been updated in even longer than 
linkchecker.

Thanks!
-- 
Ian Smith-Heisters
Webmaster
Marlboro College
(802) 451.7559 (x.559)
ismithhe at marlboro.edu
http://marlboro.edu
More information about the ubuntu-users mailing list