rawdog 2.10 utf8 codec can't decode bytes

Glen Stewart root at associate.com
Sat Sep 20 12:49:42 UTC 2008


Package: rawdog
Version: 2.10.dfsg.1-1
Severity: important
Tags: patch

*** Please type your report below this line ***
While parsing various RSS feeds, RawDog has been increasingly failing with the 
error noted below.  Upon installing the 2.11 version at the author's web site, 
http://offog.org/code/rawdog.html  the problem was resolved.  The error:

Traceback (most recent call last):
  File "/usr/bin/rawdog", line 31, in <module>
    launch()
  File "/usr/bin/rawdog", line 24, in launch
    sys.exit(main(sys.argv[1:]))
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 1489, in 
main
    rawdog.write(config)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 1371, in 
write
    self.write_output_file(articles, article_dates, config)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 1305, in 
write_output_file
    self.write_article(f, article, config)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 1165, in 
write_article
    description = detail_to_html(entry_info[key], False, config, 
force_preformatted)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 151, in 
detail_to_html
    return sanitise_html(html, detail["base"], inline, config)
  File "/var/lib/python-support/python2.5/rawdoglib/rawdog.py", line 79, in 
sanitise_html
    html = feedparser._resolveRelativeURIs(html, baseurl, "UTF-8")
  File "/var/lib/python-support/python2.5/rawdoglib/feedparser.py", line 1633, 
in _resolveRelativeURIs
    p.feed(htmlSource)
  File "/var/lib/python-support/python2.5/rawdoglib/feedparser.py", line 1480, 
in feed
    sgmllib.SGMLParser.feed(self, data)
  File "/usr/lib/python2.5/sgmllib.py", line 99, in feed
    self.goahead(0)
  File "/usr/lib/python2.5/sgmllib.py", line 133, in goahead
    k = self.parse_starttag(i)
  File "/usr/lib/python2.5/sgmllib.py", line 291, in parse_starttag
    self.finish_starttag(tag, attrs)
  File "/usr/lib/python2.5/sgmllib.py", line 333, in finish_starttag
    self.unknown_starttag(tag, attrs)
  File "/var/lib/python-support/python2.5/rawdoglib/feedparser.py", line 1628, 
in unknown_starttag
    _BaseHTMLProcessor.unknown_starttag(self, tag, attrs)
  File "/var/lib/python-support/python2.5/rawdoglib/feedparser.py", line 1497, 
in unknown_starttag
    value = unicode(value, self.encoding)
  File "/usr/lib/python2.5/encodings/utf_8.py", line 16, in decode
    return codecs.utf_8_decode(input, errors, True)
UnicodeDecodeError: 'utf8' codec can't decode bytes in position 22-23: invalid 
data


-- System Information:
Debian Release: lenny/sid
  APT prefers hardy-updates
  APT policy: (500, 'hardy-updates'), (500, 'hardy-security'), (500, 'hardy-
proposed'), (500, 'hardy-backports'), (500, 'hardy')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.24-21-generic (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages rawdog depends on:
ii  python-support              0.7.5ubuntu1 automated rebuilding support for 
p

rawdog recommends no packages.

-- no debconf information


-------------- next part --------------
A non-text attachment was scrubbed...
Name: rawdog-2.11.tar.gz
Type: application/x-compressed-tar
Size: 103118 bytes
Desc: not available
URL: <https://lists.ubuntu.com/archives/ubuntu-users/attachments/20080920/c463e2bd/attachment.bin>


More information about the ubuntu-users mailing list