HTTP headers overrule the html <head> section -- so even if in the <head> you say utf-8, if the http header says iso-8859-1, the page will be interpreted as iso-8859-1 -- WebHTTrack is UTF-8 unaware https://launchpad.net/bugs/5434