[Bug 1857824] [NEW] kmimetypefinder5 misidentifies mimetype of python files containing certain strings

Launchpad Bug Tracker 1857824 at bugs.launchpad.net
Wed Sep 23 15:16:39 UTC 2020

You have been subscribed to a public bug:

Expected behavior:

    $ kmimetypefinder5 example.py 


    $ kmimetypefinder5 example.py 


    $ kmimetypefinder5 example.py 

Actual behavior:

    $ kmimetypefinder5 example.py 

Summary: Python scripts with a string containing HTML can be
misidentified as HTML files by kmimetypefinder5.

For example, this python script is identified as

#! /usr/bin/env python3
example_string = \
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
<html xmlns="http://www.w3.org/1999/xhtml">
    <title>Example title</title>
    <p>Example body</p>
print('Hello, world!')

This difficulty is not shared by other mimetype identification tools.

    $ kmimetypefinder5 example.py 
    $ cat example2.py #! /usr/bin/env python3
    print('Hello, world!')
    $ kmimetypefinder5 example2.py 
    $ mimetype 'example.py'
    example.py: text/x-python
    $ mimetype 'example2.py'
    example2.py: text/x-python
    $ file --mime-type 'example.py'
    example.py: text/plain
    $ file --mime-type 'example2.py'
    example2.py: text/plain

$ lsb_release -rd
Description:	Ubuntu 18.04.3 LTS
Release:	18.04
$ apt-cache policy kde-cli-tools
  Installed: 4:5.12.8-0ubuntu0.1
  Candidate: 4:5.12.8-0ubuntu0.1
  Version table:
 *** 4:5.12.8-0ubuntu0.1 500
        500 http://us.archive.ubuntu.com/ubuntu bionic-updates/universe amd64 Packages
        100 /var/lib/dpkg/status
     4:5.12.4-0ubuntu1 500
        500 http://us.archive.ubuntu.com/ubuntu bionic/universe amd64 Packages

ProblemType: Bug
DistroRelease: Ubuntu 18.04
Package: kde-cli-tools 4:5.12.8-0ubuntu0.1
ProcVersionSignature: Ubuntu 4.15.0-72.81-generic 4.15.18
Uname: Linux 4.15.0-72-generic x86_64
ApportVersion: 2.20.9-0ubuntu7.9
Architecture: amd64
CurrentDesktop: KDE
Date: Sun Dec 29 13:28:37 2019
InstallationDate: Installed on 2018-12-12 (381 days ago)
InstallationMedia: Kubuntu 18.04.1 LTS "Bionic Beaver" - Release amd64 (20180725)
SourcePackage: kde-cli-tools
UpgradeStatus: No upgrade log present (probably fresh install)

** Affects: kde-cli-tools (Ubuntu)
     Importance: Undecided
         Status: New

** Tags: amd64 apport-bug bionic third-party-packages
kmimetypefinder5 misidentifies mimetype of python files containing certain strings
You received this bug notification because you are a member of Kubuntu Bugs, which is subscribed to kde-cli-tools in Ubuntu.

More information about the kubuntu-bugs mailing list