Tika, mail # dev - Support detecting 7-zip format


Support detecting 7-zip format
Marco Quaranta 2012-06-20, 08:42
Hello,
I'd like to contribute to the tika-mimetypes.xml file-format registry.
I've added the 7-zip format to my custom-mimetype.xml:

<mime-type type="application/x-7z-compressed">
    <acronym>7zip</acronym>
    <_comment>7-zip archive</_comment>
    <magic priority="50">
        <!-- Magic: '7', 'z', 0xBC, 0xAF, 0x27, 0x1C -->
        <!-- offset is the position where the match starts; "a:b" would allow a range of start positions -->
        <match value="7z" type="string" offset="0">
            <match value="0xBCAF271C" type="string" offset="2"/>
        </match>
    </magic>
    <glob pattern="*.7z"/>
</mime-type>
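
For anyone who wants to sanity-check the signature outside Tika, here is a minimal sketch in plain Java of the comparison the magic rule describes: the first six bytes of the file against '7', 'z', 0xBC, 0xAF, 0x27, 0x1C. The class and method names (`SevenZipMagic`, `matches`) are hypothetical and not part of Tika; this only illustrates the byte check, not how Tika implements detection.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Tika: checks the 7z signature by hand.
final class SevenZipMagic {
    // '7', 'z', 0xBC, 0xAF, 0x27, 0x1C, as listed in the magic rule's comment
    private static final byte[] SIGNATURE = {
        '7', 'z', (byte) 0xBC, (byte) 0xAF, (byte) 0x27, (byte) 0x1C
    };

    private SevenZipMagic() {}

    /** Returns true if the first six bytes of data equal the 7z signature. */
    static boolean matches(byte[] data) {
        if (data == null || data.length < SIGNATURE.length) {
            return false;
        }
        // Compare only the leading bytes; anything after the signature is ignored.
        return Arrays.equals(Arrays.copyOf(data, SIGNATURE.length), SIGNATURE);
    }
}
```

Calling `SevenZipMagic.matches(...)` on the first bytes read from a `.7z` file should return true, while a buffer that starts with a ZIP header ('P', 'K', ...) should return false.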
This is an override for the Google KML format:

  <mime-type type="application/vnd.google-earth.kml+xml">
    <root-XML localName="kml"/>
    <root-XML namespaceURI="http://www.opengis.net/kml/2.2" localName="kml"/>
    <acronym>KML</acronym>
    <_comment>Keyhole Markup Language</_comment>
    <glob pattern="*.kml"/>
    <sub-class-of type="application/xml"/>
  </mime-type>
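
To illustrate what the two root-XML entries are meant to match, here is a rough sketch using only the JDK's StAX parser: it reads the root element and accepts it when the local name is `kml` and the namespace is either absent or `http://www.opengis.net/kml/2.2`. The class name `KmlRootCheck` is hypothetical, and treating the namespace-less entry as "no namespace only" is my assumption about the intended semantics, not a statement of how Tika evaluates root-XML rules internally.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper, not part of Tika: inspects only the root element.
final class KmlRootCheck {
    private static final String KML_NS = "http://www.opengis.net/kml/2.2";

    private KmlRootCheck() {}

    /** Returns true if the root element is "kml" with no namespace or the KML 2.2 namespace. */
    static boolean isKml(String xml) {
        try {
            XMLInputFactory factory = XMLInputFactory.newInstance();
            // Do not process DTDs; only the root element matters here.
            factory.setProperty(XMLInputFactory.SUPPORT_DTD, false);
            XMLStreamReader reader = factory.createXMLStreamReader(
                    new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            while (reader.hasNext()) {
                if (reader.next() == XMLStreamConstants.START_ELEMENT) {
                    String ns = reader.getNamespaceURI();
                    boolean nsOk = ns == null || ns.isEmpty() || KML_NS.equals(ns);
                    // The first start element is the root; decide and stop.
                    return nsOk && "kml".equals(reader.getLocalName());
                }
            }
            return false;
        } catch (XMLStreamException e) {
            // Not well-formed XML, so it cannot be KML.
            return false;
        }
    }
}
```

A document whose root is `<kml xmlns="http://www.opengis.net/kml/2.2">` or a bare `<kml>` passes this check; a `kml` root in some other namespace, or any other root element, does not.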

Regards,
Marco