Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Tika, mail # user - Problem indexing msg files


Copy link to this message
-
Re: Problem indexing msg files
Nick Burch 2011-11-12, 20:19
On Fri, 11 Nov 2011, Swapna Vuppala wrote:
> Am using Tika to index .msg files of Outlook. It has been working very
> good for me but am facing problem while indexing some .msg files. The
> indexing fails with the below Solr exception
>
> SEVERE: org.apache.solr.common.SolrException: Invalid Date String:' Fri, 14 Oct 2011 12:35:51 +0100'
>                at org.apache.solr.schema.DateField.parseMath(DateField.java:165)
>                at org.apache.solr.schema.TrieField.createField(TrieField.java:387)
>                at org.apache.solr.schema.TrieDateField.createField(TrieDateField.java:120)
>                at org.apache.solr.schema.SchemaField.createField(SchemaField.java:104)
>                at org.apache.solr.update.DocumentBuilder.addField(DocumentBuilder.java:203)

This would look to be a SOLR issue. Tika will try to return an ISO-8601
date string where it can, but sometimes it gets a random String.
Downstream apps will need to handle this

Nick