Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Plain View
Nutch, mail # user - Using latest version of Tika with nutch


+
Paul Rogers 2011-03-15, 11:43
Copy link to this message
-
Re: Using latest version of Tika with nutch
Andrzej Bialecki 2011-03-15, 11:53
On 3/15/11 12:43 PM, Paul Rogers wrote:
> Dear All
>
> I'm currently having great difficulty building nutch from trunk.  The
> reason that I'm attempting this is that I wish to use the latest
> version of Tika and I thought there might be an alternative.
> Therefore:
>
> Is it possible to determine the version of Tika included with the
> binary version (1.2) of nutch?

Yes, take a look at the lib/tika-*.jar. The 1.2 release shipped with
Tika 0.7.

>
> Is it possible to add the latest version of Tika (0.9 I believe) to
> the binary version (1.2) nutch? Or is it all bound up in the
> executable?

It's not possible - the Tika API has changed. Your best bet is to
encourage Nutch devs to work on NUTCH-967 and then wait a couple weeks
more and upgrade to Nutch 1.3.
--
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
+
Paul Rogers 2011-03-15, 12:21
+
Paul Rogers 2011-03-15, 12:31
+
Andrzej Bialecki 2011-03-15, 12:44
+
Paul Rogers 2011-03-15, 13:02