Nutch, mail # user - Re: how are CSV/TXT files handled


Re: how are CSV/TXT files handled
remi tassing 2012-02-07, 07:58
What got me thinking about this is the following error message:

"failed(2,0): Can't retrieve Tika parser for mime-type application/ms-excel"

I'm using Nutch-1.2 and my nutch-site.xml has:

"<property>
  <name>plugin.includes</name>

<value>protocol-httpclient|urlfilter-regex|parse-(text|html|js|tika)|index-(basic|anchor)|q..."
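
For anyone checking a similar setup, here is a minimal sketch of how to test which plugin ids a plugin.includes value would enable. It assumes the value is treated as a regular expression that must match the whole plugin id, and it uses only the visible part of the value quoted above (the rest is cut off after "q..." and is left out). The name is_included and the id parse-msexcel are illustrative only.

```python
import re

# Only the visible part of the plugin.includes value quoted above.
# The original is truncated after "q...", so the remainder is deliberately omitted.
VISIBLE_INCLUDES = (
    r"protocol-httpclient|urlfilter-regex|"
    r"parse-(text|html|js|tika)|index-(basic|anchor)"
)


def is_included(plugin_id: str, includes: str = VISIBLE_INCLUDES) -> bool:
    """Return True if the whole plugin id matches the includes pattern.

    Assumption: the pattern has to match the entire id, not just a part of it.
    """
    return re.fullmatch(includes, plugin_id) is not None


if __name__ == "__main__":
    # "parse-msexcel" is only an example of an id the visible pattern does not cover.
    for plugin_id in ("parse-text", "parse-tika", "parse-msexcel"):
        print(plugin_id, is_included(plugin_id))
```

With the visible part of the value, parse-text and parse-tika are matched and parse-msexcel is not. This only shows which plugins are enabled; it says nothing about whether an enabled parser can actually handle a given mime-type such as application/ms-excel.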

Remi

On Tue, Feb 7, 2012 at 9:16 AM, remi tassing <[EMAIL PROTECTED]> wrote:

> Hey guys,
>
> I checked the mailing-list archive but couldn't find an answer to this. I
> think CSV and TXT files don't need any kind of parsing, but how are they
> handled by default?
>
> Remi