Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 81 to 90 from 6046 (0.436s).
Loading phrases to help you
refine your search...
Re: FW: [Tika Wiki] Update of "RecursiveMetadata" by domtheo - Tika - [mail # dev]
...On Thu, 7 Mar 2013, Mattmann, Chris A (388J) wrote:  I think you need to ask infra  Nick...
   Author: Nick Burch, 2013-03-07, 11:23
Questions about java TIKA project. - Tika - [mail # dev]
...//----------------------------------------------------------------------------------- I notice that the java TIKA project is for file format support using java and various Office file format...
   Author: A Z, 2013-03-07, 08:28
FW: [Tika Wiki] Update of "RecursiveMetadata" by domtheo - Tika - [mail # dev]
...Guys I reverted this spammer but don't know how to block him. Help?  Cheers, Chris  On 3/6/13 7:12 PM, "Apache Wiki"  wrote:  ...
   Author: Mattmann, Chris A, 2013-03-07, 03:29
Re: Improvement in Metadata Class - Tika - [mail # user]
...Oh and thanks for taking the patch into Tika. I hope it will be a *bit* clearer for folks in a similar position as us (in Nutch) to see exactly what should be pulled from Tika. Lewis  O...
   Author: Lewis John Mcgibbney, 2013-03-06, 18:50
Re: Improvement in Metadata Class - Tika - [mail # user]
...Hi Chris, Thanks for the input. RE#3 Yeah, me and Sebastien are now discussing this and will address it within NUTCH-1537 Thanks Lewis  On Sun, Mar 3, 2013 at 9:41 PM, Mattmann, Chris A...
   Author: Lewis John Mcgibbney, 2013-03-06, 18:49
Javascript 'incorrectly' extracted as text - Tika - [mail # user]
...Hi,  In following case Javascript is extracted:      This is strictly speaking correct behaviour but we all know this is an error in the HTML where the opening tag is clo...
   Author: Markus Jelsma, 2013-03-06, 14:49
Re: [jira] [Commented] (TIKA-245) Support of CHM Format - Tika - [mail # dev]
...Tika chm support has its limitations, can you provide such file(s) for further investigation ?  BR, Oleg   On Wed, Mar 6, 2013 at 1:10 AM, Tejas Patil (JIRA)  wrote:  ...
   Author: Oleg Tikhonov, 2013-03-06, 03:49
[TIKA-245] Support of CHM Format - Tika - [issue]
...It might be a good idea to support the CHM File format of Windows. Some information about http://en.wikipedia.org/wiki/Microsoft_Compiled_HTML_Help#Extracting_to_HTML. The CHM format contain...
http://issues.apache.org/jira/browse/TIKA-245    Author: Karl Heinz Marbaise, 2013-03-05, 23:09
Re: how to add more metadata to tika extraction? - Tika - [mail # dev]
...On Wed, 27 Feb 2013, eShard wrote:  Looks like the metadata you want isn't being pulled out as metadata by  Tika   Metadata != content  I'd suspect that if you look at th...
   Author: Nick Burch, 2013-03-05, 21:33
Re: How to hide some Excel content - Tika - [mail # user]
...OK. I was just wondering if there was a built-in way to specify a customer handler that could do something like this to avoid compiling a custom version of the project.    I see. G...
   Author: CL, 2013-03-05, 17:34
Sort:
project
Lucene (129935)
Solr (103805)
ElasticSearch (33653)
Mahout (31245)
Nutch (16523)
ManifoldCF (15113)
Tika (5954)
Lucene.Net (5782)
PyLucene (1905)
Droids (1663)
Lucy (1356)
OpenRelevance (286)
type
javadoc (1746)
mail # dev (1433)
mail # user (1274)
issue (1097)
source code (357)
Sematext # blog (92)
web site (38)
wiki (9)
date
last 7 days (2)
last 30 days (13)
last 90 days (127)
last 6 months (460)
last 9 months (3943)
author
Jukka Zitting (530)
Nick Burch (410)
Mattmann, Chris A (324)
Michael McCandless (176)
Ken Krugler (161)
buildbot@...)
Oleg Tikhonov (58)
Markus Jelsma (56)
Mark Kerzner (53)
Dave Meikle (49)
Maxim Valyanskiy (46)
Keith R. Bennett (45)
Ray Gauss II (40)
Antoni Mylka (37)
Benson Margulies (37)