| clear query|facets|time |
Search criteria: .
Results from 101 to 110 from
161 (0.101s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: buildbot failure in ASF Buildbot on tika-trunk - Tika - [mail # dev]
|
|
...This looks like another case of the Maven "site" target build dependency bug: http://ci.apache.org/builders/tika-trunk/builds/49/steps/compile/logs/stdio The TikaCLI code ...
|
|
|
Author: Ken Krugler,
2010-07-12, 18:05
|
|
|
Boilerpipe integration - Tika - [mail # dev]
|
|
...I've resolved TIKA-420, by rolling in modified versions of Christian's patches. You can try out this new functionality by using the TikaCLI with the new -T option, which ...
|
|
|
Author: Ken Krugler,
2010-07-12, 17:34
|
|
|
TIKA-420 patch for boilerplate removal - Tika - [mail # dev]
|
|
...I've submitted a revised patch (https://issues.apache.org/jira/browse/TIKA-420 ), and had one key question. Currently the BoilerpipeContentHandler calls a delegate Content...
|
|
|
Author: Ken Krugler,
2010-07-10, 00:23
|
|
|
[TIKA-459] Improve handling of incorrect charset names in HTTP response header - Tika - [issue]
|
|
...While crawling a few million pages, I collected stats for charset names that weren't valid.The attached patch "fixes up" most of these that I encountered, and thus should improve the accurac...
|
|
|
http://issues.apache.org/jira/browse/TIKA-459
Author: Ken Krugler,
2010-07-08, 18:56
|
|
|
Re: buildbot failure in ASF Buildbot on tika-trunk - Tika - [mail # dev]
|
|
...Hi Chris, Wait, I think the issue is that the 'site' target triggers the compilation of the sources, before the jars are built. Yup, just found a good description - it's a...
|
|
|
Author: Ken Krugler,
2010-07-08, 18:44
|
|
|
Re: buildbot failure in ASF Buildbot on tika-trunk - Tika - [mail # dev]
|
|
...Hi Chris, Thanks for digging into this. The tika-core sources are being built before the tika-parsers sources, since tika-parsers has a dependency on tika-core, and Maven...
|
|
|
Author: Ken Krugler,
2010-07-08, 18:35
|
|
|
[TIKA-359] Calls to Charset.isSupported() will throw exceptions for invalid charset names - Tika - [issue]
|
|
...The HtmlParser and TXTParser code currently call Charset.isSupported() to determine if charset hint info (from meta tags or incoming metadata).But this method throws IllegalCharsetNameExcept...
|
|
|
http://issues.apache.org/jira/browse/TIKA-359
Author: Ken Krugler,
2010-07-08, 00:34
|
|
|
Re: Tika 0.7 And Solr - Tika - [mail # dev]
|
|
...Hi Rohan, On Jul 7, 2010, at 4:01am, rohanpatil wrote: Back in January I'd run into the same issue: To fix it, you could manually install the bcprov-jdk14.jar Ken ...
|
|
|
Author: Ken Krugler,
2010-07-07, 16:44
|
|
|
[TIKA-307] Better handling of partial/truncated input data to parsers - Tika - [issue]
|
|
...Some parsers (e.g. ZipParser) can hang if they prematurely reach the end of the input stream.As a way of avoiding this issue, Jukka had suggested the following approach on the list:The input...
|
|
|
http://issues.apache.org/jira/browse/TIKA-307
Author: Ken Krugler,
2010-07-05, 21:29
|
|
|
Re: Getting started - Tika - [mail # dev]
|
|
...Are you sure your new parser is on the classpath? E.g. put a break on getSupportedTypes() and make sure that's getting called - if not, then the parser isn't being "found" by Ti...
|
|
|
Author: Ken Krugler,
2010-06-21, 17:04
|
|
|
|