Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Tika, mail # user - Test suite for Tika?


Copy link to this message
-
Re: Test suite for Tika?
Mattmann, Chris A 2010-07-10, 04:07
Hi David,

The unit tests for the tika-parsers modules contains the test documents in the directory here:

http://svn.apache.org/repos/asf/tika/trunk/tika-parsers/src/test/resources/test-documents/

HTH,
Chris

On 7/9/10 8:32 PM, "David Kovar" <[EMAIL PROTECTED]> wrote:

Good evening,

Is there an available set of documents that is used to validate Tika's performance? I am working on validating the performance of some ediscovery tools and such a test set would be very useful.

Thank you.

-David

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: [EMAIL PROTECTED]
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++