Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Plain View
Solr, mail # user - Re: curl or nutch


+
findbestopensource 2012-05-16, 09:29
Copy link to this message
-
Re: curl or nutch
Tolga 2012-05-16, 12:11
Can nutch crawl/index files as well?

On 5/16/12 12:29 PM, findbestopensource wrote:
> You could very well use Solr. It has support to index the PDF and XML
> files. If you want to index websites and search using page rank then choose
> Nutch.
>
> Regards
> Aditya
> www.findbestopensource.com
>
>
> On Wed, May 16, 2012 at 1:13 PM, Tolga<[EMAIL PROTECTED]>  wrote:
>
>> Hi,
>>
>> I have been trying for a week. I really want to get a start, so what
>> should I use? curl or nutch? I want to be able to index pdf, xml etc. and
>> search within them as well.
>>
>> Regards,
>>
+
Otis Gospodnetic 2012-05-17, 04:50