Search criteria: schema.xml. Results 1 to 10 of 14.
schema.xml bloat? - Solr - [mail # user]
...Hi,  it seems from my limited experience thus far that as new data types are added, schema.xml will tend to become bloated with many different field and fieldtype definitions...
   Author: Fred Zimmerman, 2011-10-23, 23:34
Re: schema.xml bloat? - Solr - [mail # user]
...So, basically, yes, it is a real problem and there is no designed solution?  e.g. optional sub-schema files that can be turned off and on?  On Sun, Oct 23, 2011 at 6:38 PM, Erik Hatcher...
   Author: Fred Zimmerman, 2011-10-24, 00:23
1) success 2) how to tell Nutch "index everything" - Nutch - [mail # user]
...1) I resolved the issues with solrindex. It turned out to be a matter of adding all the nutch schema-specific fields to solr's schema.xml.  there was one gotcha which is that the latest...
... solr schema does not have a default fieldtype "text" as in Nutch 1.3/schema.xml; you must use "text_general".  A comment for developers is that the use case of copying the nutch schema...
   Author: Fred Zimmerman, 2011-10-26, 14:37
fixed schema problems, now running out of memory? - Solr - [mail # user]
...It's a small indexing job coming from nutch.  2011-10-26 15:07:29,039 WARN  mapred.LocalJobRunner - job_local_0011 java.io.IOException: org.apache.solr.client.solrj.SolrServerExcep...
   Author: Fred Zimmerman, 2011-10-26, 15:09
limiting searches to particular sources - Solr - [mail # user]
...I want to be able to list some searches to particular sources, e.g. "wiki only", "crawled only", etc.  So I think I need to create a source field in the schema.xml.  However, the native...
   Author: Fred Zimmerman, 2011-11-02, 14:52
Re: fixed schema problems, now running out of memory? - Solr - [mail # user]
...http://wiki.apache.org/solr/SolrPerformanceFactors#Schema_Design_Considerations  The number of indexed fields greatly increases the following:   http://lucene.472066.n3.nabble...
   Author: Fred Zimmerman, 2011-10-26, 17:18
Re: fixed schema problems, now running out of memory? - Solr - [mail # user]
...More on what's happening. It seems to be timing out during the commit.  The new documents are small, but the existing index is large (11 million records).  INFO: Closing Searcher@4...
   Author: Fred Zimmerman, 2011-10-26, 17:01
Re: solrindexer parameters -- input path does not exist: crawl_fetch, parse_data, etc. - Nutch - [mail # user]
...I added just the  field ... I have already modified solr's schema.xml to accommodate some other data types.  Now when starting solr ...  INFO: SolrUpdateServlet.init() done 2011-10-26 13...
   Author: Fred Zimmerman, 2011-10-26, 13:31
where is solr data import handler looking for my file? - Solr - [mail # user]
...Solr dataimport is reporting file not found when it looks for foo.xml.  Where is it looking for /data? is this an url off the apache2/htdocs on the server, or is it an URL within example/solr/...?     ...
   Author: Fred Zimmerman, 2011-10-19, 20:49
Re: advice, config files for crawling private wikipedia mirror - Nutch - [mail # user]
...so let me make sure I understand.  what this guy did is that he made an XML file from his local backup of wikipedia but he didn't crawl it? maybe I don't need to crawl it, either, since...
... the XML file can include the "id" field which is where Solr keeps URLs, right?  What I want to be able to do is submit a search to Solr, get back an answer set as a file using wt=csv, use a shell...
   Author: Fred Zimmerman, 2011-10-10, 14:41