Search criteria: schema.xml.
Results 1 to 10 of 14 (6.363s).
schema.xml bloat? - Solr - [mail # user]
...Hi, it seems from my limited experience thus far that as new data types are added, schema.xml will tend to become bloated with many different field and fieldtype definitions...
Author: Fred Zimmerman, 2011-10-23, 23:34

Re: schema.xml bloat? - Solr - [mail # user]
...So, basically, yes, it is a real problem and there is no designed solution? e.g. optional sub-schema files that can be turned off and on? On Sun, Oct 23, 2011 at 6:38 PM, Erik Hatcher...
Author: Fred Zimmerman, 2011-10-24, 00:23

1) success 2) how to tell Nutch "index everything" - Nutch - [mail # user]
...1) I resolved the issues with solrindex. It turned out to be a matter of adding all the Nutch schema-specific fields to Solr's schema.xml. There was one gotcha, which is that the latest...
... solr schema does not have a default fieldtype "text" as in Nutch 1.3/schema.xml; you must use "text_general". A comment for developers is that the use case of copying the nutch schema...
... to overwrite the solr one only works for people who are beginning their indexing with a crawl. More detailed instructions on how to modify solr/schema.xml for nutch would be helpful, or better yet...
Author: Fred Zimmerman, 2011-10-26, 14:37

fixed schema problems, now running out of memory? - Solr - [mail # user]
...It's a small indexing job coming from nutch. 2011-10-26 15:07:29,039 WARN mapred.LocalJobRunner - job_local_0011 java.io.IOException: org.apache.solr.client.solrj.SolrServerExcep...
Author: Fred Zimmerman, 2011-10-26, 15:09

limiting searches to particular sources - Solr - [mail # user]
...I want to be able to limit some searches to particular sources, e.g. "wiki only", "crawled only", etc. So I think I need to create a source field in the schema.xml. However, the native...
Author: Fred Zimmerman, 2011-11-02, 14:52

Re: fixed schema problems, now running out of memory? - Solr - [mail # user]
...http://wiki.apache.org/solr/SolrPerformanceFactors#Schema_Design_Considerations The number of indexed fields greatly increases the following: http://lucene.472066.n3.nabble...
Author: Fred Zimmerman, 2011-10-26, 17:18

Re: fixed schema problems, now running out of memory? - Solr - [mail # user]
...More on what's happening. It seems to be timing out during the commit. The new documents are small, but the existing index is large (11 million records). INFO: Closing Searcher@4...
Author: Fred Zimmerman, 2011-10-26, 17:01

Re: solrindexer parameters -- input path does not exist: crawl_fetch, parse_data, etc. - Nutch - [mail # user]
...I added just the field ... I have already modified solr's schema.xml to accommodate some other data types. Now when starting solr ... INFO: SolrUpdateServlet.init() done 2011-10-26 13...
Author: Fred Zimmerman, 2011-10-26, 13:31

where is solr data import handler looking for my file? - Solr - [mail # user]
...Solr dataimport is reporting file not found when it looks for foo.xml. Where is it looking for /data? Is this a URL off the apache2/htdocs on the server, or is it a URL within example/solr/...? ...
Author: Fred Zimmerman, 2011-10-19, 20:49

Re: advice, config files for crawling private wikipedia mirror - Nutch - [mail # user]
...So let me make sure I understand. What this guy did is that he made an XML file from his local backup of Wikipedia but he didn't crawl it? Maybe I don't need to crawl it, either, since...
... the XML file can include the "id" field which is where Solr keeps URLs, right? What I want to be able to do is submit a search to Solr, get back an answer set as a file using wt=csv, use a shell...
Author: Fred Zimmerman, 2011-10-10, 14:41
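
The last message above asks about submitting a search to Solr and getting the answer set back as a file with wt=csv. A minimal sketch of that idea follows. It is not taken from the thread: the base URL, the helper names, and the sample field list are all assumptions made for illustration, and the sketch only builds the request URL and parses a CSV body rather than contacting a server.

```python
import csv
import io
from urllib.parse import urlencode

# Hypothetical base URL; adjust host, port, and core path to your install.
SOLR_SELECT = "http://localhost:8983/solr/select"


def build_csv_query(query, fields=("id",), rows=10):
    """Return a select URL that asks Solr for CSV output (wt=csv)."""
    params = {"q": query, "fl": ",".join(fields), "rows": rows, "wt": "csv"}
    return SOLR_SELECT + "?" + urlencode(params)


def column_from_csv(text, column="id"):
    """Pull one column out of a CSV response body that has a header row."""
    reader = csv.DictReader(io.StringIO(text))
    return [row[column] for row in reader]
```

The URL from build_csv_query could be fetched with curl or wget in a shell script and saved to a file; column_from_csv then recovers the "id" values (URLs, in the setup the message describes) from that saved text.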