Search criteria: schema.html.
Results 1 to 10 of 10 (0.629s).
|
|
|
|
|
schema.xml bloat? - Solr - [mail # user]
|
|
...Hi, it seems from my limited experience thus far that as new data types are added, schema.xml will tend to become bloated with many different field and fieldtype definitions...
|
|
|
Author: Fred Zimmerman,
2011-10-23, 23:34
|
|
|
Re: schema.xml bloat? - Solr - [mail # user]
|
|
...So, basically, yes, it is a real problem and there is no designed solution? e.g. optional sub-schema files that can be turned off and on? On Sun, Oct 23, 2011 at 6:38 PM, Erik Hatcher...
|
|
|
Author: Fred Zimmerman,
2011-10-24, 00:23
|
|
|
fixed schema problems, now running out of memory? - Solr - [mail # user]
|
|
...It's a small indexing job coming from nutch. 2011-10-26 15:07:29,039 WARN mapred.LocalJobRunner - job_local_0011 java.io.IOException: org.apache.solr.client.solrj.SolrServerExcep...
|
|
|
Author: Fred Zimmerman,
2011-10-26, 15:09
|
|
|
Re: fixed schema problems, now running out of memory? - Solr - [mail # user]
|
|
...http://wiki.apache.org/solr/SolrPerformanceFactors#Schema_Design_Considerations The number of indexed fields greatly increases the following: http://lucene.472066.n3.nabble...
|
|
....com/What-is-omitNorms-td2987547.html 1. length normalization will not work on the specific field-- Gak! 11,000,000 * 1 * 31 = 31 x 10M = 310MB RAM all by itself. On Wed, Oct 26, 2011 at 1:01 PM, Fred Zimmerman wrote: ...
|
|
|
Author: Fred Zimmerman,
2011-10-26, 17:18
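The arithmetic quoted in the snippet above can be checked with a short sketch. It assumes, as the snippet's figures suggest, that norms cost one byte per document for each indexed field that keeps them; the function name `norms_bytes` is hypothetical and used only for illustration. The document count (11,000,000) and field count (31) are taken from the snippet.

```python
# Rough norms-memory estimate.
# Assumption: one byte per document per indexed field with norms enabled.
def norms_bytes(num_docs: int, fields_with_norms: int, bytes_per_norm: int = 1) -> int:
    """Return the approximate bytes of RAM used by norms."""
    return num_docs * bytes_per_norm * fields_with_norms


# Figures from the snippet: 11 million documents, 31 indexed fields.
estimate = norms_bytes(11_000_000, 31)
print(f"{estimate / 1_000_000:.0f} MB")  # decimal megabytes
print(f"{estimate / 2**20:.0f} MiB")     # binary mebibytes
```

The snippet rounds 11 million documents down to 10 million, which gives 310 MB; the unrounded product is 341,000,000 bytes. Either way, setting omitNorms on fields that do not need length normalization removes that field's share of the total.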
|
|
|
Re: fixed schema problems, now running out of memory? - Solr - [mail # user]
|
|
...More on what's happening. It seems to be timing out during the commit. The new documents are small, but the existing index is large (11 million records). INFO: Closing Searcher@4...
|
|
|
Author: Fred Zimmerman,
2011-10-26, 17:01
|
|
|
1) success 2) how to tell Nutch "index everything" - Nutch - [mail # user]
|
|
...1) I resolved the issues with solrindex. It turned out to be a matter of adding all the Nutch schema-specific fields to Solr's schema.xml. There was one gotcha, which is that the latest...
|
|
... solr schema does not have a default fieldtype "text" as in Nutch 1.3/schema.xml; you must use "text_general". A comment for developers is that the use case of copying the nutch schema...
|
... to overwrite the solr one only works for people who are beginning their indexing with a crawl. More detailed instructions on how to modify solr/schema.xml for nutch would be helpful, or better yet...
|
|
Author: Fred Zimmerman,
2011-10-26, 14:37
|
|
|
limiting searches to particular sources - Solr - [mail # user]
|
|
...I want to be able to limit some searches to particular sources, e.g. "wiki only", "crawled only", etc. So I think I need to create a source field in the schema.xml. However, the native...
|
|
|
Author: Fred Zimmerman,
2011-11-02, 14:52
|
|
|
strategy for post-processing answer set - Solr - [mail # user]
|
|
... I would like to take the HTML documents that are the result of a Solr search and combine them into a single HTML document that combines the body text of each individual document. What...
|
|
|
Author: Fred Zimmerman,
2011-09-22, 23:44
|
|
|
Re: solrindexer parameters -- input path does not exist: crawl_fetch, parse_data, etc. - Nutch - [mail # user]
|
|
...I added just the field ... I have already modified solr's schema.xml to accommodate some other data types. Now when starting solr ... INFO: SolrUpdateServlet.init() done 2011-10-26 13...
|
|
|
Author: Fred Zimmerman,
2011-10-26, 13:31
|
|
|
Re: strategy for post-processing answer set - Solr - [mail # user]
|
|
... to me as if Solr just brings back the URLs. What I want to do is to get the actual documents in the answer set, simplify their HTML and remove all the JavaScript, ads, etc., and append them...
|
|
|
Author: Fred Zimmerman,
2011-09-24, 15:22
|
|
|
|