Home | About | Sematext search-lucene.com search-hadoop.com
 Search Lucene and all its subprojects:

Switch to Threaded View
Nutch, mail # dev - Re: [jira] [Commented] (NUTCH-1390) readdb -url $url throws NPE with gora-cassandra


Copy link to this message
-
Re: [jira] [Commented] (NUTCH-1390) readdb -url $url throws NPE with gora-cassandra
Lewis John Mcgibbney 2012-08-22, 15:58
Thanks

This is most probably a bug in Gora's CassandraClient code then.

Thanks for reporting.

On Wed, Aug 22, 2012 at 3:09 PM, lin weijian <[EMAIL PROTECTED]> wrote:
>
> I test this situation with Hbase 0.92.1, but  it works just right, no matter trancearoundtheworld.com or other domain.
>
>
>
> 在 2012-8-22,下午9:59, Lewis John McGibbney (JIRA) 写道:
>
>>
>>    [ https://issues.apache.org/jira/browse/NUTCH-1390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13439539#comment-13439539 ]
>>
>> Lewis John McGibbney commented on NUTCH-1390:
>> ---------------------------------------------
>>
>> Can anyone confirm is this is the case with any other backend other than Cassandra? If needs to I'll do test runs on all backends if I get no feedback. Ta Lewis
>>
>>> readdb -url $url throws NPE with gora-cassandra
>>> -----------------------------------------------
>>>
>>>                Key: NUTCH-1390
>>>                URL: https://issues.apache.org/jira/browse/NUTCH-1390
>>>            Project: Nutch
>>>         Issue Type: Bug
>>>         Components: crawldb
>>>   Affects Versions: nutchgora
>>>           Reporter: Lewis John McGibbney
>>>            Fix For: 2.1
>>>
>>>
>>> After successfully injecting, generating, fetching (without parsing enabled), parsing, updatingdb, then executinga readdb passing a particular -url argument I get a lovely NPE
>>> {code}
>>> lewis@lewis:~/ASF/nutchgora/runtime/local$ ./bin/nutch readdb -url http://www.trancearoundtheworld.com
>>> WebTableReader: java.lang.NullPointerException
>>>      at org.apache.gora.cassandra.store.CassandraClient.getFamilyMap(CassandraClient.java:220)
>>>      at org.apache.gora.cassandra.store.CassandraStore.execute(CassandraStore.java:108)
>>>      at org.apache.nutch.crawl.WebTableReader.read(WebTableReader.java:234)
>>>      at org.apache.nutch.crawl.WebTableReader.run(WebTableReader.java:476)
>>>      at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>>>      at org.apache.nutch.crawl.WebTableReader.main(WebTableReader.java:412)
>>> {code}
>>
>> --
>> This message is automatically generated by JIRA.
>> If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
>> For more information on JIRA, see: http://www.atlassian.com/software/jira
>>
>>
>

--
Lewis