Mahout, mail # user - ERROR: Split: Couldn't determine class of the input values


ERROR: Split: Couldn't determine class of the input values
pricila rr 2012-07-27, 13:27
I have a 1 GB .txt file that I am trying to split with the Mahout split
command.
When I run it, I get the error: Couldn't determine class of the input values
How can I solve this? Is there another way to split a file in Hadoop or
Mahout?

Thank you

Pricila

-------------------------
hduser@master:~/hadoop$ $MAHOUT_HOME/bin/mahout split --input atividade
--testOutput atividade/teste --randomSelectionPct 20 --overwrite
--mapRedOutputDir atividade/outputDir
Warning: $HADOOP_HOME is deprecated.

Running on hadoop, using /home/hduser/hadoop/bin/hadoop and HADOOP_CONF_DIR=
MAHOUT-JOB: /home/hduser/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar
Warning: $HADOOP_HOME is deprecated.

12/07/27 10:21:27 WARN driver.MahoutDriver: No split.props found on classpath, will use command-line arguments only
12/07/27 10:21:28 INFO common.AbstractJob: Command line arguments: {--endPhase=[2147483647], --input=[atividade], --mapRedOutputDir=[atividade/outputDir], --method=[mapreduce], --overwrite=null, --randomSelectionPct=[20], --startPhase=[0], --tempDir=[temp], --testOutput=[atividade/teste]}
Exception in thread "main" java.lang.IllegalStateException: Couldn't determine class of the input values
at org.apache.mahout.utils.SplitInputJob.run(SplitInputJob.java:104)
at org.apache.mahout.utils.SplitInput.splitDirectory(SplitInput.java:286)
at org.apache.mahout.utils.SplitInput.splitDirectory(SplitInput.java:265)
at org.apache.mahout.utils.SplitInput.run(SplitInput.java:135)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.mahout.utils.SplitInput.main(SplitInput.java:141)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
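
-------------------------
A possible reading of the trace, offered as an assumption rather than a
confirmed diagnosis: the arguments show --method=[mapreduce], and the
exception is raised in SplitInputJob, which as far as I know expects
SequenceFile input and looks at the first record to learn the key and value
classes. A plain .txt file would give it nothing to inspect.

As for another way to split: if all that is needed is a random 20% hold-out
of a line-oriented text file, it can be done outside Mahout. The sketch
below is Python, not Mahout code. The function split_file, its parameters,
and the file names are hypothetical and only illustrate the idea behind
--randomSelectionPct 20.

```python
import random


def split_file(input_path, train_path, test_path, test_pct=20, seed=42):
    """Stream input_path line by line and route each line at random.

    Each line goes to test_path with probability test_pct / 100 and to
    train_path otherwise. Files are opened in binary mode so lines are
    copied byte for byte and a 1 GB input is never held in memory.
    Returns (train_count, test_count).
    """
    rng = random.Random(seed)  # fixed seed makes the split repeatable
    train_count = 0
    test_count = 0
    with open(input_path, "rb") as src, \
            open(train_path, "wb") as train, \
            open(test_path, "wb") as test:
        for line in src:
            if rng.random() * 100 < test_pct:
                test.write(line)
                test_count += 1
            else:
                train.write(line)
                train_count += 1
    return train_count, test_count
```

Because the choice is made per line, the test share is only approximately
20%, not exact. The two output files could then be copied into HDFS with
hadoop fs -put, for example
split_file("atividade.txt", "treino.txt", "teste.txt") followed by
hadoop fs -put teste.txt atividade/teste (paths here are placeholders).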