提问者:小点点

Apache Nutch 2.3:抛出错误失败,退出值为255


我正在使用阿帕奇坚果 2.3 版本。我的Hadoop版本是2.6.0.Hadoop在单节点上运行。

当我按照nitch的命令运行时

./crawl --index ~/test/seed ~/test -1

上述命令的输出如下。

InjectorJob: starting at 2016-01-04 12:03:26
InjectorJob: Injecting urlDir: --index
InjectorJob: Using class org.apache.gora.memory.store.MemStore as the    
Gora storage class.
InjectorJob:    
org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input
path does not exist: file:/usr/local/nutch/runtime/local/bin/--index
at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.listStatus
(FileInputFormat.java:235)
at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.getSplits
(FileInputFormat.java:252)
at org.apache.hadoop.mapred.JobClient.writeNewSplits
(JobClient.java:1054)
at org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:1071)
at org.apache.hadoop.mapred.JobClient.access$700(JobClient.java:179)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:983)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:936)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:422)
at org.apache.hadoop.security.UserGroupInformation.doAs
(UserGroupInformation.java:1190
at org.apache.hadoop.mapred.JobClient.submitJobInternal
(JobClient.java:936)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:550)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:580)
at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:50)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:231)
at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:252)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:275)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:284)

Error running:
/usr/local/nutch/runtime/local/bin/nutch inject --index -crawlId    
/home/jalaj/test/seed
Failed with exit value 255.

坚果有什么问题?我需要安装Apache Gora吗?


共1个答案

匿名用户

问题就在这里:org.apache.hadoop.mapreduce.lib.input.InvalidInputException:输入路径不存在:file:/usr/local/nutch/runtime/local/bin/-index

Nutch尝试读取种子文件,但无法读取。请确保你的命令是正确的。

希望这能有所帮助,

Le Quoc Do公司