org.archive.access.nutch
Class ImportArcs.WaxParseOutputFormat
java.lang.Object
org.apache.nutch.parse.ParseOutputFormat
org.archive.access.nutch.ImportArcs.WaxParseOutputFormat
- All Implemented Interfaces:
- org.apache.hadoop.mapred.OutputFormat
- Enclosing class:
- ImportArcs
public static class ImportArcs.WaxParseOutputFormat
- extends org.apache.nutch.parse.ParseOutputFormat
Copy of org.apache.nutch.parse.ParseOutputFormat, made so that a collection
prefix can be added to the produced signature and link CrawlDatums.
- Author:
- stack
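The idea of a collection prefix can be illustrated with a small standalone sketch. It is not the NutchWAX code: the class name CollectionKeySketch, the helper prefixKey, and the key layout (collection name, one space, then the URL) are all assumptions made for illustration. It only shows why prefixing helps: the same URL imported into two collections produces two distinct keys instead of one colliding entry.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical sketch; names and key layout are assumptions, not the NutchWAX implementation. */
public class CollectionKeySketch {

    /**
     * Builds a key of the assumed form "collection url".
     * Falls back to the bare URL when no collection is given.
     */
    public static String prefixKey(String collection, String url) {
        if (collection == null || collection.isEmpty()) {
            return url;
        }
        return collection + " " + url;
    }

    public static void main(String[] args) {
        // Stand-in for a store keyed by record key; the values are placeholder signatures.
        Map<String, String> records = new LinkedHashMap<>();
        records.put(prefixKey("collectionA", "http://example.org/"), "signature-1");
        records.put(prefixKey("collectionB", "http://example.org/"), "signature-2");

        // Both entries survive because the prefixed keys differ.
        for (Map.Entry<String, String> e : records.entrySet()) {
            System.out.println(e.getKey() + " -> " + e.getValue());
        }
    }
}
```

Without the prefix, the second put would overwrite the first, since both records share the same URL.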
Field Summary
org.apache.commons.logging.Log LOG
Method Summary
org.apache.hadoop.mapred.RecordWriter getRecordWriter(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.mapred.JobConf job,
java.lang.String name,
org.apache.hadoop.util.Progressable progress)
Methods inherited from class org.apache.nutch.parse.ParseOutputFormat
checkOutputSpecs
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
LOG
public final org.apache.commons.logging.Log LOG
ImportArcs.WaxParseOutputFormat
public ImportArcs.WaxParseOutputFormat()
getRecordWriter
public org.apache.hadoop.mapred.RecordWriter getRecordWriter(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.mapred.JobConf job,
java.lang.String name,
org.apache.hadoop.util.Progressable progress)
throws java.io.IOException
- Specified by:
- getRecordWriter in interface org.apache.hadoop.mapred.OutputFormat
- Overrides:
- getRecordWriter in class org.apache.nutch.parse.ParseOutputFormat
- Throws:
- java.io.IOException
Copyright © 2005-2007 Internet Archive. All Rights Reserved.