org.archive.access.nutch
Class ImportArcs.WaxParseOutputFormat

java.lang.Object
  extended by org.apache.nutch.parse.ParseOutputFormat
      extended by org.archive.access.nutch.ImportArcs.WaxParseOutputFormat
All Implemented Interfaces:
org.apache.hadoop.mapred.OutputFormat
Enclosing class:
ImportArcs

public static class ImportArcs.WaxParseOutputFormat
extends org.apache.nutch.parse.ParseOutputFormat

Copy so I can add collection prefix to produced signature and link CrawlDatums.

Author:
stack

Field Summary
 org.apache.commons.logging.Log LOG
           
 
Constructor Summary
ImportArcs.WaxParseOutputFormat()
           
 
Method Summary
 org.apache.hadoop.mapred.RecordWriter getRecordWriter(org.apache.hadoop.fs.FileSystem fs, org.apache.hadoop.mapred.JobConf job, java.lang.String name, org.apache.hadoop.util.Progressable progress)
           
 
Methods inherited from class org.apache.nutch.parse.ParseOutputFormat
checkOutputSpecs
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

LOG

public final org.apache.commons.logging.Log LOG
Constructor Detail

ImportArcs.WaxParseOutputFormat

public ImportArcs.WaxParseOutputFormat()
Method Detail

getRecordWriter

public org.apache.hadoop.mapred.RecordWriter getRecordWriter(org.apache.hadoop.fs.FileSystem fs,
                                                             org.apache.hadoop.mapred.JobConf job,
                                                             java.lang.String name,
                                                             org.apache.hadoop.util.Progressable progress)
                                                      throws java.io.IOException
Specified by:
getRecordWriter in interface org.apache.hadoop.mapred.OutputFormat
Overrides:
getRecordWriter in class org.apache.nutch.parse.ParseOutputFormat
Throws:
java.io.IOException


Copyright © 2005-2007 Internet Archive. All Rights Reserved.