org.archive.access.nutch
Class ImportArcs.WaxFetcherOutputFormat

java.lang.Object
  extended by org.apache.nutch.fetcher.FetcherOutputFormat
      extended by org.archive.access.nutch.ImportArcs.WaxFetcherOutputFormat
All Implemented Interfaces:
org.apache.hadoop.mapred.OutputFormat
Enclosing class:
ImportArcs

public static class ImportArcs.WaxFetcherOutputFormat
extends org.apache.nutch.fetcher.FetcherOutputFormat

Override of nutch FetcherOutputFormat so I can substitute my own ParseOutputFormat, ImportArcs.WaxParseOutputFormat. While I'm here, removed content references. NutchWAX doesn't save content.

Author:
stack

Constructor Summary
ImportArcs.WaxFetcherOutputFormat()
           
 
Method Summary
 org.apache.hadoop.mapred.RecordWriter getRecordWriter(org.apache.hadoop.fs.FileSystem fs, org.apache.hadoop.mapred.JobConf job, java.lang.String name, org.apache.hadoop.util.Progressable progress)
           
 
Methods inherited from class org.apache.nutch.fetcher.FetcherOutputFormat
checkOutputSpecs
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ImportArcs.WaxFetcherOutputFormat

public ImportArcs.WaxFetcherOutputFormat()
Method Detail

getRecordWriter

public org.apache.hadoop.mapred.RecordWriter getRecordWriter(org.apache.hadoop.fs.FileSystem fs,
                                                             org.apache.hadoop.mapred.JobConf job,
                                                             java.lang.String name,
                                                             org.apache.hadoop.util.Progressable progress)
                                                      throws java.io.IOException
Specified by:
getRecordWriter in interface org.apache.hadoop.mapred.OutputFormat
Overrides:
getRecordWriter in class org.apache.nutch.fetcher.FetcherOutputFormat
Throws:
java.io.IOException


Copyright © 2005-2007 Internet Archive. All Rights Reserved.