org.archive.wayback.hadoop
Class CDXSort.CDXCanonicalizerMapClass

java.lang.Object
  extended by org.apache.hadoop.mapred.MapReduceBase
      extended by org.archive.wayback.hadoop.CDXSort.CDXCanonicalizerMapClass
All Implemented Interfaces:
Closeable, org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
Enclosing class:
CDXSort

public static class CDXSort.CDXCanonicalizerMapClass
extends org.apache.hadoop.mapred.MapReduceBase
implements org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>

Mapper which reads an identity CDX line, outputting: key - canonicalized original URL + timestamp val - everything else

Version:
$Date: 2010-09-29 05:28:38 +0700 (Wed, 29 Sep 2010) $, $Revision: 3262 $
Author:
brad

Constructor Summary
CDXSort.CDXCanonicalizerMapClass()
           
 
Method Summary
 void map(org.apache.hadoop.io.LongWritable lineNumber, org.apache.hadoop.io.Text line, org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,org.apache.hadoop.io.Text> output, org.apache.hadoop.mapred.Reporter reporter)
           
 
Methods inherited from class org.apache.hadoop.mapred.MapReduceBase
close, configure
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.hadoop.mapred.JobConfigurable
configure
 
Methods inherited from interface java.io.Closeable
close
 

Constructor Detail

CDXSort.CDXCanonicalizerMapClass

public CDXSort.CDXCanonicalizerMapClass()
Method Detail

map

public void map(org.apache.hadoop.io.LongWritable lineNumber,
                org.apache.hadoop.io.Text line,
                org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,org.apache.hadoop.io.Text> output,
                org.apache.hadoop.mapred.Reporter reporter)
         throws IOException
Specified by:
map in interface org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
Throws:
IOException


Copyright © 2005-2011 Internet Archive. All Rights Reserved.