org.archive.wayback.hadoop
Class CDXSort.CDXMapClass
java.lang.Object
org.apache.hadoop.mapred.MapReduceBase
org.archive.wayback.hadoop.CDXSort.CDXMapClass
- All Implemented Interfaces:
- Closeable, org.apache.hadoop.mapred.JobConfigurable, org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
- Enclosing class:
- CDXSort
public static class CDXSort.CDXMapClass
- extends org.apache.hadoop.mapred.MapReduceBase
- implements org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
Mapper which reads a canonicalized CDX line, splitting into: key - URL +
timestamp val - everything else
- Version:
- $Date: 2010-09-29 05:28:38 +0700 (Wed, 29 Sep 2010) $, $Revision: 3262 $
- Author:
- brad
|
Method Summary |
void |
map(org.apache.hadoop.io.LongWritable lineNumber,
org.apache.hadoop.io.Text line,
org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,org.apache.hadoop.io.Text> output,
org.apache.hadoop.mapred.Reporter reporter)
|
| Methods inherited from class org.apache.hadoop.mapred.MapReduceBase |
close, configure |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Methods inherited from interface org.apache.hadoop.mapred.JobConfigurable |
configure |
CDXSort.CDXMapClass
public CDXSort.CDXMapClass()
map
public void map(org.apache.hadoop.io.LongWritable lineNumber,
org.apache.hadoop.io.Text line,
org.apache.hadoop.mapred.OutputCollector<org.apache.hadoop.io.Text,org.apache.hadoop.io.Text> output,
org.apache.hadoop.mapred.Reporter reporter)
throws IOException
- Specified by:
map in interface org.apache.hadoop.mapred.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
- Throws:
IOException
Copyright © 2005-2011 Internet Archive. All Rights Reserved.