org.archive.wayback.hadoop
Class CDXCanonicalizingMapper

java.lang.Object
  extended by org.apache.hadoop.mapreduce.Mapper<Object,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
      extended by org.archive.wayback.hadoop.CDXCanonicalizingMapper
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable

public class CDXCanonicalizingMapper
extends org.apache.hadoop.mapreduce.Mapper<Object,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
implements org.apache.hadoop.conf.Configurable

Author:
brad

Nested Class Summary
 
Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper
org.apache.hadoop.mapreduce.Mapper.Context
 
Field Summary
static int MODE_FULL
           
static int MODE_GLOBAL
           
 
Constructor Summary
CDXCanonicalizingMapper()
           
 
Method Summary
 org.apache.hadoop.conf.Configuration getConf()
           
 void map(Object y, org.apache.hadoop.io.Text value, org.apache.hadoop.mapreduce.Mapper.Context context)
           
 void setConf(org.apache.hadoop.conf.Configuration conf)
           
static void setMapMode(org.apache.hadoop.conf.Configuration conf, int mode)
           
 
Methods inherited from class org.apache.hadoop.mapreduce.Mapper
cleanup, run, setup
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

MODE_GLOBAL

public static int MODE_GLOBAL

MODE_FULL

public static int MODE_FULL
Constructor Detail

CDXCanonicalizingMapper

public CDXCanonicalizingMapper()
Method Detail

map

public void map(Object y,
                org.apache.hadoop.io.Text value,
                org.apache.hadoop.mapreduce.Mapper.Context context)
         throws IOException,
                InterruptedException
Overrides:
map in class org.apache.hadoop.mapreduce.Mapper<Object,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>
Throws:
IOException
InterruptedException

setMapMode

public static void setMapMode(org.apache.hadoop.conf.Configuration conf,
                              int mode)
Parameters:
conf - Configuration for the Job
mode - String mode to use, one of MODE_GLOBAL, MODE_FULL

getConf

public org.apache.hadoop.conf.Configuration getConf()
Specified by:
getConf in interface org.apache.hadoop.conf.Configurable

setConf

public void setConf(org.apache.hadoop.conf.Configuration conf)
Specified by:
setConf in interface org.apache.hadoop.conf.Configurable


Copyright © 2005-2011 Internet Archive. All Rights Reserved.