org.archive.wayback.resourceindex.indexer.hadoop
Class Driver

java.lang.Object
  extended by org.archive.wayback.resourceindex.indexer.hadoop.Driver

public class Driver
extends java.lang.Object

Hadoop driver that generates alphabetically partitioned Wayback CDX files.
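To illustrate what "alphabetically partitioned" output could mean, the following is a minimal sketch, not the Driver's actual implementation. It assigns each key to a partition by comparing it against a sorted list of split points, so that every key in partition i sorts before every key in partition i+1. The class name AlphaPartitionSketch, the method partitionFor, and the split values are all hypothetical and do not come from the Wayback source.

```java
import java.util.Arrays;

public class AlphaPartitionSketch {

    /**
     * Returns the partition index for a key, given sorted split points.
     * Keys below splits[0] go to partition 0; keys at or above the last
     * split go to partition splits.length. (Hypothetical helper.)
     */
    static int partitionFor(String key, String[] sortedSplits) {
        int pos = Arrays.binarySearch(sortedSplits, key);
        // A non-negative result means the key equals a split point, which
        // starts the next partition. A negative result encodes the
        // insertion point as -(insertionPoint) - 1.
        return pos >= 0 ? pos + 1 : -(pos + 1);
    }

    public static void main(String[] args) {
        // Hypothetical split points producing four partitions.
        String[] splits = {"g", "n", "t"};
        String[] keys = {"archive.org/", "example.com/", "wikipedia.org/"};
        for (String key : keys) {
            System.out.println(key + " -> partition " + partitionFor(key, splits));
        }
    }
}
```

With range-based assignment like this, concatenating the per-partition sorted outputs in partition order yields one globally sorted result, which is the usual reason for partitioning by key range rather than by hash.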

Version:
$Date$, $Revision$
Author:
brad

Nested Class Summary
static class Driver.MapClass
          Mapper which converts an ARCRecord into a CDX line.
 
Constructor Summary
Driver()
           
 
Method Summary
static void main(java.lang.String[] args)
          The main driver for the sort program.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

Driver

public Driver()
Method Detail

main

public static void main(java.lang.String[] args)
                 throws java.io.IOException
The main driver for the sort program. Invoke this method to submit the map/reduce job.

Parameters:
args - command-line arguments
Throws:
java.io.IOException - When there are communication problems with the job tracker.


Copyright © 2005-2009 Internet Archive. All Rights Reserved.