org.archive.wayback.resourceindex
Class NutchResourceIndex

java.lang.Object
  extended by org.archive.wayback.resourceindex.NutchResourceIndex
All Implemented Interfaces:
ResourceIndex

public class NutchResourceIndex
extends java.lang.Object
implements ResourceIndex

Version:
$Date: 2008-08-12 18:07:20 -0700 (Tue, 12 Aug 2008) $, $Revision: 2542 $
Author:
brad

Constructor Summary
NutchResourceIndex()
           
 
Method Summary
protected  org.w3c.dom.Document getHttpDocument(java.lang.String url)
           
 int getMaxRecords()
           
protected  java.lang.String getNodeContent(org.w3c.dom.Element e, java.lang.String key)
           
protected  java.lang.String getNodeNutchContent(org.w3c.dom.Element e, java.lang.String key)
           
protected  java.lang.String getRequestUrl(WaybackRequest wbRequest)
           
protected  org.w3c.dom.NodeList getSearchChannel(org.w3c.dom.Document d)
           
protected  org.w3c.dom.NodeList getSearchItems(org.w3c.dom.Document d)
           
 java.lang.String getSearchUrlBase()
           
 void init()
           
 SearchResults query(WaybackRequest wbRequest)
          Transform a WaybackRequest into a SearchResults object.
 void setMaxRecords(int maxRecords)
           
 void setSearchUrlBase(java.lang.String searchUrlBase)
           
 void shutdown()
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

NutchResourceIndex

public NutchResourceIndex()
Method Detail

init

public void init()
          throws ConfigurationException
Throws:
ConfigurationException

query

public SearchResults query(WaybackRequest wbRequest)
                    throws ResourceIndexNotAvailableException,
                           ResourceNotInArchiveException,
                           BadQueryException,
                           AccessControlException
Description copied from interface: ResourceIndex
Transform a WaybackRequest into a SearchResults object.

Specified by:
query in interface ResourceIndex
Returns:
SearchResults containing the results that match the WaybackRequest
Throws:
ResourceIndexNotAvailableException
ResourceNotInArchiveException
BadQueryException
AccessControlException

getSearchChannel

protected org.w3c.dom.NodeList getSearchChannel(org.w3c.dom.Document d)

getSearchItems

protected org.w3c.dom.NodeList getSearchItems(org.w3c.dom.Document d)
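This page does not say which elements getSearchChannel and getSearchItems select. The sketch below shows one plausible shape, assuming the search response is an RSS 2.0 style document with channel and item elements. The class SearchItemsSketch, the tag-name constants, and the parse helper are hypothetical and are not part of NutchResourceIndex.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

class SearchItemsSketch {
    // Assumed element names, following RSS 2.0 conventions.
    static final String CHANNEL_TAG = "channel";
    static final String ITEM_TAG = "item";

    // Returns every element named "channel" anywhere in the document.
    public static NodeList getSearchChannel(Document d) {
        return d.getElementsByTagName(CHANNEL_TAG);
    }

    // Returns every element named "item" anywhere in the document.
    public static NodeList getSearchItems(Document d) {
        return d.getElementsByTagName(ITEM_TAG);
    }

    // Hypothetical helper: parse an XML string, wrapping checked exceptions.
    public static Document parse(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            return factory.newDocumentBuilder().parse(
                new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception ex) {
            throw new IllegalStateException("could not parse XML", ex);
        }
    }

    public static void main(String[] args) {
        Document d = parse("<rss><channel><item/><item/></channel></rss>");
        System.out.println(getSearchChannel(d).getLength() + " channel(s), "
            + getSearchItems(d).getLength() + " item(s)");
    }
}
```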

getRequestUrl

protected java.lang.String getRequestUrl(WaybackRequest wbRequest)
                                  throws BadQueryException
Throws:
BadQueryException
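The format of the URL returned by getRequestUrl is not described here. As a rough sketch only, a search URL could be assembled from a base URL, a URL-encoded query string, and a record limit. The parameter names query and hitsPerPage, the class RequestUrlSketch, and the use of IllegalArgumentException in place of BadQueryException are all assumptions made for illustration.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

class RequestUrlSketch {
    // Builds base + "?query=<encoded>&hitsPerPage=<n>".
    // The parameter names are hypothetical, not taken from this page.
    public static String buildRequestUrl(String searchUrlBase, String query, int maxRecords) {
        if (query == null || query.isEmpty()) {
            // Stand-in for BadQueryException, which lives in the wayback library.
            throw new IllegalArgumentException("empty query");
        }
        try {
            return searchUrlBase
                + "?query=" + URLEncoder.encode(query, "UTF-8")
                + "&hitsPerPage=" + maxRecords;
        } catch (UnsupportedEncodingException ex) {
            // UTF-8 support is required of every Java platform.
            throw new IllegalStateException("UTF-8 unavailable", ex);
        }
    }

    public static void main(String[] args) {
        System.out.println(buildRequestUrl("http://localhost:8080/opensearch", "example.org page", 10));
    }
}
```

Encoding the query before concatenation keeps characters such as spaces, ampersands, and colons from corrupting the query string.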

getNodeNutchContent

protected java.lang.String getNodeNutchContent(org.w3c.dom.Element e,
                                               java.lang.String key)

getNodeContent

protected java.lang.String getNodeContent(org.w3c.dom.Element e,
                                          java.lang.String key)

getHttpDocument

protected org.w3c.dom.Document getHttpDocument(java.lang.String url)
                                        throws java.io.IOException,
                                               org.xml.sax.SAXException
Throws:
java.io.IOException
org.xml.sax.SAXException
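The declared exceptions suggest that getHttpDocument retrieves a URL and parses the response as XML. The following is a sketch of that idea using only the standard JAXP API, not the actual implementation; the class HttpDocumentSketch and the choice to rethrow ParserConfigurationException as an IOException are assumptions.

```java
import java.io.IOException;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.xml.sax.SAXException;

class HttpDocumentSketch {
    // Fetches the resource at url and parses it into a DOM Document.
    // IOException signals a retrieval failure, SAXException malformed XML.
    public static Document getHttpDocument(String url) throws IOException, SAXException {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        try {
            DocumentBuilder builder = factory.newDocumentBuilder();
            // DocumentBuilder.parse(String) treats its argument as a URI.
            return builder.parse(url);
        } catch (ParserConfigurationException ex) {
            // Assumption: surface parser setup problems as an I/O failure.
            throw new IOException("XML parser unavailable", ex);
        }
    }
}
```

Because the argument is treated as a URI, the same method also accepts file: URIs, which is convenient for testing without a running search server.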

getSearchUrlBase

public java.lang.String getSearchUrlBase()
Returns:
the searchUrlBase

setSearchUrlBase

public void setSearchUrlBase(java.lang.String searchUrlBase)
Parameters:
searchUrlBase - the searchUrlBase to set

getMaxRecords

public int getMaxRecords()
Returns:
the maxRecords

setMaxRecords

public void setMaxRecords(int maxRecords)
Parameters:
maxRecords - the maxRecords to set

shutdown

public void shutdown()
              throws java.io.IOException
Specified by:
shutdown in interface ResourceIndex
Throws:
java.io.IOException


Copyright © 2005-2009 Internet Archive. All Rights Reserved.