Regain 2.1.0-STABLE API

net.sf.regain.crawler.preparator
Class ZipPreparator

java.lang.Object
  extended by net.sf.regain.crawler.document.AbstractPreparator
      extended by net.sf.regain.crawler.preparator.ZipPreparator
All Implemented Interfaces:
Pluggable, Preparator, WriteablePreparator

public class ZipPreparator
extends AbstractPreparator

Prepares archive files (zipped content) for indexing

The following information will be extracted: filename (toLowerCase)

Author:
Thomas Tesche, cluster:Systems CSG GmbH, http://www.clustersystems.info

Field Summary
 
Fields inherited from interface net.sf.regain.crawler.document.Preparator
DEFAULT_BUFFER_SIZE
 
Constructor Summary
ZipPreparator()
          Creates a new instance of ZipPreparator.
 
Method Summary
 void prepare(RawDocument rawDocument)
          Prepares the document for indexing
 
Methods inherited from class net.sf.regain.crawler.document.AbstractPreparator
accepts, addAdditionalField, cleanUp, close, concatenateStringParts, getAdditionalFields, getCleanedContent, getCleanedMetaData, getHeadlines, getPath, getPriority, getSummary, getTitle, init, setCleanedContent, setCleanedMetaData, setHeadlines, setPath, setPriority, setSummary, setTitle, setUrlRegex
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ZipPreparator

public ZipPreparator()
              throws RegainException
Creates a new instance of ZipPreparator.

Throws:
RegainException - If creating the preparator failed.
Method Detail

prepare

public void prepare(RawDocument rawDocument)
             throws RegainException
Prepares the document for indexing

Parameters:
rawDocument - the document
Throws:
RegainException - if preparation goes wrong

Regain 2.1.0-STABLE API

Regain 2.1.0-STABLE, Copyright (C) 2004-2010 Til Schneider, www.murfman.de, Thomas Tesche, www.clustersystems.info