net.sf.regain.crawler.preparator
Class DispatcherPreparator
java.lang.Object
net.sf.regain.crawler.document.AbstractPreparator
net.sf.regain.crawler.preparator.DispatcherPreparator
- All Implemented Interfaces:
- Pluggable, Preparator, WriteablePreparator
public class DispatcherPreparator
- extends AbstractPreparator
- Author:
- Florian Schluefter
This Class is a Preparaor for files with wrong content. For instance a *.doc
file witch contains rtf data, or a *.xls file that contains xml data.
This Preparator should identify the real content and pass the file to the
right preparator.
Methods inherited from class net.sf.regain.crawler.document.AbstractPreparator |
accepts, addAdditionalField, cleanUp, close, concatenateStringParts, getAdditionalFields, getCleanedContent, getCleanedMetaData, getHeadlines, getPath, getPriority, getSummary, getTitle, setCleanedContent, setCleanedMetaData, setHeadlines, setPath, setPriority, setSummary, setTitle, setUrlRegex |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
_signature_offset
public static final int _signature_offset
- See Also:
- Constant Field Values
signatures
HashMap signatures
mLog
private static org.apache.log4j.Logger mLog
- The logger for this class
DispatcherPreparator
public DispatcherPreparator()
throws RegainException
- Creates a new instance of DispatcherPreparator.
- Throws:
RegainException
- If creating the preparator failed.
init
public void init(PreparatorConfig config)
throws RegainException
- Liest die Konfiguration des Pr�partors aus.
- Specified by:
init
in interface Pluggable
- Overrides:
init
in class AbstractPreparator
- Parameters:
config
- Präparator-Konfiguration, d.h. der Abschnitt zum
DispatcherPreprarator
- Throws:
RegainException
- Wenn die Pr�paration fehl schlug.
prepare
public void prepare(RawDocument rawDocument)
throws RegainException
- Präpariert ein Dokument für die Indizierung.
- Parameters:
rawDocument
- Das zu pr�pariernde Dokument.
- Throws:
RegainException
- Wenn die Pr�paration fehl schlug.
Regain 2.1.0-STABLE, Copyright (C) 2004-2010 Til Schneider, www.murfman.de, Thomas Tesche, www.clustersystems.info