Regain 2.1.0-STABLE API

net.sf.regain.crawler.preparator
Class SwingRtfPreparator

java.lang.Object
  extended by net.sf.regain.crawler.document.AbstractPreparator
      extended by net.sf.regain.crawler.preparator.SwingRtfPreparator
All Implemented Interfaces:
Pluggable, Preparator, WriteablePreparator

public class SwingRtfPreparator
extends AbstractPreparator

Präpariert ein RTF-Dokument für die Indizierung. Dazu wird der RTF-Parser von Swing genutzt.

Dabei werden die Rohdaten des Dokuments von Formatierungsinformation befreit.

Author:
Til Schneider, www.murfman.de

Field Summary
private  RTFEditorKit mRTFEditorKit
          Das RTFEditorKit, das zum laden von RTF-Dokumenten verwendet wird.
 
Fields inherited from interface net.sf.regain.crawler.document.Preparator
DEFAULT_BUFFER_SIZE
 
Constructor Summary
SwingRtfPreparator()
          Creates a new instance of SwingRtfPreparator.
 
Method Summary
 void close()
          Frees all resources reserved by the preparator.
 void prepare(RawDocument rawDocument)
          Präpariert ein Dokument für die Indizierung.
 
Methods inherited from class net.sf.regain.crawler.document.AbstractPreparator
accepts, addAdditionalField, cleanUp, concatenateStringParts, getAdditionalFields, getCleanedContent, getCleanedMetaData, getHeadlines, getPath, getPriority, getSummary, getTitle, init, setCleanedContent, setCleanedMetaData, setHeadlines, setPath, setPriority, setSummary, setTitle, setUrlRegex
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

mRTFEditorKit

private RTFEditorKit mRTFEditorKit
Das RTFEditorKit, das zum laden von RTF-Dokumenten verwendet wird.

Constructor Detail

SwingRtfPreparator

public SwingRtfPreparator()
                   throws RegainException
Creates a new instance of SwingRtfPreparator.

Throws:
RegainException - If creating the preparator failed.
Method Detail

prepare

public void prepare(RawDocument rawDocument)
             throws RegainException
Präpariert ein Dokument für die Indizierung.

Parameters:
rawDocument - Das zu pr�pariernde Dokument.
Throws:
RegainException - Wenn die Pr�paration fehl schlug.

close

public void close()
           throws RegainException
Frees all resources reserved by the preparator.

Is called at the end of the crawler process after all documents were processed.

Specified by:
close in interface Preparator
Overrides:
close in class AbstractPreparator
Throws:
RegainException - If freeing the resources failed.

Regain 2.1.0-STABLE API

Regain 2.1.0-STABLE, Copyright (C) 2004-2010 Til Schneider, www.murfman.de, Thomas Tesche, www.clustersystems.info