|
Regain 2.1.0-STABLE API | ||||||||
java.lang.Object
  net.sf.regain.crawler.document.AbstractPreparator
    net.sf.regain.crawler.preparator.AbstractJacobMsOfficePreparator
      net.sf.regain.crawler.preparator.JacobMsExcelPreparator
public class JacobMsExcelPreparator
Prepares a Microsoft Excel document for indexing using the Jacob API; Jacobgen was used to simplify access.
In the process, the document's raw data is stripped of formatting information and the title is extracted.
Field Summary | |
---|---|
private de.filiadata.lucene.spider.generated.msoffice2000.excel.Application | mExcelApplication - The Excel application. |
Fields inherited from interface net.sf.regain.crawler.document.Preparator |
---|
DEFAULT_BUFFER_SIZE |
Constructor Summary | |
---|---|
JacobMsExcelPreparator()
Creates a new instance of JacobMsExcelPreparator. |
Method Summary | |
---|---|
void | close() - Frees all resources reserved by the preparator. |
de.filiadata.lucene.spider.generated.msoffice2000.excel.Range | getCells(de.filiadata.lucene.spider.generated.msoffice2000.excel.Worksheet sheet, int row, int col) - Wrapper for calling the ActiveX method with input parameter(s). |
void | init(PreparatorConfig config) - Initializes the preparator. |
void | prepare(RawDocument rawDocument) - Prepares a document for indexing. |
Methods inherited from class net.sf.regain.crawler.preparator.AbstractJacobMsOfficePreparator |
---|
readProperties |
Methods inherited from class net.sf.regain.crawler.document.AbstractPreparator |
---|
accepts, addAdditionalField, cleanUp, concatenateStringParts, getAdditionalFields, getCleanedContent, getCleanedMetaData, getHeadlines, getPath, getPriority, getSummary, getTitle, setCleanedContent, setCleanedMetaData, setHeadlines, setPath, setPriority, setSummary, setTitle, setUrlRegex |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
private de.filiadata.lucene.spider.generated.msoffice2000.excel.Application mExcelApplication
The Excel application. Is null as long as no document has been processed.
Constructor Detail |
---|
public JacobMsExcelPreparator() throws RegainException
Creates a new instance of JacobMsExcelPreparator.
Throws:
RegainException - If creating the preparator failed.
Method Detail |
---|
public void init(PreparatorConfig config) throws RegainException
Initializes the preparator.
Specified by:
init in interface Pluggable
Overrides:
init in class AbstractJacobMsOfficePreparator
Parameters:
config - The configuration.
Throws:
RegainException - If the configuration has an error.
public de.filiadata.lucene.spider.generated.msoffice2000.excel.Range getCells(de.filiadata.lucene.spider.generated.msoffice2000.excel.Worksheet sheet, int row, int col)
Wrapper for calling the ActiveX method with input parameter(s).
Parameters:
row - an input parameter of type int
col - an input parameter of type int
public void prepare(RawDocument rawDocument) throws RegainException
Prepares a document for indexing.
Parameters:
rawDocument - The document to prepare.
Throws:
RegainException - If the preparation failed.
public void close() throws RegainException
Frees all resources reserved by the preparator.
Is called at the end of the crawler process after all documents were processed.
Specified by:
close in interface Preparator
Overrides:
close in class AbstractPreparator
Throws:
RegainException - If freeing the resources failed.
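The methods documented above imply a lifecycle: init is called once with the configuration, prepare once per document, and close at the end of the crawler process. The following is a minimal, self-contained sketch of that call order. It does not use the real regain classes: SketchPreparator, SketchException, RecordingPreparator, and crawl are hypothetical stand-ins, with plain strings in place of PreparatorConfig and RawDocument. Calling close in a finally block and stopping at the first failed document are assumptions of this sketch, not documented crawler behavior.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PreparatorLifecycleSketch {

    /** Hypothetical stand-in for RegainException. */
    static class SketchException extends Exception {
        SketchException(String message) { super(message); }
    }

    /** Hypothetical, simplified lifecycle contract (not the real Preparator interface). */
    interface SketchPreparator {
        void init(String config) throws SketchException;
        void prepare(String rawDocument) throws SketchException;
        void close() throws SketchException;
    }

    /** Records every lifecycle call so the call order can be inspected. */
    static class RecordingPreparator implements SketchPreparator {
        final List<String> calls = new ArrayList<>();

        public void init(String config) { calls.add("init"); }

        public void prepare(String rawDocument) throws SketchException {
            if (rawDocument.isEmpty()) {
                throw new SketchException("empty document");
            }
            calls.add("prepare:" + rawDocument);
        }

        public void close() { calls.add("close"); }
    }

    /** Calls init once, prepare per document, and close once, even if a prepare call fails. */
    static List<String> crawl(RecordingPreparator preparator, List<String> documents) {
        try {
            preparator.init("config");
            try {
                for (String doc : documents) {
                    preparator.prepare(doc);
                }
            } finally {
                // Assumption: resources are released regardless of earlier failures.
                preparator.close();
            }
        } catch (SketchException e) {
            preparator.calls.add("error:" + e.getMessage());
        }
        return preparator.calls;
    }

    public static void main(String[] args) {
        // Prints [init, prepare:a.xls, prepare:b.xls, close]
        System.out.println(crawl(new RecordingPreparator(), Arrays.asList("a.xls", "b.xls")));
    }
}
```

For a preparator that holds an external application instance, as mExcelApplication does here, releasing it in one place at the end avoids starting and stopping the application for every document.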