Regain 2.1.0-STABLE API
java.lang.Object
  net.sf.regain.crawler.document.AbstractPreparator
    net.sf.regain.crawler.preparator.AbstractJacobMsOfficePreparator
      net.sf.regain.crawler.preparator.JacobMsExcelPreparator
public class JacobMsExcelPreparator
Prepares a Microsoft Excel document for indexing using the Jacob API, with Jacobgen used to simplify access.
In the process, the raw data of the document is stripped of formatting information and the title is extracted.
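The methods documented on this page (init, prepare, close) suggest a simple lifecycle: initialize once, prepare each document, then free resources. The following is a minimal sketch of that call order only. `SketchPreparator`, `RecordingPreparator`, and the string-based config and document arguments are hypothetical stand-ins, not Regain types, and the real methods additionally declare `RegainException`.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for the preparator lifecycle; not a Regain type. */
interface SketchPreparator {
    void init(String config);
    void prepare(String rawDocument);
    void close();
}

class PreparatorLifecycleSketch {

    /** Records each lifecycle call so the order can be inspected. */
    static class RecordingPreparator implements SketchPreparator {
        final List<String> calls = new ArrayList<>();

        public void init(String config) { calls.add("init"); }
        public void prepare(String rawDocument) { calls.add("prepare:" + rawDocument); }
        public void close() { calls.add("close"); }
    }

    /** Initializes once, prepares every document, and always closes afterwards. */
    static void run(SketchPreparator preparator, List<String> documents) {
        preparator.init("hypothetical-config");
        try {
            for (String document : documents) {
                preparator.prepare(document);
            }
        } finally {
            // close() runs even if one of the prepare() calls throws.
            preparator.close();
        }
    }

    public static void main(String[] args) {
        RecordingPreparator preparator = new RecordingPreparator();
        run(preparator, Arrays.asList("a.xls", "b.xls"));
        System.out.println(preparator.calls);
    }
}
```

Placing `close()` in a `finally` block matters for a COM-backed preparator, since an application instance left open would otherwise outlive the crawler run.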
| Field Summary | |
|---|---|
| private de.filiadata.lucene.spider.generated.msoffice2000.excel.Application | mExcelApplication: The Excel application. |
| Fields inherited from interface net.sf.regain.crawler.document.Preparator |
|---|
DEFAULT_BUFFER_SIZE |
| Constructor Summary | |
|---|---|
| JacobMsExcelPreparator() | Creates a new instance of JacobMsExcelPreparator. |
| Method Summary | |
|---|---|
| void | close(): Frees all resources reserved by the preparator. |
| de.filiadata.lucene.spider.generated.msoffice2000.excel.Range | getCells(de.filiadata.lucene.spider.generated.msoffice2000.excel.Worksheet sheet, int row, int col): Wrapper for calling the ActiveX method with input parameter(s). |
| void | init(PreparatorConfig config): Initializes the preparator. |
| void | prepare(RawDocument rawDocument): Prepares a document for indexing. |
| Methods inherited from class net.sf.regain.crawler.preparator.AbstractJacobMsOfficePreparator |
|---|
readProperties |
| Methods inherited from class net.sf.regain.crawler.document.AbstractPreparator |
|---|
accepts, addAdditionalField, cleanUp, concatenateStringParts, getAdditionalFields, getCleanedContent, getCleanedMetaData, getHeadlines, getPath, getPriority, getSummary, getTitle, setCleanedContent, setCleanedMetaData, setHeadlines, setPath, setPriority, setSummary, setTitle, setUrlRegex |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
private de.filiadata.lucene.spider.generated.msoffice2000.excel.Application mExcelApplication
The Excel application. It is null as long as no document has been processed yet.
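A field that stays null until the first document is processed points to lazy initialization. The sketch below shows one way such a field could be managed. It is an assumption about the pattern, not the actual implementation of JacobMsExcelPreparator, and `LazyApplicationSketch` and its methods are hypothetical names.

```java
import java.util.function.Supplier;

/** Hypothetical holder that starts an application object on first use. */
class LazyApplicationSketch<T> {
    private final Supplier<T> factory;
    private T application;   // null until the first document is processed
    private int startCount;  // how often the factory was invoked

    LazyApplicationSketch(Supplier<T> factory) {
        this.factory = factory;
    }

    /** Returns the application, creating it only if it does not exist yet. */
    T getOrStart() {
        if (application == null) {
            application = factory.get();
            startCount++;
        }
        return application;
    }

    boolean isStarted() {
        return application != null;
    }

    int getStartCount() {
        return startCount;
    }

    /** Drops the reference; a later getOrStart() would start a new instance. */
    void close() {
        application = null;
    }

    public static void main(String[] args) {
        LazyApplicationSketch<String> lazy = new LazyApplicationSketch<>(() -> "excel");
        System.out.println(lazy.isStarted());
        lazy.getOrStart();
        lazy.getOrStart();
        System.out.println(lazy.getStartCount());
    }
}
```

Deferring the start this way avoids launching an external application when the crawler never encounters a matching document.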
| Constructor Detail |
|---|
public JacobMsExcelPreparator()
throws RegainException
Throws:
RegainException - If creating the preparator failed.
| Method Detail |
|---|
public void init(PreparatorConfig config)
throws RegainException
Specified by:
init in interface Pluggable
Overrides:
init in class AbstractJacobMsOfficePreparator
Parameters:
config - The configuration
Throws:
RegainException - If the configuration has an error.
public de.filiadata.lucene.spider.generated.msoffice2000.excel.Range getCells(de.filiadata.lucene.spider.generated.msoffice2000.excel.Worksheet sheet,
int row,
int col)
Parameters:
row - an input parameter of type int
col - an input parameter of type int
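To illustrate how a (sheet, row, col) accessor like getCells could be used to gather plain text from a worksheet, here is a minimal sketch against an in-memory grid. `SketchSheet`, `collectText`, and the String return type are hypothetical; the real method returns a generated Range object, and the 1-based indexing here follows the usual Excel convention, which this page does not state explicitly.

```java
class CellTextSketch {

    /** Hypothetical in-memory worksheet; stands in for the generated Worksheet type. */
    static class SketchSheet {
        private final String[][] cells;

        SketchSheet(String[][] cells) {
            this.cells = cells;
        }

        int rowCount() {
            return cells.length;
        }

        int colCount(int row) {
            return cells[row - 1].length;
        }
    }

    /** Mirrors the (sheet, row, col) parameter order; indices are assumed 1-based. */
    static String getCells(SketchSheet sheet, int row, int col) {
        return sheet.cells[row - 1][col - 1];
    }

    /** Joins all non-blank cell values with single spaces, row by row. */
    static String collectText(SketchSheet sheet) {
        StringBuilder text = new StringBuilder();
        for (int row = 1; row <= sheet.rowCount(); row++) {
            for (int col = 1; col <= sheet.colCount(row); col++) {
                String value = getCells(sheet, row, col);
                if (value != null && !value.trim().isEmpty()) {
                    if (text.length() > 0) {
                        text.append(' ');
                    }
                    text.append(value.trim());
                }
            }
        }
        return text.toString();
    }

    public static void main(String[] args) {
        SketchSheet sheet = new SketchSheet(new String[][] {
            {"Name", "Qty"},
            {"Bolt", null},
        });
        System.out.println(collectText(sheet));
    }
}
```

Skipping null and blank cells keeps the collected text free of empty fragments, which is in line with stripping formatting information before indexing.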
public void prepare(RawDocument rawDocument)
throws RegainException
Parameters:
rawDocument - The document to prepare.
Throws:
RegainException - If the preparation failed.
public void close()
throws RegainException
Called at the end of the crawler process, after all documents have been processed.
Specified by:
close in interface Preparator
Overrides:
close in class AbstractPreparator
Throws:
RegainException - If freeing the resources failed.