Regain 2.1.0-STABLE API

net.sf.regain
Class RegainToolkit.WrapperAnalyzer

java.lang.Object
  extended by org.apache.lucene.analysis.Analyzer
      extended by net.sf.regain.RegainToolkit.WrapperAnalyzer
All Implemented Interfaces:
Closeable
Enclosing class:
RegainToolkit

private static class RegainToolkit.WrapperAnalyzer
extends org.apache.lucene.analysis.Analyzer

An analyzer that changes a document in lowercase before delivering it to a nested analyzer. For the field "groups" an analyzer is used that only tokenizes the input without stemming the tokens.


Field Summary
private  org.apache.lucene.analysis.Analyzer mNestedAnalyzer
          The nested analyzer.
private  org.apache.lucene.analysis.Analyzer mNoStemmingAnalyzer
          The analyzer to use for a field that shouldn't be stemmed.
private  HashSet<String> mUntokenizedFieldNames
          The names of the fields that should not be tokenized.
 
Constructor Summary
RegainToolkit.WrapperAnalyzer(org.apache.lucene.analysis.Analyzer nestedAnalyzer, String[] untokenizedFieldNames)
          Creates a new instance of WrapperAnalyzer.
 
Method Summary
 org.apache.lucene.analysis.TokenStream tokenStream(String fieldName, Reader reader)
          Creates a TokenStream which tokenizes all the text in the provided Reader.
 
Methods inherited from class org.apache.lucene.analysis.Analyzer
close, getOffsetGap, getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

mNoStemmingAnalyzer

private org.apache.lucene.analysis.Analyzer mNoStemmingAnalyzer
The analyzer to use for a field that shouldn't be stemmed.


mNestedAnalyzer

private org.apache.lucene.analysis.Analyzer mNestedAnalyzer
The nested analyzer.


mUntokenizedFieldNames

private HashSet<String> mUntokenizedFieldNames
The names of the fields that should not be tokenized.

Constructor Detail

RegainToolkit.WrapperAnalyzer

public RegainToolkit.WrapperAnalyzer(org.apache.lucene.analysis.Analyzer nestedAnalyzer,
                                     String[] untokenizedFieldNames)
Creates a new instance of WrapperAnalyzer.

Parameters:
nestedAnalyzer - The nested analyzer.
untokenizedFieldNames - The names of the fields that should not be tokenized.
Method Detail

tokenStream

public org.apache.lucene.analysis.TokenStream tokenStream(String fieldName,
                                                          Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.

Specified by:
tokenStream in class org.apache.lucene.analysis.Analyzer

Regain 2.1.0-STABLE API

Regain 2.1.0-STABLE, Copyright (C) 2004-2010 Til Schneider, www.murfman.de, Thomas Tesche, www.clustersystems.info