org.apache.lucene.analysis.nl
Class DutchStemFilter
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.nl.DutchStemFilter
- All Implemented Interfaces:
- java.io.Closeable
public final class DutchStemFilter
- extends TokenFilter
A TokenFilter
that stems Dutch words.
It supports a table of words that should
not be stemmed at all. The stemmer used can be changed at runtime after the
filter object is created (as long as it is a DutchStemmer
).
NOTE: This stemmer does not implement the Snowball algorithm correctly,
specifically doubled consonants. It is recommended that you consider using
the "Dutch" stemmer in the snowball package instead. This stemmer will likely
be deprecated in a future release.
Method Summary |
boolean |
incrementToken()
Returns the next token in the stream, or null at EOS |
void |
setExclusionTable(java.util.HashSet exclusiontable)
Set an alternative exclusion list for this filter. |
void |
setStemDictionary(java.util.HashMap dict)
Set dictionary for stemming, this dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair. |
void |
setStemmer(DutchStemmer stemmer)
Set a alternative/custom DutchStemmer for this filter. |
Methods inherited from class org.apache.lucene.util.AttributeSource |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString |
Methods inherited from class java.lang.Object |
clone, finalize, getClass, notify, notifyAll, wait, wait, wait |
DutchStemFilter
public DutchStemFilter(TokenStream _in)
DutchStemFilter
public DutchStemFilter(TokenStream _in,
java.util.Set exclusiontable)
- Builds a DutchStemFilter that uses an exclusion table.
DutchStemFilter
public DutchStemFilter(TokenStream _in,
java.util.Set exclusiontable,
java.util.Map stemdictionary)
- Parameters:
stemdictionary
- Dictionary of word stem pairs, that overrule the algorithm
incrementToken
public boolean incrementToken()
throws java.io.IOException
- Returns the next token in the stream, or null at EOS
- Specified by:
incrementToken
in class TokenStream
- Returns:
- false for end of stream; true otherwise
- Throws:
java.io.IOException
setStemmer
public void setStemmer(DutchStemmer stemmer)
- Set a alternative/custom
DutchStemmer
for this filter.
setExclusionTable
public void setExclusionTable(java.util.HashSet exclusiontable)
- Set an alternative exclusion list for this filter.
setStemDictionary
public void setStemDictionary(java.util.HashMap dict)
- Set dictionary for stemming, this dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair.
Copyright © 2000-2011 Apache Software Foundation. All Rights Reserved.