|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectorg.apache.lucene.analysis.WordlistLoader
public class WordlistLoader
Loader for text files that represent a list of stopwords.
Constructor Summary | |
---|---|
WordlistLoader()
|
Method Summary | |
---|---|
static java.util.HashMap<java.lang.String,java.lang.String> |
getStemDict(java.io.File wordstemfile)
Reads a stem dictionary. |
static java.util.HashSet<java.lang.String> |
getWordSet(java.io.File wordfile)
Loads a text file and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). |
static java.util.HashSet<java.lang.String> |
getWordSet(java.io.File wordfile,
java.lang.String comment)
Loads a text file and adds every non-comment line as an entry to a HashSet (omitting leading and trailing whitespace). |
static java.util.HashSet<java.lang.String> |
getWordSet(java.io.Reader reader)
Reads lines from a Reader and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). |
static java.util.HashSet<java.lang.String> |
getWordSet(java.io.Reader reader,
java.lang.String comment)
Reads lines from a Reader and adds every non-comment line as an entry to a HashSet (omitting leading and trailing whitespace). |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public WordlistLoader()
Method Detail |
---|
public static java.util.HashSet<java.lang.String> getWordSet(java.io.File wordfile) throws java.io.IOException
wordfile
- File containing the wordlist
java.io.IOException
public static java.util.HashSet<java.lang.String> getWordSet(java.io.File wordfile, java.lang.String comment) throws java.io.IOException
wordfile
- File containing the wordlistcomment
- The comment string to ignore
java.io.IOException
public static java.util.HashSet<java.lang.String> getWordSet(java.io.Reader reader) throws java.io.IOException
reader
- Reader containing the wordlist
java.io.IOException
public static java.util.HashSet<java.lang.String> getWordSet(java.io.Reader reader, java.lang.String comment) throws java.io.IOException
reader
- Reader containing the wordlistcomment
- The string representing a comment.
java.io.IOException
public static java.util.HashMap<java.lang.String,java.lang.String> getStemDict(java.io.File wordstemfile) throws java.io.IOException
word\tstem(i.e. two tab seperated words)
java.io.IOException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |