public class SmartChineseWordTokenFilterFactory extends BaseTokenFilterFactory
WordTokenFilter
Note: this class will currently emit tokens for punctuation. So you should either add
a WordDelimiterFilter after to remove these (with concatenate off), or use the
SmartChinese stoplist with a StopFilterFactory via:
words="org/apache/lucene/analysis/cn/smart/stopwords.txt"
log
args, luceneMatchVersion
Constructor and Description |
---|
SmartChineseWordTokenFilterFactory() |
Modifier and Type | Method and Description |
---|---|
org.apache.lucene.analysis.TokenFilter |
create(org.apache.lucene.analysis.TokenStream input)
Transform the specified input TokenStream
|
assureMatchVersion, getArgs, getBoolean, getBoolean, getInt, getInt, getInt, getSnowballWordSet, getWordSet, init, warnDeprecated
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
getArgs, init
public org.apache.lucene.analysis.TokenFilter create(org.apache.lucene.analysis.TokenStream input)
TokenFilterFactory