public class ICUNormalizer2FilterFactory extends BaseTokenFilterFactory implements MultiTermAwareComponent
Factory for ICUNormalizer2Filter.
Supports the following attributes:
filter: A UnicodeSet pattern. Code points outside the set are always left unchanged. Default is [] (the null set, no filtering).
See Also: ICUNormalizer2Filter, Normalizer2, FilteredNormalizer2
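The set-restricted behaviour described above (code points outside the set are always left unchanged) can be sketched without ICU on the classpath. The example below is only an approximation under stated assumptions: it uses the JDK's `java.text.Normalizer` with NFKC in place of ICU's `Normalizer2`, and a plain `IntPredicate` in place of a `UnicodeSet` pattern. The class and method names are hypothetical and are not part of this factory.

```java
import java.text.Normalizer;
import java.util.function.IntPredicate;

public class FilteredNormalizationSketch {

    /**
     * Normalizes (NFKC) only the runs of code points accepted by inSet and
     * copies every other code point through unchanged.
     * Stand-in for a set-filtered normalizer; not the ICU implementation.
     */
    public static String normalizeFiltered(String text, IntPredicate inSet) {
        StringBuilder out = new StringBuilder();
        StringBuilder run = new StringBuilder(); // pending in-set code points
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            if (inSet.test(cp)) {
                run.appendCodePoint(cp);
            } else {
                flush(run, out);          // normalize what was collected so far
                out.appendCodePoint(cp);  // out-of-set code point stays as-is
            }
        }
        flush(run, out);
        return out.toString();
    }

    private static void flush(StringBuilder run, StringBuilder out) {
        if (run.length() > 0) {
            out.append(Normalizer.normalize(run, Normalizer.Form.NFKC));
            run.setLength(0);
        }
    }

    public static void main(String[] args) {
        // Hypothetical set: every code point except FULLWIDTH DIGIT ONE (U+FF11).
        IntPredicate set = cp -> cp != 0xFF11;
        String input = "\uFF21\uFF11"; // fullwidth 'A' followed by fullwidth '1'
        // The fullwidth 'A' is folded to ASCII 'A'; the excluded digit is untouched.
        System.out.println(normalizeFiltered(input, set));
    }
}
```

With a predicate that accepts everything, the same input is normalized in full; with one that accepts nothing, the input comes back unchanged.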
Modifier and Type | Field and Description |
---|---|
private com.ibm.icu.text.Normalizer2 | normalizer |
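Fields inherited from class BaseTokenFilterFactory: log
Fields inherited from class BaseTokenStreamFactory: args, luceneMatchVersion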
Constructor and Description |
---|
ICUNormalizer2FilterFactory() |
Modifier and Type | Method and Description |
---|---|
org.apache.lucene.analysis.TokenStream | create(org.apache.lucene.analysis.TokenStream input) Transform the specified input TokenStream |
java.lang.Object | getMultiTermComponent() Returns an analysis component to handle analysis of multi-term queries. |
void | init(java.util.Map<java.lang.String,java.lang.String> args) init will be called just once, immediately after creation. |
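Methods inherited from class BaseTokenStreamFactory: assureMatchVersion, getArgs, getBoolean, getBoolean, getInt, getInt, getInt, getSnowballWordSet, getWordSet, warnDeprecated
Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface TokenFilterFactory: getArgs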
public void init(java.util.Map<java.lang.String,java.lang.String> args)
Description copied from interface: TokenFilterFactory
init will be called just once, immediately after creation.
The args are user-level initialization parameters that may be specified when declaring the factory in the schema.xml
Specified by: init in interface TokenFilterFactory
Overrides: init in class BaseTokenStreamFactory
public org.apache.lucene.analysis.TokenStream create(org.apache.lucene.analysis.TokenStream input)
Description copied from interface: TokenFilterFactory
Transform the specified input TokenStream
Specified by: create in interface TokenFilterFactory
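The contract documented for init and create (init runs once right after construction, create then wraps an input stream) can be illustrated with stand-in types. This is a minimal sketch, assuming nothing about the real Solr classes: `SketchFilterFactory`, `SketchNormalizerFactory`, and the `form` argument name are all hypothetical, an `Iterator<String>` stands in for a `TokenStream`, and `java.text.Normalizer` stands in for the ICU normalizer held in the `normalizer` field.

```java
import java.text.Normalizer;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Iterator;
import java.util.Map;

public class FactoryLifecycleSketch {

    /** Hypothetical stand-in for a token filter factory; not the Solr interface. */
    public interface SketchFilterFactory {
        void init(Map<String, String> args);             // called once, right after creation
        Iterator<String> create(Iterator<String> input); // wraps an upstream token source
    }

    /** Stand-in factory that normalizes each token it passes through. */
    public static class SketchNormalizerFactory implements SketchFilterFactory {
        // Resolved once in init and reused by every create call.
        private Normalizer.Form form;

        @Override
        public void init(Map<String, String> args) {
            // "form" is an argument name invented for this sketch only.
            form = Normalizer.Form.valueOf(args.getOrDefault("form", "NFKC"));
        }

        @Override
        public Iterator<String> create(Iterator<String> input) {
            final Normalizer.Form f = form;
            return new Iterator<String>() {
                @Override public boolean hasNext() { return input.hasNext(); }
                @Override public String next() { return Normalizer.normalize(input.next(), f); }
            };
        }
    }

    public static void main(String[] args) {
        SketchNormalizerFactory factory = new SketchNormalizerFactory();
        Map<String, String> config = new HashMap<>();
        config.put("form", "NFKC");
        factory.init(config); // one-time configuration

        // U+FB01 is the "fi" ligature; NFKC expands it to the letters f and i.
        Iterator<String> tokens = factory.create(Arrays.asList("\uFB01sh").iterator());
        while (tokens.hasNext()) {
            System.out.println(tokens.next());
        }
    }
}
```

Doing the argument parsing in init keeps create cheap, since create may be called many times while the configuration stays fixed.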
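public java.lang.Object getMultiTermComponent()
Description copied from interface: MultiTermAwareComponent
Returns an analysis component to handle analysis of multi-term queries.
Specified by: getMultiTermComponent in interface MultiTermAwareComponent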