public class WordDelimiterFilterFactory extends BaseTokenFilterFactory implements ResourceLoaderAware
<fieldType name="text_wd" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.WordDelimiterFilterFactory" protected="protectedword.txt"
            preserveOriginal="0" splitOnNumerics="1" splitOnCaseChange="1"
            catenateWords="0" catenateNumbers="0" catenateAll="0"
            generateWordParts="1" generateNumberParts="1" stemEnglishPossessive="1"
            types="wdfftypes.txt" />
  </analyzer>
</fieldType>
Modifier and Type | Field and Description |
---|---|
private int | flags |
(package private) char[] | out |
static java.lang.String | PROTECTED_TOKENS |
private org.apache.lucene.analysis.CharArraySet | protectedWords |
private static java.util.regex.Pattern | typePattern |
static java.lang.String | TYPES |
(package private) byte[] | typeTable |
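Fields inherited from class BaseTokenFilterFactory: log
Fields inherited from class BaseTokenStreamFactory: args, luceneMatchVersion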
Constructor and Description |
---|
WordDelimiterFilterFactory() |
Modifier and Type | Method and Description |
---|---|
WordDelimiterFilter | create(org.apache.lucene.analysis.TokenStream input) Transform the specified input TokenStream. |
void | inform(ResourceLoader loader) |
void | init(java.util.Map<java.lang.String,java.lang.String> args) init will be called just once, immediately after creation. |
private java.lang.String | parseString(java.lang.String s) |
private java.lang.Byte | parseType(java.lang.String s) |
private byte[] | parseTypes(java.util.List<java.lang.String> rules) |
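Methods inherited from class BaseTokenStreamFactory: assureMatchVersion, getArgs, getBoolean, getBoolean, getInt, getInt, getInt, getSnowballWordSet, getWordSet, warnDeprecated
Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface TokenFilterFactory: getArgs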
public static final java.lang.String PROTECTED_TOKENS
public static final java.lang.String TYPES
private org.apache.lucene.analysis.CharArraySet protectedWords
private int flags
byte[] typeTable
private static java.util.regex.Pattern typePattern
char[] out
public void inform(ResourceLoader loader)
Specified by: inform in interface ResourceLoaderAware
public void init(java.util.Map<java.lang.String,java.lang.String> args)
Description copied from interface: TokenFilterFactory
init will be called just once, immediately after creation. The args are user-level initialization parameters that may be specified when declaring the factory in the schema.xml.
Specified by: init in interface TokenFilterFactory
Overrides: init in class BaseTokenStreamFactory
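The page does not show how init turns the string-valued args into the private int flags field. The sketch below is one plausible reading, not the factory's actual code: each numeric attribute from the schema example contributes one bit when it is non-zero. The class name FlagSketch, the helper names, the bit values, and the defaults (taken from the attribute values in the fieldType example) are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

final class FlagSketch {
    // Hypothetical bit values, chosen for illustration only.
    static final int GENERATE_WORD_PARTS = 1;
    static final int GENERATE_NUMBER_PARTS = 2;
    static final int CATENATE_WORDS = 4;
    static final int CATENATE_NUMBERS = 8;
    static final int CATENATE_ALL = 16;
    static final int PRESERVE_ORIGINAL = 32;
    static final int SPLIT_ON_CASE_CHANGE = 64;
    static final int SPLIT_ON_NUMERICS = 128;
    static final int STEM_ENGLISH_POSSESSIVE = 256;

    // Returns `bit` when the named arg (or its assumed default) is non-zero, else 0.
    static int bit(Map<String, String> args, String name, int defaultValue, int bit) {
        String raw = args.get(name);
        int value = (raw == null) ? defaultValue : Integer.parseInt(raw.trim());
        return value != 0 ? bit : 0;
    }

    // Folds the user-level args into a single bitmask.
    static int buildFlags(Map<String, String> args) {
        int flags = 0;
        flags |= bit(args, "generateWordParts", 1, GENERATE_WORD_PARTS);
        flags |= bit(args, "generateNumberParts", 1, GENERATE_NUMBER_PARTS);
        flags |= bit(args, "catenateWords", 0, CATENATE_WORDS);
        flags |= bit(args, "catenateNumbers", 0, CATENATE_NUMBERS);
        flags |= bit(args, "catenateAll", 0, CATENATE_ALL);
        flags |= bit(args, "preserveOriginal", 0, PRESERVE_ORIGINAL);
        flags |= bit(args, "splitOnCaseChange", 1, SPLIT_ON_CASE_CHANGE);
        flags |= bit(args, "splitOnNumerics", 1, SPLIT_ON_NUMERICS);
        flags |= bit(args, "stemEnglishPossessive", 1, STEM_ENGLISH_POSSESSIVE);
        return flags;
    }

    public static void main(String[] unused) {
        Map<String, String> args = new HashMap<>();
        args.put("catenateAll", "1");
        args.put("splitOnNumerics", "0");
        int flags = buildFlags(args);
        System.out.println("catenateAll set: " + ((flags & CATENATE_ALL) != 0));
        System.out.println("splitOnNumerics set: " + ((flags & SPLIT_ON_NUMERICS) != 0));
    }
}
```

Packing the options into one int keeps the value handed to the filter compact and lets each option be tested with a single mask operation.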
public WordDelimiterFilter create(org.apache.lucene.analysis.TokenStream input)
Description copied from interface: TokenFilterFactory
Transform the specified input TokenStream.
Specified by: create in interface TokenFilterFactory
private byte[] parseTypes(java.util.List<java.lang.String> rules)
private java.lang.Byte parseType(java.lang.String s)
private java.lang.String parseString(java.lang.String s)
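The three private parse methods are undocumented here. Judging by their signatures and the typePattern and typeTable fields, they appear to turn the lines of the types file into a per-character type table. The following is a simplified sketch under that assumption, not the factory's implementation: the rule syntax `char => TYPE`, the regular expression, the type names and byte codes, the ASCII-only table, and the class name TypeRuleSketch are all illustrative choices.

```java
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class TypeRuleSketch {
    // Hypothetical type codes, chosen for illustration only.
    static final byte LOWER = 1;
    static final byte UPPER = 2;
    static final byte ALPHA = 3;
    static final byte DIGIT = 4;
    static final byte ALPHANUM = 7;
    static final byte SUBWORD_DELIM = 8;

    // Assumed rule shape: a single character, "=>", then a type name.
    private static final Pattern RULE =
        Pattern.compile("^\\s*(\\S+)\\s*=>\\s*(\\S+)\\s*$");

    // Maps a type name to its code, or null when the name is unknown.
    static Byte parseType(String s) {
        switch (s) {
            case "LOWER": return LOWER;
            case "UPPER": return UPPER;
            case "ALPHA": return ALPHA;
            case "DIGIT": return DIGIT;
            case "ALPHANUM": return ALPHANUM;
            case "SUBWORD_DELIM": return SUBWORD_DELIM;
            default: return null;
        }
    }

    // Default classification used before any rule overrides it.
    static byte defaultType(char c) {
        if (Character.isLowerCase(c)) return LOWER;
        if (Character.isUpperCase(c)) return UPPER;
        if (Character.isDigit(c)) return DIGIT;
        return SUBWORD_DELIM;
    }

    // Builds an ASCII-only table: index is the character, value is its type code.
    static byte[] parseTypes(List<String> rules) {
        byte[] table = new byte[128];
        for (int c = 0; c < table.length; c++) {
            table[c] = defaultType((char) c);
        }
        for (String rule : rules) {
            Matcher m = RULE.matcher(rule);
            if (!m.matches()) {
                throw new IllegalArgumentException("Invalid rule: [" + rule + "]");
            }
            String lhs = m.group(1);
            Byte type = parseType(m.group(2));
            if (lhs.length() != 1 || lhs.charAt(0) >= table.length) {
                throw new IllegalArgumentException("Unsupported character: [" + rule + "]");
            }
            if (type == null) {
                throw new IllegalArgumentException("Unknown type: [" + rule + "]");
            }
            table[lhs.charAt(0)] = type;
        }
        return table;
    }

    public static void main(String[] unused) {
        byte[] table = parseTypes(Arrays.asList("$ => DIGIT", "- => ALPHA"));
        System.out.println("'$' code: " + table['$']);
        System.out.println("'-' code: " + table['-']);
    }
}
```

With a rule such as `$ => DIGIT`, a token like `$42` would be treated as a single numeric run instead of being split at the dollar sign, which is the usual reason to supply a types file.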