All Classes
-
All Classes Interface Summary Class Summary Class Description DefaultICUTokenizerConfig DefaultICUTokenizerConfig
that is generally applicable to many languages.ICUCollationKeyAnalyzer FiltersKeywordTokenizer
withICUCollationKeyFilter
.ICUCollationKeyFilter Converts each token into itsCollationKey
, and then encodes the CollationKey withIndexableBinaryStringTools
, to allow it to be stored as an index term.ICUFoldingFilter A TokenFilter that applies search term folding to Unicode text, applying foldings from UTR#30 Character Foldings.ICUNormalizer2Filter Normalize token text with ICU'sNormalizer2
ICUTokenizer Breaks text into words according to UAX #29: Unicode Text Segmentation (http://www.unicode.org/reports/tr29/)ICUTokenizerConfig Class that allows for tailored Unicode Text Segmentation on a per-writing system basis.ICUTransformFilter ATokenFilter
that transforms text with ICU.LaoBreakIterator Syllable iterator for Lao text.ScriptAttribute This attribute stores the UTR #24 script value for a token of text.ScriptAttributeImpl Implementation ofScriptAttribute
that stores the script as an integer.