7#ifndef WHITESPACETOKENIZER_H
8#define WHITESPACETOKENIZER_H
#define LUCENE_CLASS(Name)
Definition: LuceneObject.h:24
An abstract base class for simple, character-oriented tokenizers.
Definition: CharTokenizer.h:15
A WhitespaceTokenizer is a tokenizer that divides text at whitespace. Adjacent sequences of non-White...
Definition: WhitespaceTokenizer.h:16
WhitespaceTokenizer(const AttributeSourcePtr &source, const ReaderPtr &input)
Construct a new WhitespaceTokenizer using a given AttributeSource.
WhitespaceTokenizer(const ReaderPtr &input)
Construct a new WhitespaceTokenizer.
virtual ~WhitespaceTokenizer()
virtual bool isTokenChar(wchar_t c)
Collects only characters which do not satisfy Character#isWhitespace(char).
WhitespaceTokenizer(const AttributeFactoryPtr &factory, const ReaderPtr &input)
Construct a new WhitespaceTokenizer using a given AttributeSource.AttributeFactory.
Definition: AbstractAllTermDocs.h:12
boost::shared_ptr< AttributeSource > AttributeSourcePtr
Definition: LuceneTypes.h:520
boost::shared_ptr< Reader > ReaderPtr
Definition: LuceneTypes.h:547
boost::shared_ptr< AttributeFactory > AttributeFactoryPtr
Definition: LuceneTypes.h:519