Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
View Source
var ENGLISH_STOP_WORDS_SET = map[string]bool{ "a": true, "an": true, "and": true, "are": true, "as": true, "at": true, "be": true, "but": true, "by": true, "for": true, "if": true, "in": true, "into": true, "is": true, "it": true, "no": true, "not": true, "of": true, "on": true, "or": true, "such": true, "that": true, "the": true, "their": true, "then": true, "there": true, "these": true, "they": true, "this": true, "to": true, "was": true, "will": true, "with": true, }
An unmodifiable set containing some common English words that are not usually useful for searching.
Functions ¶
This section is empty.
Types ¶
type LowerCaseFilter ¶
type LowerCaseFilter struct { *TokenFilter // contains filtered or unexported fields }
Normalizes token text to lower case.
You may specify the Version compatibility when creating LowerCaseFilter:
- As of 3.1, supplementary characters are properly lowercased.
func NewLowerCaseFilter ¶
func NewLowerCaseFilter(matchVersion util.Version, in TokenStream) *LowerCaseFilter
Create a new LowerCaseFilter, that normalizes token text to lower case.
func (*LowerCaseFilter) IncrementToken ¶
func (f *LowerCaseFilter) IncrementToken() (bool, error)
type StopFilter ¶
type StopFilter struct { *FilteringTokenFilter // contains filtered or unexported fields }
Removes stop words from a token stream.
You may specify the Version compatibility when creating StopFilter:
- As of 3.1, StopFilter correctly handles Unicode 4.0 supplementary characters in stopwords and position increments are preserved
func NewStopFilter ¶
func NewStopFilter(matchVersion util.Version, in TokenStream, stopWords map[string]bool) *StopFilter
Constructs a filter which removes words from the input TokenStream that are named in the Set.
func (*StopFilter) Accept ¶
func (f *StopFilter) Accept() bool
Click to show internal directories.
Click to hide internal directories.