Documentation ¶
Index ¶
Constants ¶
View Source
const ( ADJ = 1 << iota ADP ADV AFFIX CONJ DET NOUN NUM PRON PRT PUNCT UNKN VERB X ANY = ADJ | ADP | ADV | AFFIX | CONJ | DET | NOUN | NUM | PRON | PRT | PUNCT | UNKN | VERB | X )
Part of speech
View Source
const (
// Whitespace is the default separator
Whitespace = " "
)
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Join ¶
type Join struct {
// contains filtered or unexported fields
}
Join is the default joiner
type Joiner ¶
type Joiner interface {
Join(PoSDetermer, Tokenizer) ([]string, error)
}
Joiner joines a tokenizer taking the part of speech determinator into account
type NLP ¶
type NLP struct {
// contains filtered or unexported fields
}
NLP tokenizes a text using NLP
func (*NLP) TokenizeEntities ¶
TokenizeEntities returns nested tokenized entities
func (*NLP) TokenizeText ¶
TokenizeText tokenizes a text
type PoSDeterm ¶
type PoSDeterm struct {
// contains filtered or unexported fields
}
PoSDeterm represents the default part of speech determinator
func NewPoSDetermer ¶
NewPoSDetermer returns a new default part of speech determinator
type PoSDetermer ¶
PoSDetermer determinates if part of speech tags should be deleted
Click to show internal directories.
Click to hide internal directories.