Documentation ¶
Index ¶
Constants ¶
View Source
const ( ADJ = 1 << iota ADP ADV AFFIX CONJ DET NOUN NUM PRON PRT PUNCT UNKN VERB X ANY = ADJ | ADP | ADV | AFFIX | CONJ | DET | NOUN | NUM | PRON | PRT | PUNCT | UNKN | VERB | X )
Part of speech
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Lang ¶
type Lang string
Lang defines the language used to examine the text. Both ISO and BCP-47 language codes are accepted
var AutoLang Lang = "auto"
AutoLang tries to automatically recognize the language
type NLP ¶
type NLP struct {
// contains filtered or unexported fields
}
NLP tokenizes a text using NLP
func (*NLP) TokenizeEntities ¶
TokenizeEntities returns nested tokenized entities
func (*NLP) TokenizeText ¶
TokenizeText tokenizes a text
type PoSDeterm ¶
type PoSDeterm struct {
// contains filtered or unexported fields
}
PoSDeterm represents the default part of speech determinator
func NewPoSDetermer ¶
NewPoSDetermer returns a new default part of speech determinator
type PoSDetermer ¶
PoSDetermer determinates if part of speech tags should be deleted
Click to show internal directories.
Click to hide internal directories.