Documentation ¶
Index ¶
- type BibViewConf
- type FilterConf
- type NgramConf
- type SelfJoinConf
- type VTEConf
- func (c *VTEConf) GetAtomParentStructure() string
- func (c *VTEConf) GetAtomStructure() string
- func (c *VTEConf) GetCorpus() string
- func (c *VTEConf) GetDbConfSettings() []string
- func (c *VTEConf) GetFilterFn() string
- func (c *VTEConf) GetFilterLib() string
- func (c *VTEConf) GetMaxNumErrors() int
- func (c *VTEConf) GetNgrams() *NgramConf
- func (c *VTEConf) GetStackStructEval() bool
- func (c *VTEConf) GetStructures() map[string][]string
- func (c *VTEConf) HasConfiguredBib() bool
- func (c *VTEConf) HasConfiguredFilter() bool
- func (c *VTEConf) UsesSelfJoin() bool
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type BibViewConf ¶
BibViewConf is a sub-configuration for bibliographic data.
type FilterConf ¶
FilterConf specifies a plug-in containing a compatible filter (see LineFilter interface).
type NgramConf ¶
type NgramConf struct {
	AttrColumns    []int    `json:"attrColumns"`
	ColumnMods     []string `json:"columnMods"`
	NgramSize      int      `json:"ngramSize"`
	UniqKeyColumns []int    `json:"uniqKeyColumns"`
	CalcARF        bool     `json:"calcARF"`
}
NgramConf configures the positional attributes (referred to by their column position) to be stored and counted as n-grams. This can be used, for example, to extract all the unique PoS tags or frequency information about words/lemmas.
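As an illustration of how the JSON tags above map onto a configuration fragment, here is a minimal sketch that decodes an n-gram section with the standard `encoding/json` package. The struct is copied from the definition above; `parseNgramConf` and the sample values (columns 0 and 2, n-gram size 1) are assumptions made for this example and are not part of the package.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// NgramConf is a local copy of the struct documented above,
// repeated here so that the example is self-contained.
type NgramConf struct {
	AttrColumns    []int    `json:"attrColumns"`
	ColumnMods     []string `json:"columnMods"`
	NgramSize      int      `json:"ngramSize"`
	UniqKeyColumns []int    `json:"uniqKeyColumns"`
	CalcARF        bool     `json:"calcARF"`
}

// parseNgramConf is a hypothetical helper (not part of the package)
// that decodes a JSON fragment into an NgramConf value.
func parseNgramConf(data []byte) (NgramConf, error) {
	var conf NgramConf
	err := json.Unmarshal(data, &conf)
	return conf, err
}

func main() {
	// Sample values are invented for illustration only: positional
	// attributes are referenced by their column position.
	raw := []byte(`{
		"attrColumns": [0, 2],
		"ngramSize": 1,
		"uniqKeyColumns": [0, 2],
		"calcARF": false
	}`)

	conf, err := parseNgramConf(raw)
	if err != nil {
		fmt.Println("invalid n-gram configuration:", err)
		return
	}
	fmt.Printf("columns=%v size=%d arf=%t\n",
		conf.AttrColumns, conf.NgramSize, conf.CalcARF)
}
```

Keys left out of the JSON (here `columnMods`) simply stay at their Go zero value, so the fragment only needs to list what a given task uses.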
type SelfJoinConf ¶
type SelfJoinConf struct {
	ArgColumns  []string `json:"argColumns"`
	GeneratorFn string   `json:"generatorFn"`
}
SelfJoinConf contains information about aligned structural attributes (e.g. sentences from two languages).
type VTEConf ¶
type VTEConf struct {
	Corpus              string `json:"corpus"`
	AtomStructure       string `json:"atomStructure"`
	AtomParentStructure string `json:"atomParentStructure"`
	StackStructEval     bool   `json:"stackStructEval"`

	// MaxNumErrors if reached then the process stops
	MaxNumErrors int `json:"maxNumErrors"`

	Structures map[string][]string `json:"structures"`

	// Ngrams - see NgramConf
	// If omitted then the function is disabled.
	Ngrams NgramConf `json:"ngrams"`

	// VerticalFile can be either a path to a single file
	// or a path to a directory containing multiple vertical
	// files (then we assume all the vertical files are of the
	// same structure)
	VerticalFile string `json:"verticalFile"`

	// VerticalFiles is an alternative to VerticalFile allowing
	// explicit selection of one or more files to be processed
	// as one.
	VerticalFiles []string `json:"verticalFiles"`

	DBFile         string       `json:"dbFile"`
	DBConfSettings []string     `json:"dbConfSettings"`
	Encoding       string       `json:"encoding"`
	SelfJoin       SelfJoinConf `json:"selfJoin"`
	IndexedCols    []string     `json:"indexedCols"`
	BibView        BibViewConf  `json:"bibView"`
	Filter         FilterConf   `json:"filter"`
}
VTEConf holds configuration for a concrete data extraction task.
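To make the shape of a task configuration more concrete, the following sketch decodes a small JSON document into a trimmed-down copy of VTEConf. Only a subset of the fields is reproduced, and `loadConf`, the corpus name, the structure names, and all paths are assumptions invented for this example. It uses `verticalFiles` to list the inputs explicitly and leaves out `ngrams`.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// NgramConf and VTEConf are trimmed local copies of the structs
// documented above; fields not needed for the example are omitted.
type NgramConf struct {
	AttrColumns []int `json:"attrColumns"`
	NgramSize   int   `json:"ngramSize"`
}

type VTEConf struct {
	Corpus        string              `json:"corpus"`
	AtomStructure string              `json:"atomStructure"`
	MaxNumErrors  int                 `json:"maxNumErrors"`
	Structures    map[string][]string `json:"structures"`
	Ngrams        NgramConf           `json:"ngrams"`
	VerticalFile  string              `json:"verticalFile"`
	VerticalFiles []string            `json:"verticalFiles"`
	DBFile        string              `json:"dbFile"`
}

// loadConf is a hypothetical helper (not part of the package)
// that decodes a JSON task configuration.
func loadConf(data []byte) (*VTEConf, error) {
	var conf VTEConf
	if err := json.Unmarshal(data, &conf); err != nil {
		return nil, err
	}
	return &conf, nil
}

func main() {
	// All values below are invented for illustration only.
	raw := []byte(`{
		"corpus": "example_corpus",
		"atomStructure": "p",
		"maxNumErrors": 100,
		"structures": {"doc": ["id", "title"], "p": ["id"]},
		"verticalFiles": ["/path/to/part1.vert", "/path/to/part2.vert"],
		"dbFile": "/path/to/example.db"
	}`)

	conf, err := loadConf(raw)
	if err != nil {
		fmt.Println("failed to load configuration:", err)
		return
	}
	fmt.Println("corpus:", conf.Corpus)
	fmt.Println("doc attributes:", conf.Structures["doc"])
	fmt.Println("input files:", len(conf.VerticalFiles))
	// "ngrams" was omitted, so the field holds its zero value.
	fmt.Println("ngram size:", conf.Ngrams.NgramSize)
}
```

In plain `encoding/json` terms, an omitted `ngrams` key leaves the field at its zero value. How the package itself decides that the n-gram function is disabled is not shown in this documentation.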
func (*VTEConf) GetAtomParentStructure ¶
func (*VTEConf) GetAtomStructure ¶
func (*VTEConf) GetDbConfSettings ¶
func (*VTEConf) GetFilterFn ¶
func (*VTEConf) GetFilterLib ¶
func (*VTEConf) GetMaxNumErrors ¶
func (*VTEConf) GetStackStructEval ¶
func (*VTEConf) GetStructures ¶
func (*VTEConf) HasConfiguredBib ¶
func (*VTEConf) HasConfiguredFilter ¶
func (*VTEConf) UsesSelfJoin ¶