Documentation ¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func SplitToSentences ¶ added in v0.24.0
SplitToSentences splits given text into slice of sentences.
Types ¶
type InputReader ¶ added in v0.22.0
InputReader ...
type Tag ¶
type Tag struct { // Value of the tag, i.e. a word Value string // Score used to represent importance of the tag Score float64 // Count is the number of times tag appeared in a text Count int // Docs is the number of documents in a text in which the tag appeared Docs int // DocsCount is the number of documents in a text DocsCount int }
Tag holds some arbitrary string value (e.g. a word) along with some extra data about it.
func ParseHTML ¶
ParseHTML receives lines of raw HTML markup text from the Web and returns simple text, plus list of prioritised tags (if tagify == true) based on the importance of HTML tags which wrap sentences.
Example:
<h1>A story about foo <p> Foo was a good guy but, had a quite poor time management skills, therefore he had issues with shipping all his tasks. Though foo had heaps of other amazing skills, which gained him a fortune.
Result:
foo: 2 + 1 = 3, story: 2, management: 1 + 1 = 2, skills: 1 + 1 = 2.
Returns a slice of tags as 1st result, a title of the page as 2nd and a version of the document based on the hashed contents as 3rd.
func ParseText ¶
func ParseText(in InputReader, verbose, noStopWords bool) ([]*Tag, []byte)
ParseText parses given text lines of text into a slice of tags.
Click to show internal directories.
Click to hide internal directories.