README
¶
HTML 🤓
Simple Go (golang) library for manipulating HTML content
FromText(text) html
ToText(html) text
IsHTML(string) bool
RemoveTags(html) string
Summary(html) string
CollapseWhitespace(string) string
Pull Requests Welcome
Original versions of this library have been used in production on commercial applications for years, and the extra data collection has been a tremendous help for everyone involved.
I'm now open sourcing this library, and others, with hopes that you'll also benefit from a more robust error package.
Please use GitHub to make suggestions, pull requests, and enhancements. We're all in this together! 🤪
Documentation
¶
Index ¶
- func CollapseWhitespace(text string) string
- func FromText(text string) string
- func IsHTML(html string) bool
- func Minimal(text string) string
- func RemoveAnchors(html string) string
- func RemoveSpecialCharacters(html string) string
- func RemoveTags(html string) string
- func Summary(html string) string
- func ToSearchText(html string) string
- func ToText(html string) string
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func CollapseWhitespace ¶
CollapseWhitespace converts all whitespace characters into a single SPACE character
func IsHTML ¶
IsHTML returns TRUE if the string provided "looks like" HTML, in that, it has one or more substrings that appear to be an HTML tag
func RemoveAnchors ¶ added in v0.20.0
RemoveAnchors strips all HTML anchor tags from a string.
func RemoveSpecialCharacters ¶ added in v0.20.0
RemoveSpecialCharacters removes special Unicode characters from a string
func RemoveTags ¶
RemoveTags aggressively strips HTML tags from a string. It will only keep anything between `>` and `<`. From: https://stackoverflow.com/questions/55036156/how-to-replace-all-html-tag-with-empty-string-in-golang Original code by: Daniel Morell <https://stackoverflow.com/users/10463261/daniel-morell>
func ToSearchText ¶ added in v0.20.0
ToSearchText removes tags in a way that is suitable to text searches. This means that it will remove all tags, but adds regular whitespace in between them.
Types ¶
This section is empty.