Documentation ¶
Index ¶
- Constants
- func DateInSlice(slice []time.Time, element time.Time) bool
- func DownloadFile(filePath string, url string) error
- func EnsureDirectory(filePath string) error
- func ExtractDomain(link string) string
- func ExtractFileName(link string) string
- func FileTypeCheck(link string, fileType string) bool
- func FixUrl(href, base string) string
- func HTMLStringToDoc(resp string) (*goquery.Document, error)
- func Maximum(list []int) int
- func Minimum(list []int) int
- func ObjectIdInSlice(slice []bson.ObjectId, element bson.ObjectId) bool
- func ParseCategoriesString(categoriesString string) []string
- func ParsePdf(source string) string
- func ProcessNameString(stringValue string) string
- func StringContainsAnyInSlice(slice []string, element string) bool
- func StringInSlice(slice []string, element string) bool
- func StringMatchPercentage(string1 string, string2 string) int
- func StringsMatch(string1 string, string2 string, tolerance int) bool
Constants ¶
const NewPageMarker = "\n*******************\n"
Variables ¶
This section is empty.
Functions ¶
func DateInSlice ¶
* check if a given date exists in a given date slice
func DownloadFile ¶
* download a file given the source and destination
func EnsureDirectory ¶
func ExtractDomain ¶
* extract the main domain from a given source path
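One way this could be done with the standard library is sketched below. What counts as the "main domain" is not specified here, so the sketch assumes it means the URL's host name with a leading `www.` removed; `extractDomain` is a hypothetical stand-in, not the package's code.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// extractDomain returns the host name of link without a leading "www.".
// Assumption: link is an absolute URL with a scheme. Links that fail to
// parse, or that have no host, yield an empty string.
func extractDomain(link string) string {
	u, err := url.Parse(link)
	if err != nil {
		return ""
	}
	return strings.TrimPrefix(u.Hostname(), "www.")
}

func main() {
	fmt.Println(extractDomain("https://www.example.com/docs/report.pdf")) // example.com
}
```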
func ExtractFileName ¶
* extract filename from a given source path
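As an illustration, the file name can be taken as the last element of the URL path. The sketch below assumes the query string should be ignored; `extractFileName` is an illustrative name and may differ from the real implementation.

```go
package main

import (
	"fmt"
	"net/url"
	"path"
)

// extractFileName returns the last path element of link.
// Assumption: query strings and fragments are not part of the file name.
// If link cannot be parsed as a URL, it is treated as a plain slash-separated path.
func extractFileName(link string) string {
	u, err := url.Parse(link)
	if err != nil {
		return path.Base(link)
	}
	return path.Base(u.Path)
}

func main() {
	fmt.Println(extractFileName("https://example.com/docs/report.pdf?download=1")) // report.pdf
}
```

Note that `path.Base` returns `.` for an empty path, so callers may want to treat that value as "no file name".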
func FileTypeCheck ¶
* check if the file type of given source path matches given file type
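A possible shape for this check is sketched below. It assumes the file type is identified by the extension of the path and that the comparison is case-insensitive; neither detail is confirmed by the documentation, and `fileTypeCheck` is a hypothetical name.

```go
package main

import (
	"fmt"
	"path"
	"strings"
)

// fileTypeCheck reports whether link ends in the extension fileType.
// Assumptions: fileType may be given with or without a leading dot,
// matching ignores case, and link carries no query string.
func fileTypeCheck(link string, fileType string) bool {
	ext := strings.TrimPrefix(path.Ext(link), ".")
	want := strings.TrimPrefix(fileType, ".")
	return ext != "" && strings.EqualFold(ext, want)
}

func main() {
	fmt.Println(fileTypeCheck("https://example.com/docs/Report.PDF", "pdf"))  // true
	fmt.Println(fileTypeCheck("https://example.com/docs/Report.PDF", "docx")) // false
}
```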
func ObjectIdInSlice ¶
* check if a given ObjectId exists in a given ObjectId slice
func ParseCategoriesString ¶
func ProcessNameString ¶
func StringContainsAnyInSlice ¶
* check if a given string exists in any string in a given string slice
func StringInSlice ¶
* check if a given string exists in a given slice
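A minimal sketch of an exact-match lookup with the same parameter order as the documented signature. It assumes a case-sensitive comparison; `stringInSlice` is an illustrative name.

```go
package main

import "fmt"

// stringInSlice reports whether element is equal to any entry of slice.
// Assumption: the comparison is exact and case-sensitive.
func stringInSlice(slice []string, element string) bool {
	for _, s := range slice {
		if s == element {
			return true
		}
	}
	return false
}

func main() {
	tags := []string{"news", "sports"}
	fmt.Println(stringInSlice(tags, "sports")) // true
	fmt.Println(stringInSlice(tags, "Sports")) // false
}
```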
func StringMatchPercentage ¶
* match strings using Levenshtein distance (source: https://en.wikipedia.org/wiki/Levenshtein_distance, translated from C to Go)
Types ¶
This section is empty.
Source Files ¶
- check_file_type.go
- date_in_slice.go
- download_file.go
- ensure_directory.go
- extract_domain.go
- extract_file_name.go
- fix_url.go
- html_string_to_doc.go
- maximum.go
- minimum.go
- objectid_in_slice.go
- parse_categories_string.go
- pdf_parser.go
- process_name_string.go
- string_contains_any_in_slice.go
- string_in_slice.go
- string_match_percentage.go