Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
View Source
var ( // PageExtension is the file extension that downloaded pages get PageExtension = ".html" // PageDirIndex is the file name of the index file for every dir PageDirIndex = "index" + PageExtension )
Functions ¶
func GetPageFilePath ¶
GetPageFilePath returns a filename for a URL that represents a page.
Types ¶
type Scraper ¶
type Scraper struct { // Configuration ImageQuality uint MaxDepth uint OutputDirectory string Username string Password string URL *url.URL // contains filtered or unexported fields }
Scraper contains all scraping data
func (*Scraper) GetFilePath ¶
GetFilePath returns a file path for a URL to store the URL content in
func (*Scraper) RemoveAnchor ¶
RemoveAnchor removes anchors from URLS
func (*Scraper) SetExcludes ¶
SetExcludes sets and checks the exclusions regular expressions.
func (*Scraper) SetIncludes ¶
SetIncludes sets and checks the inclusion regular expressions.
Click to show internal directories.
Click to hide internal directories.