Documentation ¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Scraper ¶
type Scraper struct { URL *urlpkg.URL // contains the main URL to parse, will be modified in case of a redirect Client download.HttpClient Fs afero.Fs // filesystem // ETagsDB stores ETags (hashes of file state) for each URL ETagsDB *db.DB // contains filtered or unexported fields }
Scraper contains all scraping data, starts the process and handles the concurrency. It includes the logic to decide what URLs to include/exclude and when to stop.
func (*Scraper) Downloader ¶
Click to show internal directories.
Click to hide internal directories.