Documentation
¶
Index ¶
- func CreateDNSJob(url *url.URL, resolverAddress net.IP, resolverPort uint16) (*pipeline.Job, error)
- func CreateHTTPJob(url *url.URL, userAgent string) *pipeline.Job
- func CreateRenderJob(url *url.URL, quality int) *pipeline.Job
- func CreateRobotsJob(target *url.URL, userAgent string) (*pipeline.Job, error)
- func CreateScrapeJob(payload *HTTPResponsePayload) *pipeline.Job
- type HTTPRequestPayload
- type HTTPResponsePayload
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func CreateDNSJob ¶
CreateDNSJob fetches a hostname's A record and creates the record.
func CreateHTTPJob ¶
CreateHTTPJob performs a HTTP GET request and returns the corresponding records.
func CreateRenderJob ¶
CreateRenderJob creates a render of the site.
func CreateRobotsJob ¶
CreateRobotsJob fetches the Robots TXT file.
func CreateScrapeJob ¶
func CreateScrapeJob(payload *HTTPResponsePayload) *pipeline.Job
CreateScrapeJob extracts URLs in use by the target.
Types ¶
type HTTPRequestPayload ¶
type HTTPRequestPayload struct { warc.RawPayload Request *http.Request }
HTTPRequestPayload is the payload of a HTTP request.
type HTTPResponsePayload ¶
type HTTPResponsePayload struct { warc.RawPayload Response *http.Response }
HTTPResponsePayload is the payload of a HTTP response.
Click to show internal directories.
Click to hide internal directories.