crawl

package
v0.0.0-...-bf5dde6
This package is not in the latest version of its module.

Published: May 29, 2024 License: GPL-3.0 Imports: 13 Imported by: 0

Documentation

Constants

This section is empty.

Variables

This section is empty.

Functions

func DismissPagePopups

func DismissPagePopups(page *rod.Page) error

DismissPagePopups tries to find and click on elements that are likely to be dismissible pop-ups.

Types

type ClickedElement

type ClickedElement struct {
	// contains filtered or unexported fields
}

type CrawlItem

type CrawlItem struct {
	// contains filtered or unexported fields
}

type CrawlOptions

type CrawlOptions struct {
	ExtraHeaders    map[string][]string
	MaxDepth        int
	MaxPagesToCrawl int
}
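
ExtraHeaders has the same shape as the standard library's `http.Header` (`map[string][]string`), so one plausible way to populate it is to build an `http.Header` and convert it. The sketch below mirrors the documented fields in a local `crawlOptions` type so that it compiles on its own; that type, the `newOptions` helper, and the header values are illustrative assumptions and are not part of this package.

```go
package main

import (
	"fmt"
	"net/http"
)

// crawlOptions is a local mirror of the documented CrawlOptions fields,
// declared here only so the sketch compiles without the real package.
type crawlOptions struct {
	ExtraHeaders    map[string][]string
	MaxDepth        int
	MaxPagesToCrawl int
}

// newOptions is a hypothetical helper that attaches an Authorization
// header and an Accept-Language header to a set of crawl limits.
func newOptions(token string, maxDepth, maxPages int) crawlOptions {
	h := http.Header{}
	h.Set("Authorization", "Bearer "+token) // Set replaces any existing values
	h.Add("Accept-Language", "en")          // Add appends to existing values
	return crawlOptions{
		// http.Header's underlying type is map[string][]string,
		// so a plain conversion is enough.
		ExtraHeaders:    map[string][]string(h),
		MaxDepth:        maxDepth,
		MaxPagesToCrawl: maxPages,
	}
}

func main() {
	opts := newOptions("example-token", 3, 100)
	fmt.Println(opts.ExtraHeaders["Authorization"], opts.MaxDepth, opts.MaxPagesToCrawl)
}
```

With the real package, the same map could presumably be assigned to `CrawlOptions.ExtraHeaders` or passed as the `extraHeaders` argument of `NewCrawler`. How the crawler applies these headers to requests is not documented here.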

type CrawledPageResut

type CrawledPageResut struct {
	URL            string
	DiscoveredURLs []string
	IsError        bool
}
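
The documentation does not say how these results are consumed. As a minimal sketch, assuming a caller wants the set of newly discovered URLs from pages that loaded without error, the fields could be aggregated as follows. The local `pageResult` type mirrors the documented fields, and `uniqueDiscovered` is a hypothetical helper, not a function of this package.

```go
package main

import "fmt"

// pageResult is a local mirror of the documented CrawledPageResut fields,
// declared here only so the sketch compiles without the real package.
type pageResult struct {
	URL            string
	DiscoveredURLs []string
	IsError        bool
}

// uniqueDiscovered is a hypothetical helper that returns the discovered
// URLs of all non-error results, deduplicated, in first-seen order.
// Skipping error results is an assumption about the desired behavior.
func uniqueDiscovered(results []pageResult) []string {
	seen := make(map[string]bool)
	var out []string
	for _, r := range results {
		if r.IsError {
			continue
		}
		for _, u := range r.DiscoveredURLs {
			if !seen[u] {
				seen[u] = true
				out = append(out, u)
			}
		}
	}
	return out
}

func main() {
	results := []pageResult{
		{URL: "https://example.com/", DiscoveredURLs: []string{"https://example.com/a", "https://example.com/b"}},
		{URL: "https://example.com/a", DiscoveredURLs: []string{"https://example.com/b"}},
		{URL: "https://example.com/broken", DiscoveredURLs: []string{"https://example.com/c"}, IsError: true},
	}
	fmt.Println(uniqueDiscovered(results))
}
```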

type Crawler

type Crawler struct {
	Options CrawlOptions
	// contains filtered or unexported fields
}

func NewCrawler

func NewCrawler(startURLs []string, maxPagesToCrawl int, maxDepth int, poolSize int, excludePatterns []string, workspaceID, taskID uint, extraHeaders map[string][]string) *Crawler

func (*Crawler) CreateScopeFromProvidedUrls

func (c *Crawler) CreateScopeFromProvidedUrls()

CreateScopeFromProvidedUrls creates scope items from the URLs the crawler was given.

func (*Crawler) Run

func (c *Crawler) Run() []*db.History

type SubmittedForm

type SubmittedForm struct {
	// contains filtered or unexported fields
}
