Documentation ¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func ReorderAndCrop ¶
func ReorderAndCrop(conf *config.ParsingConfig, url *url.URL)
ReorderAndCrop removes the anchor (#sth) Fragment, sorts, removes and encodes query string parameters and lowercases Host
func ShallParse ¶
func ShallParse(conf *config.ParsingConfig, url string) bool
ShallParse checks whether site's source should be parsed
func StripProtocol ¶
StripProtocol strips a protocol from URL represented in string
Types ¶
type Generator ¶
type Generator struct { WorkerQueue *channels.InfiniteChannel // contains filtered or unexported fields }
func NewGenerator ¶
NewSitemapGenerator constructs a new sitemap generator instance, Call Start() in order to start the proccesszz
type Validator ¶
Validator manages address flow by pushing them to certain creation proccesses and makes sure no links are parsed twice.
func NewValidator ¶
func NewValidator(config config.Config, workerQueue *channels.InfiniteChannel, waitGroup *sync.WaitGroup, robots *robotstxt.RobotsData, generator chan string) *Validator
NewValidator creates a new validator instance
Click to show internal directories.
Click to hide internal directories.