shelob

command module
v0.0.0-...-bdb5213 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Oct 12, 2024 License: MIT Imports: 1 Imported by: 0

README

Shelob

Simple web crawler

Installation
go install github.com/ovechkin-dm/shelob@latest
Example usage
shelob --baseurl=https://google.com --debug --downloadpath="./downloads/" --resume --workers=10
Options
Usage of ./shelob:
      --baseurl string        Base URL for the crawler
      --debug                 Enable debug mode
      --downloadpath string   Base path to download content (default "./downloads/")
      --resume                Resume previous download
      --workers int           Number of workers to use (default 1)
Roadmap
  • Basic web crawler
  • Parallel download
  • Resume download
  • Better url sanitization
  • Better handling of download directories (e.g. create directory only if subdirectories shoould be created)
  • Improved logging
  • Retries / circuit breaker

Documentation

The Go Gopher

There is no documentation for this package.

Directories

Path Synopsis

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL