web2image

command module
v0.0.0-...-0c00936 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Dec 28, 2016 License: GPL-2.0 Imports: 16 Imported by: 0

README

Web2Image

Go tool to convert web pages to screenshots and categorize the websites.

Prerequisite

Install WebKit2Png Dependencies

apt-get install python-qt4 libqtwebkit4 xvfb
pip install webkit2png
Install

Download Zip and Build

go build ./web2image.go
Usage
> web2image -h

Usage of ./web2image:
  -category string
    	Categories File (default "categories.csv")
  -list string
    	File containing urls to scan (default "websites.txt")
  -out string
    	Name of json output (default "web2image.json")
  -threads int
    	# of Threads to run (default 5)
  -timeout int
    	HTTP Timeout in seconds (default 7)
  -verbose
    	Enable verbose output
Example
> web2image -list=websites.txt -out=example.json -verbose=true
             _    ___ _
 __ __ _____| |__|_  |_)_ __  __ _ __ _ ___
 \ V  V / -_) '_ \/ /| | '  \/ _' / _' / -_)
  \_/\_/\___|_.__/___|_|_|_|_\__,_\__, \___|
                                  |___/   
Worker 0 URL: http://google.com
Worker 1 URL: http://localhost
Worker 2 URL: http://localhost:8080
Redirect from: http://google.com -> http://www.google.com/
[+] Rendered: http_localhost_8080.png
[+] Rendered: http_localhost.png
[+] Rendered: http_google_com.png
Starting grouping
Finished Grouping
Websites being grouped based on title
Title does not exist: Apache Tomcat
Title does not exist: Apache2 Debian Default Page It works
Title does not exist: Google
Finished Grouping Websites
Starting Categorization
Categorizing Group: Apache Tomcat
Categorizing Group: Apache2 Debian Default Page It works
Categorizing Group: Google
Categorization done

Note

If using within a screen and images are empty, prefix the command with the following:

xvfb-run --server-args="-screen 0, 1024x768x24" ./web2image -list=websites.txt -out=example.json -verbose=true
Parser

To use parser, drag and drop json to index.html

Authors
  • Mitchell Hennigan
License

This project is licensed under the GNU General Public License - see the LICENSE file for details

Acknowledgments

Documentation

The Go Gopher

There is no documentation for this package.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL