cutups

command module
v0.0.0-...-ea20db3 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 19, 2022 License: MIT Imports: 1 Imported by: 0

README

cutups

Using OCR to make digital cutups

example

src/words/example.jpg

usage

first install tesseract:

> sudo apt install tesseract-ocr

then download a high-quality newspaper (or other document). I like the papers from Library of Congress.

then convert the document into a 1.hocr file:

> tesseract <your-downloaded-image.jpg> 1 -c hocr_font_info=1 -c hocr_char_boxes=1 txt hocr

license

mit

Documentation

The Go Gopher

There is no documentation for this package.

Directories

Path Synopsis
src
hocr
hocr contains structures and functions for parsing and analysing hocr files
hocr contains structures and functions for parsing and analysing hocr files
line
line contains various functions to manipulate ocr lines
line contains various functions to manipulate ocr lines
prob
prob processes .prob files generated by ocropus
prob processes .prob files generated by ocropus

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL