Directories ¶
Path | Synopsis |
---|---|
addtoqueue adds a message to a queue.
|
addtoqueue adds a message to a queue. |
bookpipeline is the core command of the bookpipeline package, which watches queues for messages and does various OCR related tasks when it receives them, saving the results in cloud storage.
|
bookpipeline is the core command of the bookpipeline package, which watches queues for messages and does various OCR related tasks when it receives them, saving the results in cloud storage. |
booktopipeline uploads a book to cloud storage and adds the name to a queue ready to be processed by the bookpipeline tool.
|
booktopipeline uploads a book to cloud storage and adds the name to a queue ready to be processed by the bookpipeline tool. |
confgraph creates a graph showing the average word confidence of each page of hOCR in a directory.
|
confgraph creates a graph showing the average word confidence of each page of hOCR in a directory. |
getallhocrs downloads every 'best' file from a set of OCRed books stored on cloud infrastructure
|
getallhocrs downloads every 'best' file from a set of OCRed books stored on cloud infrastructure |
getandpurgequeue gets and deletes all messages from a queue.
|
getandpurgequeue gets and deletes all messages from a queue. |
getbests downloads every 'best' file from a set of OCRed books stored on cloud infrastructure
|
getbests downloads every 'best' file from a set of OCRed books stored on cloud infrastructure |
getpipelinebook downloads the pipeline results for a book.
|
getpipelinebook downloads the pipeline results for a book. |
getsamplepages downloads sample pages from each book in a set of OCRed books
|
getsamplepages downloads sample pages from each book in a set of OCRed books |
getstats gets relevant files for creating statistics from a set of OCRed books stored on cloud infrastructure
|
getstats gets relevant files for creating statistics from a set of OCRed books stored on cloud infrastructure |
logwholequeue gets all messages in a queue.
|
logwholequeue gets all messages in a queue. |
lspipeline lists useful things related to the book pipeline.
|
lspipeline lists useful things related to the book pipeline. |
lspipeline-ng lists useful things related to the book pipeline.
|
lspipeline-ng lists useful things related to the book pipeline. |
mkpipeline sets up the necessary buckets and queues for the book pipeline.
|
mkpipeline sets up the necessary buckets and queues for the book pipeline. |
pagegraph creates a graph showing the average confidence of each word in a page of hOCR.
|
pagegraph creates a graph showing the average confidence of each word in a page of hOCR. |
pdfbook creates a searchable PDF from a directory of hOCR and image files.
|
pdfbook creates a searchable PDF from a directory of hOCR and image files. |
rescribe is a modification of bookpipeline designed for local-only operation, which rolls uploading, processing, and downloading of a single book by the pipeline into one command.
|
rescribe is a modification of bookpipeline designed for local-only operation, which rolls uploading, processing, and downloading of a single book by the pipeline into one command. |
rmbook removes a book from cloud storage.
|
rmbook removes a book from cloud storage. |
spotme creates new spot instances for the book pipeline.
|
spotme creates new spot instances for the book pipeline. |
trimqueue deletes any messages in a queue that match a specified prefix.
|
trimqueue deletes any messages in a queue that match a specified prefix. |
Click to show internal directories.
Click to hide internal directories.