gocatdoc

package module
v0.0.4 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Aug 9, 2023 License: MIT Imports: 14 Imported by: 0

README

Go-catdoc, get text and metadata from .doc files.

GoDoc Tests

Uses Wazero to run catdoc as webassembly in Go. The catdoc source is slightly modified to support reading metadata in .doc.
The msdoc.hexpat file is a pattern file for imhex that can parse the summaryinformation ole object inside the .doc file.

To compile the webassembly binary, go to ./catdoc/src/ and run make catdoc-wasm.
To run the tests, do go test ./...

Usage:

f, err := os.Open("test.doc")
text, err := gocatdoc.GetTextFromFile(f)

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL