Documentation ¶
Overview ¶
Package read provides methods for parsing PDF files into memory.
The in memory representation of a PDF file is called a PDFContext.
The PDFContext is a container for the PDF cross reference table and stats.
Index ¶
- func GetEncodedStreamContent(ctx *types.PDFContext, streamDict *types.PDFStreamDict) ([]byte, error)
- func PDFFile(fileName string, config *types.Configuration) (ctx *types.PDFContext, err error)
- func ScanLines(data []byte, atEOF bool) (advance int, token []byte, err error)
- func Verbose(verbose bool)
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func GetEncodedStreamContent ¶
func GetEncodedStreamContent(ctx *types.PDFContext, streamDict *types.PDFStreamDict) ([]byte, error)
GetEncodedStreamContent loads the encoded stream content from file into PDFStreamDict.
func PDFFile ¶
func PDFFile(fileName string, config *types.Configuration) (ctx *types.PDFContext, err error)
PDFFile reads in a PDFFile and generates a PDFContext, an in-memory representation containing a cross reference table.
func ScanLines ¶ added in v0.0.6
ScanLines is a split function for a Scanner that returns each line of text, stripped of any trailing end-of-line marker. The returned line may be empty. The end-of-line marker is one carriage return followed by one newline or one carriage return or one newline. The last non-empty line of input will be returned even if it has no newline.
Types ¶
This section is empty.