parse

package

v0.4.0 Latest Latest Go to latest Published: Jul 14, 2017 License: Apache-2.0 Imports: 11 Imported by: 56

Details

Valid go.mod file

The Go module system was introduced in Go 1.11 and is the official dependency management solution for Go.
Redistributable license

Redistributable licenses place minimal restrictions on how software can be used, modified, and redistributed.
Tagged version

Modules with tagged versions give importers more predictable builds.
Stable version

When a project reaches major version v1 it is considered stable.
Learn more about best practices

Repository

github.com/google/gapid

Links

Open Source Insights

Documentation ¶

Overview ¶

Package parse provides support functionality for writing scannerless parsers.

The main entry point is the Parse function. It works by setting up a parser on the supplied content, and then invoking the supplied root parsing function. The CST is built for you inside the ParseLeaf and ParseBranch methods of the parser, but it is up to the supplied parsing functions to hold on to the CST if you want it, and also to build the AST.

Index ¶

Constants
Variables
type Branch
- func (n *Branch) AddPrefix(s Separator)
- func (n *Branch) AddSuffix(s Separator)
- func (n *Branch) Parent() *Branch
- func (n *Branch) Prefix() Separator
- func (n *Branch) Suffix() Separator
- func (n *Branch) Token() Token
- func (n *Branch) WriteTo(w io.Writer) error
type BranchParser
type CSTMap
- func NewCSTMap() CSTMap
type Error
- func Parse(root BranchParser, filename, data string, skip Skip, m CSTMap) []Error
- func (err Error) Error() string
- func (err Error) Format(f fmt.State, c rune)
type ErrorList
- func (l *ErrorList) Add(r *Reader, at Fragment, message string, args ...interface{})
- func (errs ErrorList) Error() string
type Fragment
- func NewFragment(token Token) Fragment
type Leaf
- func (n *Leaf) AddPrefix(s Separator)
- func (n *Leaf) AddSuffix(s Separator)
- func (n *Leaf) Parent() *Branch
- func (n *Leaf) Prefix() Separator
- func (n *Leaf) SetToken(token Token)
- func (n *Leaf) Suffix() Separator
- func (n *Leaf) Token() Token
- func (n *Leaf) WriteTo(w io.Writer) error
type LeafParser
type Node
type NumberKind
type Parser
- func (p *Parser) Error(message string, args ...interface{})
- func (p *Parser) ErrorAt(at interface{}, message string, args ...interface{})
- func (p *Parser) Expected(value string)
- func (p *Parser) Extend(ast interface{}, do BranchParser)
- func (p *Parser) ParseBranch(cst *Branch, do BranchParser)
- func (p *Parser) ParseLeaf(cst *Branch, do LeafParser)
type Reader
- func NewReader(filename string, data string) *Reader
- func (r *Reader) Advance()
- func (r *Reader) AdvanceN(n int)
- func (r *Reader) AlphaNumeric() bool
- func (r *Reader) Consume() Token
- func (r *Reader) EOL() bool
- func (r *Reader) GuessNextToken() Token
- func (r *Reader) IsEOF() bool
- func (r *Reader) IsEOL() bool
- func (r *Reader) NotSpace() bool
- func (r *Reader) Numeric() NumberKind
- func (r *Reader) Peek() rune
- func (r *Reader) PeekN(n int) rune
- func (r *Reader) Rollback()
- func (r *Reader) Rune(value rune) bool
- func (r *Reader) SeekRune(value rune) bool
- func (r *Reader) Space() bool
- func (r *Reader) String(value string) bool
- func (r *Reader) Token() Token
type Separator
- func (s Separator) WriteTo(w io.Writer) error
type Skip
- func NewSkip(line, blockstart, blockend string) Skip
type SkipMode
type Source
- func (s Source) RelativeFilename() string
type Token
- func (t Token) At() string
- func (t Token) Cursor() (line int, column int)
- func (t Token) Format(f fmt.State, c rune)
- func (t Token) Len() int
- func (t Token) Less(rhs Token) bool
- func (t Token) String() string

Constants ¶

View Source

const (
	// AbortParse is paniced when a parse cannot continue. It is recovered at the
	// top level, to allow the errors to be cleanly returned to the caller.
	AbortParse = fault.Const("abort")
)

Variables ¶

View Source

var (
	// ParseErrorLimit is the maximum number of errors before a parse is aborted.
	ParseErrorLimit = 10
)

Functions ¶

This section is empty.

Types ¶

type Branch ¶

type Branch struct {

	// Children is the slice of child nodes for this Branch.
	Children []Node
	// contains filtered or unexported fields
}

Branch is a CST node that can have children.

func (*Branch) AddPrefix ¶

func (n *Branch) AddPrefix(s Separator)

func (*Branch) AddSuffix ¶

func (n *Branch) AddSuffix(s Separator)

func (*Branch) Parent ¶

func (n *Branch) Parent() *Branch

func (*Branch) Prefix ¶

func (n *Branch) Prefix() Separator

func (*Branch) Suffix ¶

func (n *Branch) Suffix() Separator

func (*Branch) Token ¶

func (n *Branch) Token() Token

func (*Branch) WriteTo ¶

func (n *Branch) WriteTo(w io.Writer) error

type BranchParser ¶

type BranchParser func(p *Parser, cst *Branch)

BranchParser is a function that is passed to ParseBranch. It is handed the Branch to fill in and the Parser to fill it from, and must either succeed or add an error to the parser.

type CSTMap ¶

type CSTMap interface {
	// SetCST is called to map an ast node to a CST node.
	SetCST(ast interface{}, cst Node)
	// CST is called to lookup the CST mapping for an AST node.
	CST(ast interface{}) Node
}

CSTMap is the interface to an object into which ast<->cts mappings are stored.

func NewCSTMap ¶

func NewCSTMap() CSTMap

NewCSTMap returns a new CSTMap instance.

type Error ¶

type Error struct {
	// At is the parse fragment that was being processed when the error was encountered.
	At Fragment
	// Message is the message associated with the error.
	Message string
	// Stack is the captured stack trace at the point the error was noticed.
	Stack []byte
}

Error represents the information that us useful in debugging a parse failure.

func Parse ¶

func Parse(root BranchParser, filename, data string, skip Skip, m CSTMap) []Error

Parse is the main entry point to the parse library. Given a root parse function, the input string and the Skip controller, it builds and initializes a Parser, runs the root using it, verifies it worked correctly and then returns the errors generated if any.

func (Error) Error ¶

func (err Error) Error() string

func (Error) Format ¶

func (err Error) Format(f fmt.State, c rune)

type ErrorList ¶

type ErrorList []Error

ErrorList is a convenience type for managing lists of errors.

func (*ErrorList) Add ¶

func (l *ErrorList) Add(r *Reader, at Fragment, message string, args ...interface{})

func (ErrorList) Error ¶

func (errs ErrorList) Error() string

type Fragment ¶

type Fragment interface {
	// Token returns the underlying token of this node.
	Token() Token
	// Write is used to write the underlying token out to the writer.
	WriteTo(io.Writer) error
}

Fragment is a component of a cst that is backed by a token. This includes Nodes and all forms of space and comment.

func NewFragment ¶

func NewFragment(token Token) Fragment

type Leaf ¶

type Leaf struct {
	// contains filtered or unexported fields
}

Leaf nodes are part of the cst that cannot have child nodes, they represent a single token from the input.

func (*Leaf) AddPrefix ¶

func (n *Leaf) AddPrefix(s Separator)

func (*Leaf) AddSuffix ¶

func (n *Leaf) AddSuffix(s Separator)

func (*Leaf) Parent ¶

func (n *Leaf) Parent() *Branch

func (*Leaf) Prefix ¶

func (n *Leaf) Prefix() Separator

func (*Leaf) SetToken ¶

func (n *Leaf) SetToken(token Token)

func (*Leaf) Suffix ¶

func (n *Leaf) Suffix() Separator

func (*Leaf) Token ¶

func (n *Leaf) Token() Token

func (*Leaf) WriteTo ¶

func (n *Leaf) WriteTo(w io.Writer) error

type LeafParser ¶

type LeafParser func(p *Parser, cst *Leaf)

LeafParser is a function that is passed to ParseLeaf. It is handed the Leaf to fill in and the Parser to fill it from, and must either succeed or add an error to the parser.

type Node ¶

type Node interface {
	Fragment
	// Parent returns the Branch that this node is under.
	Parent() *Branch
	// Prefix returns the set of skippable fragments associated with this Node
	// that precede it in the stream. Association is defined by the Skip function
	// in use.
	Prefix() Separator
	// AddPrefix adds more fragments to the Prefix list.
	AddPrefix(Separator)
	// Suffix returns the set of skippable fragments associated with this Node
	// that follow it in the stream. Association is defined by the Skip function
	// in use, the default is until the end of the line.
	Suffix() Separator
	// AddSuffix adds more fragments to the Suffix list.
	AddSuffix(Separator)
}

Node is a Fragment in a cst that represents unskipped tokens.

type NumberKind ¶

type NumberKind uint8

NumberKind is a type used by Reader.Numeric for identifying various kinds of numbers.

const (
	// No number was found.
	NotNumeric NumberKind = iota
	// A decimal number.
	Decimal
	// An octal number, starting with "0". PS: A lone "0" is classified as octal.
	Octal
	// A hexadecimal number, starting with "0x".
	Hexadecimal
	// A floating point number: "123.456". Whole and the fractional parts are optional (but
	// not both at the same time).
	Floating
	// A floating point number in scientific notation: "123.456e±789". The fractional part,
	// the dot and the exponent sign are all optional.
	Scientific
)

type Parser ¶

type Parser struct {
	Reader           // The token reader for this parser.
	CSTMap           // A map of cst nodes to ast nodes
	Errors ErrorList // The set of errors generated during the parse.
	// contains filtered or unexported fields
}

Parser contains all the context needed while parsing. They are built for you by the Parse function.

func (*Parser) Error ¶

func (p *Parser) Error(message string, args ...interface{})

Error adds a new error to the parser error list. It will attempt to consume a token from the reader to act as a place holder, and also to ensure progress is made in the presence of errors.

func (*Parser) ErrorAt ¶

func (p *Parser) ErrorAt(at interface{}, message string, args ...interface{})

ErrorAt is like Error, except because it is handed a fragment, it will not try to consume anything itself.

func (*Parser) Expected ¶

func (p *Parser) Expected(value string)

Expected is a wrapper around p.ErrorAt for the very common case of an unexpected input. It uses value as the expected input, and parses a token of the stream for the unexpected actual input.

func (*Parser) Extend ¶

func (p *Parser) Extend(ast interface{}, do BranchParser)

Extend inserts a new branch between ast and its parent, and calls do() with the newly insterted branch.

Extend will transform:

n.parent ──> n

to:

n.parent ──> b ──> n

where b is passed as the second argument to do().

func (*Parser) ParseBranch ¶

func (p *Parser) ParseBranch(cst *Branch, do BranchParser)

ParseBranch adds a new Branch to cst and then calls the do function to parse the branch. This is called recursively to build the node tree.

func (*Parser) ParseLeaf ¶

func (p *Parser) ParseLeaf(cst *Branch, do LeafParser)

ParseLeaf adds a new Leaf to cst and then calls the do function to parse the Leaf. If do is nil, a leaf will be built with the current unconsumed input.

type Reader ¶

type Reader struct {
	Source *Source // The source being parsed.
	// contains filtered or unexported fields
}

Reader is the interface to an object that converts a rune array into tokens.

func NewReader ¶

func NewReader(filename string, data string) *Reader

NewReader creates a new reader which reads from the supplied string.

func (*Reader) Advance ¶

func (r *Reader) Advance()

Advance moves the cursor one rune forward.

func (*Reader) AdvanceN ¶

func (r *Reader) AdvanceN(n int)

AdvanceN moves the cursor n runes forward.

func (*Reader) AlphaNumeric ¶

func (r *Reader) AlphaNumeric() bool

AlphaNumeric moves past anything that starts with a letter or underscore, and consists of letters, numbers or underscores. It returns true if the pattern was matched, false otherwise.

func (*Reader) Consume ¶

func (r *Reader) Consume() Token

Consume consumes the current token.

func (*Reader) EOL ¶

func (r *Reader) EOL() bool

EOL moves the cursor past the EOL and returns true if the cursor is at the end of a line (\n or \r\n), otherwise EOL returns false and does nothing.

func (*Reader) GuessNextToken ¶

func (r *Reader) GuessNextToken() Token

GuessNextToken attempts to do a general purpose consume of a single arbitrary token from the stream. It is used by error handlers to indicate where the error occurred. It guarantees that if the stream is not finished, it will consume at least one character.

func (*Reader) IsEOF ¶

func (r *Reader) IsEOF() bool

IsEOF returns true when the cursor is at the end of the input.

func (*Reader) IsEOL ¶

func (r *Reader) IsEOL() bool

IsEOL returns true when the cursor is at the end of a line (\n or \r\n). IsEOL does not move the cursor.

func (*Reader) NotSpace ¶

func (r *Reader) NotSpace() bool

Space skips over any non whitespace, returning true if it advanced the cursor.

func (*Reader) Numeric ¶

func (r *Reader) Numeric() NumberKind

Numeric tries to move past the common number pattern. It returns a constant of type NumberKind describing the kind of number it found.

func (*Reader) Peek ¶

func (r *Reader) Peek() rune

Peek returns the next rune without advancing the cursor.

func (*Reader) PeekN ¶

func (r *Reader) PeekN(n int) rune

PeekN returns the n'th next rune without advancing the cursor.

func (*Reader) Rollback ¶

func (r *Reader) Rollback()

Rollback sets the cursor back to the last consume point.

func (*Reader) Rune ¶

func (r *Reader) Rune(value rune) bool

Rune advances and returns true if the next rune after the cursor matches value.

func (*Reader) SeekRune ¶

func (r *Reader) SeekRune(value rune) bool

SeekRune advances the cursor until either the value is found or the end of stream is reached. It returns true if it found value, false otherwise.

func (*Reader) Space ¶

func (r *Reader) Space() bool

Space skips over any whitespace, returning true if it advanced the cursor.

func (*Reader) String ¶

func (r *Reader) String(value string) bool

String checks to see if value occurs at cursor, if it does, it advances the cursor past it and returns true.

func (*Reader) Token ¶

func (r *Reader) Token() Token

Token peeks at the current scanned token value. It does not consume anything.

type Separator ¶

type Separator []Fragment

Separator is a list type to manage fragments that were skipped.

func (Separator) WriteTo ¶

func (s Separator) WriteTo(w io.Writer) error

type Skip ¶

type Skip func(parser *Parser, mode SkipMode) Separator

Skip is the function used to skip separating tokens. A separating token is one where, as far as the parser is concerned, the tokens do not exist, even though the tokens may have been necessary to separate the lexical tokens (whitespace), or carry useful information (comments).

func NewSkip ¶

func NewSkip(line, blockstart, blockend string) Skip

NewSkip builds a Skip function for the common case of a parser that has one type of line comment, one type of block comment, and want to treat all unicode space characters as skippable.

type SkipMode ¶

type SkipMode int

const (
	// SkipPrefix is the skip mode that skips tokens that are associated with
	// the following lexically relevant token. This is mostly important for
	// comment association.
	SkipPrefix SkipMode = iota
	// SkipSuffix is the skip mode that skips tokens that are associated with
	// the preceding lexically relevant token. This is mostly important for
	// comment association.
	SkipSuffix
)

type Source ¶

type Source struct {
	Filename string // The path which was used to load the source (if any).
	Runes    []rune // The full content of the source file.
}

func (Source) RelativeFilename ¶

func (s Source) RelativeFilename() string

RelativeFilename returns the filename relative to the current working directory.

type Token ¶

type Token struct {
	Source *Source // The source object this token is from (including the full rune array).
	Start  int     // The start of the token in the full rune array.
	End    int     // One past the end of the token.
}

A Token represents the smallest consumed unit input.

func (Token) At ¶

func (t Token) At() string

At returns the token's location in the form: filename:line:column

func (Token) Cursor ¶

func (t Token) Cursor() (line int, column int)

Cursor is used to calculate the line and column of the start of the token. It may be very expensive to call, and is intended to be used sparingly in producing human readable error messages only.

func (Token) Format ¶

func (t Token) Format(f fmt.State, c rune)

Format implements fmt.Formatter writing the start end and value of the token.

func (Token) Len ¶

func (t Token) Len() int

Len returns the length of the token in runes.

func (Token) Less ¶

func (t Token) Less(rhs Token) bool

Less returns true if t comes before rhs when considering source files and offsets within the same file.

func (Token) String ¶

func (t Token) String() string

String returns the string form of the rune range the token represents.

Source Files ¶

View all Source files

Directories ¶

Path	Synopsis
test

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL