parser

package
v0.0.0-...-13a6da0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Aug 27, 2024 License: Apache-2.0 Imports: 21 Imported by: 0

README

Generated Documentation and Embedded Help Texts

Many parts of the parser package include special consideration for generation or other production of user-facing documentation. This includes interactive help messages, generated documentation of the set of available functions, or diagrams of the various expressions.

Generated documentation is produced and maintained at compile time, while the interactive, contextual help is returned at runtime.

We equip the generated parser with the ability to report contextual help in two circumstances:

  • when the user explicitly requests help with the HELPTOKEN (current syntax: standalone "??")
  • when the user makes a grammatical mistake (e.g. INSERT sometable INTO(x, y) ...)

We use the docgen tool to produce the generated documentation files that are then included in the broader (handwritten) published documentation.

Help texts embedded in the grammar

The help is embedded in the grammar using special markers in yacc comments, for example:

// %Help: HELPKEY - shortdescription
// %Category: SomeCat
// %Text: whatever until next %marker at start of line, or non-comment.
// %SeeAlso: whatever until next %marker at start of line, or non-comment.
// %End (optional)

The "HELPKEY" becomes the map key in the generated Go map.

These texts are extracted automatically by help.awk and converted into a Go data structure in help_messages.go.

Support in the parser

Primary mechanism - LALR error recovery

The primary mechanism is leveraging error recovery in LALR parsers using the special error token [1] [2]: when an unexpected token is encountered, the LALR parser will pop tokens on the stack until the prefix matches a grammar rule with the special "error" token (if any). If such a rule exists, its action is used to reduce and the erroneous tokens are discarded.

This mechanism is used both when the user makes a mistake, and when the user inserts the HELPTOKEN in the middle of a statement. When present in the middle of a statement, HELPTOKEN is considered an error and triggers the error recovery.

We use this for contextual help by providing error rules that generate a contextual help text during LALR error recovery.

For example:

backup_stmt:
  BACKUP targets TO string_or_placeholder opt_as_of_clause opt_incremental opt_with_options
  {
    $$.val = &Backup{Targets: $2.targetList(), To: $4.expr(), IncrementalFrom: $6.exprs(), AsOf: $5.asOfClause(), Options: $7.kvOptions()}
  }
| BACKUP error { return helpWith(sqllex, `BACKUP`) }

In this example, the grammar specifies that if the BACKUP keyword is followed by some input tokens such that the first (valid) grammar rule doesn't apply, the parser will "recover from the error" by backtracking up until the point it only sees BACKUP on the stack followed by non-parsable tokens, at which points it takes the error rule and executes its action.

The action is return helpWith(...). What this does is:

  • halts parsing (the generated parser executes all actions in a big loop; a return interrupts this loop);
  • makes the parser return with an error (the helpWith function returns non-zero);
  • extends the parsing error message with a help text; this help text can subsequently be exploited in a client to display the help message in a friendly manner.
Code generation

Since the pattern "{ return helpWith(sqllex, ...) }" is common, we also implement a shorthand syntax based on comments, for example:

backup_stmt:
   ...
| BACKUP error // SHOW HELP: BACKUP

The special comment syntax "SHOW HELP: XXXX" is substituted by means of an auxiliary script (replace_help_rules.awk) into the form explained above.

Secondary mechanism - explicit help token

The mechanism described above works both when the user make a grammatical error and when they place the HELPTOKEN in the middle of a statement, rendering it invalid.

However for contextual help this is not sufficient: what happens if the user requests HELPTOKEN at a position in the grammar where everything before is a complete, valid SQL input?

For example: DELETE FROM foo ?

When encountering this input, the LALR parser will see DELETE FROM foo first, then reduce using the DELETE action because everything up to this point is a valid DELETE statement. When the HELPTOKEN is encountered, the statement has already been completed and the LALR parser doesn't 'know' any more that it was in the context of a DELETE statement.

If we try to place an error-based recovery rule at the top-level:

stmt:
  alter_stmt
| backup_stmt
| ...
| delete_stmt
| ...
| error { ??? }

This wouldn't work: the code inside the error action cannot "observe" the tokens observed so far and there would be no way to know whether the error should be about DELETE, or instead about ALTER, BACKUP, etc.

So in order to handle HELPTOKEN after a valid statement, we must place it in a rule where the context is still available, that is before the statement's grammar rule is reduced.

Where would that be? Suppose we had a simple statement rule:

somesimplestmt:
  SIMPLE DO SOMETHING { $$ = reduce(...) }
| SIMPLE error { help ... }

We could extend with:

somesimplestmt:
  SIMPLE DO SOMETHING { $$ = reduce(...) }
| SIMPLE DO SOMETHING HELPTOKEN { help ... }
| SIMPLE error { help ... }

(the alternative also works:

somesimplestmt:
  SIMPLE DO SOMETHING { $$ = reduce(...) }
| SIMPLE DO SOMETHING error { help ... }
| SIMPLE error { help ... }

)

That is all fine and dandy, but in SQL we have statements with many alternate forms, for example:

alter_rename_table_stmt:
  ALTER TABLE relation_expr RENAME TO qualified_name { ... }
| ALTER TABLE IF EXISTS relation_expr RENAME TO qualified_name { ... }
| ALTER TABLE relation_expr RENAME opt_column name TO name { ... }
| ALTER TABLE IF EXISTS relation_expr RENAME opt_column name TO name { ... }

To add complementary handling of the help token at the end of valid statements we could, but would hate to, duplicate all the rules:

alter_rename_table_stmt:
  ALTER TABLE relation_expr RENAME TO qualified_name { ... }
| ALTER TABLE relation_expr RENAME TO qualified_name HELPTOKEN { help ... }
| ALTER TABLE IF EXISTS relation_expr RENAME TO qualified_name { ... }
| ALTER TABLE IF EXISTS relation_expr RENAME TO qualified_name HELPTOKEN { help ... }
| ALTER TABLE relation_expr RENAME opt_column name TO name { ... }
| ALTER TABLE relation_expr RENAME opt_column name TO name HELPTOKEN { help ... }
| ALTER TABLE IF EXISTS relation_expr RENAME opt_column name TO name { ... }
| ALTER TABLE IF EXISTS relation_expr RENAME opt_column name TO name HELPTOKEN { help ... }

This duplication is horrendous (not to mention hard to maintain), so instead we should attempt to factor the help token in a context where it is still known that we are dealing just with that statement.

The following works:

alter_rename_table_stmt:
  real_alter_rename_table_stmt { $$ = $1 }
| real_alter_rename_table_stmt HELPTOKEN { help ... }

real_alter_rename_table_stmt:
  ALTER TABLE relation_expr RENAME TO qualified_name { ... }
| ALTER TABLE IF EXISTS relation_expr RENAME TO qualified_name { ... }
| ALTER TABLE relation_expr RENAME opt_column name TO name { ... }
| ALTER TABLE IF EXISTS relation_expr RENAME opt_column name TO name { ... }

Or does it? Without anything else, yacc complains with a "shift/reduce conflict". The reason is coming from the ambiguity: when the parsing stack contains everything sufficient to match a real_alter_rename_table_stmt, there is a choice between reducing the simple form alter_rename_table_stmt: real_alter_rename_table_stmt, or shifting into the more complex form alter_rename_table_stmt: real_alter_rename_table_stmt HELPTOKEN.

This is another form of the textbook situation when yacc is used to parse if-else statements in a programming language: the rule stmt: IF cond THEN body | IF cond THEN body ELSE body is ambiguous (and yields a shift/reduce conflict) for exactly the same reason.

The solution here is also straight out of a textbook: one simply informs yacc of the relative priority between the two candidate rules. In this case, when faced with a neutral choice, we encourage yacc to shift. The particular mechanism is to tell yacc that one rule has a higher priority than another.

It just so happens however that the yacc language only allows us to set relative priorites of tokens, not rules. And here we have a problem, of the two rules that need to be prioritized, only one has a token to work with (the one with HELPTOKEN). Which token should we prioritze for the other?

Conveniently yacc knows about this trouble and offers us an awkward, but working solution: we can tell it "use for this rule the same priority level as an existing token, even though the token is not part of the rule". The syntax for this is rule %prec TOKEN.

We can then use this as follows:

alter_rename_table_stmt:
  real_alter_rename_table_stmt           %prec LOWTOKEN { $$ = $1 }
| real_alter_rename_table_stmt HELPTOKEN %prec HIGHTOKEN { help ... }

We could create two new pseudo-tokens for this (called LOWTOKEN and HIGHTOKEN) however conveniently we can also reuse otherwise valid tokens that have known relative priorities. We settled in our case on VALUES (low priority) and UMINUS (high priority).

Code generation

With the latter mechanism presented above the pattern

rule:
  somerule           %prec VALUES
| somerule HELPTOKEN %prec UMINUS { help ...}`

becomes super common, so we automate it with the following special syntax:

rule:
  somerule // EXTEND WITH HELP: XXX

And the code replacement in replace_help_rules.awk expands this to the form above automatically.

Generated Documentation

Documentation of the SQL functions and operators is generated by the docgen utility, using make generate PKG=./docs/.... The markdown-formatted files are kept in docs/generated/sql and should be re-generated whenever the functions/operators they document change, and indeed if regenerating produces a diff, a CI failure is expected.

References

  1. https://www.gnu.org/software/bison/manual/html_node/Error-Recovery.html
  2. http://stackoverflow.com/questions/9796608/error-handling-in-yacc

Documentation

Index

Constants

View Source
const ABORT = lex.ABORT
View Source
const ACTION = lex.ACTION
View Source
const ADD = lex.ADD
View Source
const ADMIN = lex.ADMIN
View Source
const AGGREGATE = lex.AGGREGATE
View Source
const ALL = lex.ALL
View Source
const ALTER = lex.ALTER
View Source
const ANALYSE = lex.ANALYSE
View Source
const ANALYZE = lex.ANALYZE
View Source
const AND = lex.AND
View Source
const AND_AND = lex.AND_AND
View Source
const ANNOTATE_TYPE = lex.ANNOTATE_TYPE
View Source
const ANY = lex.ANY
View Source
const ARRAY = lex.ARRAY
View Source
const AS = lex.AS
View Source
const ASC = lex.ASC
View Source
const ASYMMETRIC = lex.ASYMMETRIC
View Source
const AS_LA = lex.AS_LA
View Source
const AT = lex.AT
View Source
const AUTHORIZATION = lex.AUTHORIZATION
View Source
const AUTOMATIC = lex.AUTOMATIC
View Source
const BACKUP = lex.BACKUP
View Source
const BCONST = lex.BCONST
View Source
const BEGIN = lex.BEGIN
View Source
const BETWEEN = lex.BETWEEN
View Source
const BIGINT = lex.BIGINT
View Source
const BIGSERIAL = lex.BIGSERIAL
View Source
const BIT = lex.BIT
View Source
const BITCONST = lex.BITCONST
View Source
const BLOB = lex.BLOB
View Source
const BOOL = lex.BOOL
View Source
const BOOLEAN = lex.BOOLEAN
View Source
const BOTH = lex.BOTH
View Source
const BUCKET_COUNT = lex.BUCKET_COUNT
View Source
const BUNDLE = lex.BUNDLE
View Source
const BY = lex.BY
View Source
const BYTEA = lex.BYTEA
View Source
const BYTES = lex.BYTES
View Source
const CACHE = lex.CACHE
View Source
const CANCEL = lex.CANCEL
View Source
const CASCADE = lex.CASCADE
View Source
const CASE = lex.CASE
View Source
const CAST = lex.CAST
View Source
const CHANGEFEED = lex.CHANGEFEED
View Source
const CHAR = lex.CHAR
View Source
const CHARACTER = lex.CHARACTER
View Source
const CHARACTERISTICS = lex.CHARACTERISTICS
View Source
const CHECK = lex.CHECK
View Source
const CLUSTER = lex.CLUSTER
View Source
const COALESCE = lex.COALESCE
View Source
const COLLATE = lex.COLLATE
View Source
const COLLATION = lex.COLLATION
View Source
const COLUMN = lex.COLUMN
View Source
const COLUMNS = lex.COLUMNS
View Source
const COMMENT = lex.COMMENT
View Source
const COMMIT = lex.COMMIT
View Source
const COMMITTED = lex.COMMITTED
View Source
const COMPACT = lex.COMPACT
View Source
const COMPLETE = lex.COMPLETE
View Source
const CONCAT = lex.CONCAT
View Source
const CONCURRENTLY = lex.CONCURRENTLY
View Source
const CONFIGURATION = lex.CONFIGURATION
View Source
const CONFIGURATIONS = lex.CONFIGURATIONS
View Source
const CONFIGURE = lex.CONFIGURE
View Source
const CONFLICT = lex.CONFLICT
View Source
const CONSTRAINT = lex.CONSTRAINT
View Source
const CONSTRAINTS = lex.CONSTRAINTS
View Source
const CONTAINED_BY = lex.CONTAINED_BY
View Source
const CONTAINS = lex.CONTAINS
View Source
const CONVERSION = lex.CONVERSION
View Source
const COPY = lex.COPY
View Source
const COVERING = lex.COVERING
View Source
const CREATE = lex.CREATE
View Source
const CREATEROLE = lex.CREATEROLE
View Source
const CROSS = lex.CROSS
View Source
const CUBE = lex.CUBE
View Source
const CURRENT = lex.CURRENT
View Source
const CURRENT_CATALOG = lex.CURRENT_CATALOG
View Source
const CURRENT_DATE = lex.CURRENT_DATE
View Source
const CURRENT_ROLE = lex.CURRENT_ROLE
View Source
const CURRENT_SCHEMA = lex.CURRENT_SCHEMA
View Source
const CURRENT_TIME = lex.CURRENT_TIME
View Source
const CURRENT_TIMESTAMP = lex.CURRENT_TIMESTAMP
View Source
const CURRENT_USER = lex.CURRENT_USER
View Source
const CYCLE = lex.CYCLE
View Source
const DATA = lex.DATA
View Source
const DATABASE = lex.DATABASE
View Source
const DATABASES = lex.DATABASES
View Source
const DATE = lex.DATE
View Source
const DAY = lex.DAY
View Source
const DEALLOCATE = lex.DEALLOCATE
View Source
const DEC = lex.DEC
View Source
const DECIMAL = lex.DECIMAL
View Source
const DEFAULT = lex.DEFAULT
View Source
const DEFERRABLE = lex.DEFERRABLE
View Source
const DEFERRED = lex.DEFERRED
View Source
const DELETE = lex.DELETE
View Source
const DESC = lex.DESC
View Source
const DISCARD = lex.DISCARD
View Source
const DISTINCT = lex.DISTINCT
View Source
const DO = lex.DO
View Source
const DOMAIN = lex.DOMAIN
View Source
const DOT_DOT = lex.DOT_DOT
View Source
const DOUBLE = lex.DOUBLE
View Source
const DROP = lex.DROP
View Source
const ELSE = lex.ELSE
View Source
const ENCODING = lex.ENCODING
View Source
const END = lex.END
View Source
const ENUM = lex.ENUM
View Source
const ERROR = lex.ERROR
View Source
const ESCAPE = lex.ESCAPE
View Source
const EXCEPT = lex.EXCEPT
View Source
const EXCLUDE = lex.EXCLUDE
View Source
const EXECUTE = lex.EXECUTE
View Source
const EXISTS = lex.EXISTS
View Source
const EXPERIMENTAL = lex.EXPERIMENTAL
View Source
const EXPERIMENTAL_AUDIT = lex.EXPERIMENTAL_AUDIT
View Source
const EXPERIMENTAL_FINGERPRINTS = lex.EXPERIMENTAL_FINGERPRINTS
View Source
const EXPERIMENTAL_RELOCATE = lex.EXPERIMENTAL_RELOCATE
View Source
const EXPERIMENTAL_REPLICA = lex.EXPERIMENTAL_REPLICA
View Source
const EXPIRATION = lex.EXPIRATION
View Source
const EXPLAIN = lex.EXPLAIN
View Source
const EXPORT = lex.EXPORT
View Source
const EXTENSION = lex.EXTENSION
View Source
const EXTRACT = lex.EXTRACT
View Source
const EXTRACT_DURATION = lex.EXTRACT_DURATION
View Source
const FALSE = lex.FALSE
View Source
const FAMILY = lex.FAMILY
View Source
const FCONST = lex.FCONST
View Source
const FETCH = lex.FETCH
View Source
const FETCHTEXT = lex.FETCHTEXT
View Source
const FETCHTEXT_PATH = lex.FETCHTEXT_PATH
View Source
const FETCHVAL = lex.FETCHVAL
View Source
const FETCHVAL_PATH = lex.FETCHVAL_PATH
View Source
const FILES = lex.FILES
View Source
const FILTER = lex.FILTER
View Source
const FIRST = lex.FIRST
View Source
const FLOAT = lex.FLOAT
View Source
const FLOAT4 = lex.FLOAT4
View Source
const FLOAT8 = lex.FLOAT8
View Source
const FLOORDIV = lex.FLOORDIV
View Source
const FOLLOWING = lex.FOLLOWING
View Source
const FOR = lex.FOR
View Source
const FORCE_INDEX = lex.FORCE_INDEX
View Source
const FOREIGN = lex.FOREIGN
View Source
const FROM = lex.FROM
View Source
const FULL = lex.FULL
View Source
const FUNCTION = lex.FUNCTION
View Source
const GLOBAL = lex.GLOBAL
View Source
const GRANT = lex.GRANT
View Source
const GRANTS = lex.GRANTS
View Source
const GREATER_EQUALS = lex.GREATER_EQUALS
View Source
const GREATEST = lex.GREATEST
View Source
const GROUP = lex.GROUP
View Source
const GROUPING = lex.GROUPING
View Source
const GROUPS = lex.GROUPS
View Source
const HASH = lex.HASH
View Source
const HAVING = lex.HAVING
View Source
const HELPTOKEN = lex.HELPTOKEN
View Source
const HIGH = lex.HIGH
View Source
const HISTOGRAM = lex.HISTOGRAM
View Source
const HOUR = lex.HOUR
View Source
const ICONST = lex.ICONST
View Source
const IDENT = lex.IDENT
View Source
const IF = lex.IF
View Source
const IFERROR = lex.IFERROR
View Source
const IFNULL = lex.IFNULL
View Source
const IGNORE_FOREIGN_KEYS = lex.IGNORE_FOREIGN_KEYS
View Source
const ILIKE = lex.ILIKE
View Source
const IMMEDIATE = lex.IMMEDIATE
View Source
const IMPORT = lex.IMPORT
View Source
const IN = lex.IN
View Source
const INCLUDE = lex.INCLUDE
View Source
const INCREMENT = lex.INCREMENT
View Source
const INCREMENTAL = lex.INCREMENTAL
View Source
const INDEX = lex.INDEX
View Source
const INDEXES = lex.INDEXES
View Source
const INET = lex.INET
View Source
const INET_CONTAINED_BY_OR_EQUALS = lex.INET_CONTAINED_BY_OR_EQUALS
View Source
const INET_CONTAINS_OR_EQUALS = lex.INET_CONTAINS_OR_EQUALS
View Source
const INITIALLY = lex.INITIALLY
View Source
const INJECT = lex.INJECT
View Source
const INNER = lex.INNER
View Source
const INSERT = lex.INSERT
View Source
const INT = lex.INT
View Source
const INT2 = lex.INT2
View Source
const INT2VECTOR = lex.INT2VECTOR
View Source
const INT4 = lex.INT4
View Source
const INT64 = lex.INT64
View Source
const INT8 = lex.INT8
View Source
const INTEGER = lex.INTEGER
View Source
const INTERLEAVE = lex.INTERLEAVE
View Source
const INTERSECT = lex.INTERSECT
View Source
const INTERVAL = lex.INTERVAL
View Source
const INTO = lex.INTO
View Source
const INVERTED = lex.INVERTED
View Source
const IS = lex.IS
View Source
const ISERROR = lex.ISERROR
View Source
const ISNULL = lex.ISNULL
View Source
const ISOLATION = lex.ISOLATION
View Source
const JOB = lex.JOB
View Source
const JOBS = lex.JOBS
View Source
const JOIN = lex.JOIN
View Source
const JSON = lex.JSON
View Source
const JSONB = lex.JSONB
View Source
const JSON_ALL_EXISTS = lex.JSON_ALL_EXISTS
View Source
const JSON_SOME_EXISTS = lex.JSON_SOME_EXISTS
View Source
const KEY = lex.KEY
View Source
const KEYS = lex.KEYS
View Source
const KV = lex.KV
View Source
const LANGUAGE = lex.LANGUAGE
View Source
const LAST = lex.LAST
View Source
const LATERAL = lex.LATERAL
View Source
const LC_COLLATE = lex.LC_COLLATE
View Source
const LC_CTYPE = lex.LC_CTYPE
View Source
const LEADING = lex.LEADING
View Source
const LEASE = lex.LEASE
View Source
const LEAST = lex.LEAST
View Source
const LEFT = lex.LEFT
View Source
const LESS = lex.LESS
View Source
const LESS_EQUALS = lex.LESS_EQUALS
View Source
const LEVEL = lex.LEVEL
View Source
const LIKE = lex.LIKE
View Source
const LIMIT = lex.LIMIT
View Source
const LIST = lex.LIST
View Source
const LOCAL = lex.LOCAL
View Source
const LOCALTIME = lex.LOCALTIME
View Source
const LOCALTIMESTAMP = lex.LOCALTIMESTAMP
View Source
const LOCKED = lex.LOCKED
View Source
const LOGIN = lex.LOGIN
View Source
const LOOKUP = lex.LOOKUP
View Source
const LOW = lex.LOW
View Source
const LSHIFT = lex.LSHIFT
View Source
const MATCH = lex.MATCH
View Source
const MATERIALIZED = lex.MATERIALIZED
View Source
const MAXVALUE = lex.MAXVALUE
View Source
const MERGE = lex.MERGE
View Source
const MINUTE = lex.MINUTE
View Source
const MINVALUE = lex.MINVALUE
View Source
const MONTH = lex.MONTH
View Source
const MaxInt = int(MaxUint >> 1)
View Source
const MaxUint = ^uint(0)
View Source
const NAME = lex.NAME
View Source
const NAMES = lex.NAMES
View Source
const NAN = lex.NAN
View Source
const NATURAL = lex.NATURAL
View Source
const NEXT = lex.NEXT
View Source
const NO = lex.NO
View Source
const NOCREATEROLE = lex.NOCREATEROLE
View Source
const NOLOGIN = lex.NOLOGIN
View Source
const NONE = lex.NONE
View Source
const NORMAL = lex.NORMAL
View Source
const NOT = lex.NOT
View Source
const NOTHING = lex.NOTHING
View Source
const NOTNULL = lex.NOTNULL
View Source
const NOT_EQUALS = lex.NOT_EQUALS
View Source
const NOT_LA = lex.NOT_LA
View Source
const NOT_REGIMATCH = lex.NOT_REGIMATCH
View Source
const NOT_REGMATCH = lex.NOT_REGMATCH
View Source
const NOWAIT = lex.NOWAIT
View Source
const NO_INDEX_JOIN = lex.NO_INDEX_JOIN
View Source
const NULL = lex.NULL
View Source
const NULLIF = lex.NULLIF
View Source
const NULLS = lex.NULLS
View Source
const NUMERIC = lex.NUMERIC
View Source
const OF = lex.OF
View Source
const OFF = lex.OFF
View Source
const OFFSET = lex.OFFSET
View Source
const OID = lex.OID
View Source
const OIDS = lex.OIDS
View Source
const OIDVECTOR = lex.OIDVECTOR
View Source
const ON = lex.ON
View Source
const ONLY = lex.ONLY
View Source
const OPERATOR = lex.OPERATOR
View Source
const OPT = lex.OPT
View Source
const OPTION = lex.OPTION
View Source
const OPTIONS = lex.OPTIONS
View Source
const OR = lex.OR
View Source
const ORDER = lex.ORDER
View Source
const ORDINALITY = lex.ORDINALITY
View Source
const OTHERS = lex.OTHERS
View Source
const OUT = lex.OUT
View Source
const OUTER = lex.OUTER
View Source
const OVER = lex.OVER
View Source
const OVERLAPS = lex.OVERLAPS
View Source
const OVERLAY = lex.OVERLAY
View Source
const OWNED = lex.OWNED
View Source
const PARENT = lex.PARENT
View Source
const PARTIAL = lex.PARTIAL
View Source
const PARTITION = lex.PARTITION
View Source
const PARTITIONS = lex.PARTITIONS
View Source
const PASSWORD = lex.PASSWORD
View Source
const PAUSE = lex.PAUSE
View Source
const PHYSICAL = lex.PHYSICAL
View Source
const PLACEHOLDER = lex.PLACEHOLDER
View Source
const PLACING = lex.PLACING
View Source
const PLAN = lex.PLAN
View Source
const PLANS = lex.PLANS
View Source
const POSITION = lex.POSITION
View Source
const POSTFIXOP = lex.POSTFIXOP
View Source
const PRECEDING = lex.PRECEDING
View Source
const PRECISION = lex.PRECISION
View Source
const PREPARE = lex.PREPARE
View Source
const PRESERVE = lex.PRESERVE
View Source
const PRIMARY = lex.PRIMARY
View Source
const PRIORITY = lex.PRIORITY
View Source
const PROCEDURAL = lex.PROCEDURAL
View Source
const PUBLIC = lex.PUBLIC
View Source
const PUBLICATION = lex.PUBLICATION
View Source
const QUERIES = lex.QUERIES
View Source
const QUERY = lex.QUERY
View Source
const RANGE = lex.RANGE
View Source
const RANGES = lex.RANGES
View Source
const READ = lex.READ
View Source
const REAL = lex.REAL
View Source
const RECURSIVE = lex.RECURSIVE
View Source
const REF = lex.REF
View Source
const REFERENCES = lex.REFERENCES
View Source
const REGCLASS = lex.REGCLASS
View Source
const REGIMATCH = lex.REGIMATCH
View Source
const REGNAMESPACE = lex.REGNAMESPACE
View Source
const REGPROC = lex.REGPROC
View Source
const REGPROCEDURE = lex.REGPROCEDURE
View Source
const REGTYPE = lex.REGTYPE
View Source
const REINDEX = lex.REINDEX
View Source
const RELEASE = lex.RELEASE
View Source
const REMOVE_PATH = lex.REMOVE_PATH
View Source
const RENAME = lex.RENAME
View Source
const REPEATABLE = lex.REPEATABLE
View Source
const REPLACE = lex.REPLACE
View Source
const RESET = lex.RESET
View Source
const RESTORE = lex.RESTORE
View Source
const RESTRICT = lex.RESTRICT
View Source
const RESUME = lex.RESUME
View Source
const RETURNING = lex.RETURNING
View Source
const REVOKE = lex.REVOKE
View Source
const RIGHT = lex.RIGHT
View Source
const ROLE = lex.ROLE
View Source
const ROLES = lex.ROLES
View Source
const ROLLBACK = lex.ROLLBACK
View Source
const ROLLUP = lex.ROLLUP
View Source
const ROW = lex.ROW
View Source
const ROWS = lex.ROWS
View Source
const RSHIFT = lex.RSHIFT
View Source
const RULE = lex.RULE
View Source
const SAVEPOINT = lex.SAVEPOINT
View Source
const SCATTER = lex.SCATTER
View Source
const SCHEMA = lex.SCHEMA
View Source
const SCHEMAS = lex.SCHEMAS
View Source
const SCONST = lex.SCONST
View Source
const SCRUB = lex.SCRUB
View Source
const SEARCH = lex.SEARCH
View Source
const SECOND = lex.SECOND
View Source
const SELECT = lex.SELECT
View Source
const SEQUENCE = lex.SEQUENCE
View Source
const SEQUENCES = lex.SEQUENCES
View Source
const SERIAL = lex.SERIAL
View Source
const SERIAL2 = lex.SERIAL2
View Source
const SERIAL4 = lex.SERIAL4
View Source
const SERIAL8 = lex.SERIAL8
View Source
const SERIALIZABLE = lex.SERIALIZABLE
View Source
const SERVER = lex.SERVER
View Source
const SESSION = lex.SESSION
View Source
const SESSIONS = lex.SESSIONS
View Source
const SESSION_USER = lex.SESSION_USER
View Source
const SET = lex.SET
View Source
const SETTING = lex.SETTING
View Source
const SETTINGS = lex.SETTINGS
View Source
const SHARE = lex.SHARE
View Source
const SHOW = lex.SHOW
View Source
const SIMILAR = lex.SIMILAR
View Source
const SIMPLE = lex.SIMPLE
View Source
const SKIP = lex.SKIP
View Source
const SMALLINT = lex.SMALLINT
View Source
const SMALLSERIAL = lex.SMALLSERIAL
View Source
const SNAPSHOT = lex.SNAPSHOT
View Source
const SOME = lex.SOME
View Source
const SPLIT = lex.SPLIT
View Source
const SQL = lex.SQL
View Source
const START = lex.START
View Source
const STATISTICS = lex.STATISTICS
View Source
const STATUS = lex.STATUS
View Source
const STDIN = lex.STDIN
View Source
const STORE = lex.STORE
View Source
const STORED = lex.STORED
View Source
const STORING = lex.STORING
View Source
const STRICT = lex.STRICT
View Source
const STRING = lex.STRING
View Source
const SUBSCRIPTION = lex.SUBSCRIPTION
View Source
const SUBSTRING = lex.SUBSTRING
View Source
const SYMMETRIC = lex.SYMMETRIC
View Source
const SYNTAX = lex.SYNTAX
View Source
const SYSTEM = lex.SYSTEM
View Source
const TABLE = lex.TABLE
View Source
const TABLES = lex.TABLES
View Source
const TEMP = lex.TEMP
View Source
const TEMPLATE = lex.TEMPLATE
View Source
const TEMPORARY = lex.TEMPORARY
View Source
const TESTING_RELOCATE = lex.TESTING_RELOCATE
View Source
const TEXT = lex.TEXT
View Source
const THEN = lex.THEN
View Source
const THROTTLING = lex.THROTTLING
View Source
const TIES = lex.TIES
View Source
const TIME = lex.TIME
View Source
const TIMESTAMP = lex.TIMESTAMP
View Source
const TIMESTAMPTZ = lex.TIMESTAMPTZ
View Source
const TIMETZ = lex.TIMETZ
View Source
const TO = lex.TO
View Source
const TRACE = lex.TRACE
View Source
const TRACING = lex.TRACING
View Source
const TRAILING = lex.TRAILING
View Source
const TRANSACTION = lex.TRANSACTION
View Source
const TREAT = lex.TREAT
View Source
const TRIGGER = lex.TRIGGER
View Source
const TRIM = lex.TRIM
View Source
const TRUE = lex.TRUE
View Source
const TRUNCATE = lex.TRUNCATE
View Source
const TRUSTED = lex.TRUSTED
View Source
const TYPE = lex.TYPE
View Source
const TYPEANNOTATE = lex.TYPEANNOTATE
View Source
const TYPECAST = lex.TYPECAST
View Source
const UMINUS = lex.UMINUS
View Source
const UNBOUNDED = lex.UNBOUNDED
View Source
const UNCOMMITTED = lex.UNCOMMITTED
View Source
const UNION = lex.UNION
View Source
const UNIQUE = lex.UNIQUE
View Source
const UNKNOWN = lex.UNKNOWN
View Source
const UNLOGGED = lex.UNLOGGED
View Source
const UNSPLIT = lex.UNSPLIT
View Source
const UNTIL = lex.UNTIL
View Source
const UPDATE = lex.UPDATE
View Source
const UPSERT = lex.UPSERT
View Source
const USE = lex.USE
View Source
const USER = lex.USER
View Source
const USERS = lex.USERS
View Source
const USING = lex.USING
View Source
const UUID = lex.UUID
View Source
const VALID = lex.VALID
View Source
const VALIDATE = lex.VALIDATE
View Source
const VALUE = lex.VALUE
View Source
const VALUES = lex.VALUES
View Source
const VARBIT = lex.VARBIT
View Source
const VARCHAR = lex.VARCHAR
View Source
const VARIADIC = lex.VARIADIC
View Source
const VARYING = lex.VARYING
View Source
const VIEW = lex.VIEW
View Source
const VIRTUAL = lex.VIRTUAL
View Source
const WHEN = lex.WHEN
View Source
const WHERE = lex.WHERE
View Source
const WINDOW = lex.WINDOW
View Source
const WITH = lex.WITH
View Source
const WITHIN = lex.WITHIN
View Source
const WITHOUT = lex.WITHOUT
View Source
const WITH_LA = lex.WITH_LA
View Source
const WORK = lex.WORK
View Source
const WRITE = lex.WRITE
View Source
const YEAR = lex.YEAR
View Source
const ZONE = lex.ZONE

Variables

View Source
var AllHelp = func(h map[string]HelpMessageBody) string {

	cmds := make(map[string][]string)
	for c, details := range h {
		if details.Category == "" {
			continue
		}
		cmds[details.Category] = append(cmds[details.Category], c)
	}

	// Ensure the result is deterministic.
	var categories []string
	for c, l := range cmds {
		categories = append(categories, c)
		sort.Strings(l)
	}
	sort.Strings(categories)

	// Compile the final help index.
	var buf bytes.Buffer
	w := tabwriter.NewWriter(&buf, 0, 0, 1, ' ', 0)
	for _, cat := range categories {
		fmt.Fprintf(w, "%s:\n", strings.Title(cat))
		for _, item := range cmds[cat] {
			fmt.Fprintf(w, "\t\t%s\t%s\n", item, h[item].ShortDescription)
		}
		fmt.Fprintln(w)
	}
	_ = w.Flush()
	return buf.String()
}(helpMessages)

AllHelp contains an overview of all statements with help messages. For example, displayed in the CLI shell with \h without additional parameters.

View Source
var DocsURLBase = "https://www.cockroachlabs.com/docs/" + "v20.1.11"

DocsURLBase is the root URL for the version of the docs associated with this binary.

View Source
var HelpMessages = func(h map[string]HelpMessageBody) map[string]HelpMessageBody {
	appendSeeAlso := func(newItem, prevItems string) string {

		if prevItems != "" {
			return newItem + "\n  " + prevItems
		}
		return newItem
	}
	reformatSeeAlso := func(seeAlso string) string {
		return strings.Replace(
			strings.Replace(seeAlso, ", ", "\n  ", -1),
			"WEBDOCS", DocsURLBase, -1)
	}
	srcMsg := h["<SOURCE>"]
	srcMsg.SeeAlso = reformatSeeAlso(strings.TrimSpace(srcMsg.SeeAlso))
	selectMsg := h["<SELECTCLAUSE>"]
	selectMsg.SeeAlso = reformatSeeAlso(strings.TrimSpace(selectMsg.SeeAlso))
	for k, m := range h {
		m = h[k]
		m.ShortDescription = strings.TrimSpace(m.ShortDescription)
		m.Text = strings.TrimSpace(m.Text)
		m.SeeAlso = strings.TrimSpace(m.SeeAlso)

		if strings.Contains(m.Text, "<source>") && k != "<SOURCE>" {
			m.Text = strings.TrimSpace(m.Text) + "\n\n" + strings.TrimSpace(srcMsg.Text)
			m.SeeAlso = appendSeeAlso(srcMsg.SeeAlso, m.SeeAlso)
		}

		if strings.Contains(m.Text, "<selectclause>") && k != "<SELECTCLAUSE>" {
			m.Text = strings.TrimSpace(m.Text) + "\n\n" + strings.TrimSpace(selectMsg.Text)
			m.SeeAlso = appendSeeAlso(selectMsg.SeeAlso, m.SeeAlso)
		}

		if strings.Contains(m.Text, "<tablename>") {
			m.SeeAlso = appendSeeAlso("SHOW TABLES", m.SeeAlso)
		}
		m.SeeAlso = reformatSeeAlso(m.SeeAlso)
		h[k] = m
	}
	return h
}(helpMessages)

HelpMessages is the registry of all help messages, keyed by the top-level statement that they document. The key is intended for use via the \h client-side command.

Functions

func DocsURL

func DocsURL(pageName string) string

DocsURL generates the URL to pageName in the version of the docs associated with this binary.

func EndsInSemicolon

func EndsInSemicolon(sql string) bool

EndsInSemicolon returns true if the last lexical token is a semicolon.

func LastLexicalToken

func LastLexicalToken(sql string) (lastTok int, ok bool)

LastLexicalToken returns the last lexical token. If the string has no lexical tokens, returns 0 and ok=false.

func ParseExpr

func ParseExpr(sql string) (tree.Expr, error)

ParseExpr is a short-hand for parseExprs([]string{sql})

func ParseExprs

func ParseExprs(sql []string) (tree.Exprs, error)

ParseExprs is a short-hand for parseExprs(sql)

func ParseQualifiedTableName

func ParseQualifiedTableName(sql string) (*tree.TableName, error)

ParseQualifiedTableName parses a SQL string of the form `[ database_name . ] [ schema_name . ] table_name`.

func ParseTableName

func ParseTableName(sql string) (*tree.UnresolvedObjectName, error)

ParseTableName parses a table name.

func ParseType

func ParseType(sql string) (*types.T, error)

ParseType parses a column type.

func RunShowSyntax

func RunShowSyntax(
	ctx context.Context,
	stmt string,
	report func(ctx context.Context, field, msg string),
	reportErr func(ctx context.Context, err error),
)

RunShowSyntax analyzes the syntax and reports its structure as data for the client. Even an error is reported as data.

Since errors won't propagate to the client as an error, but as a result, the usual code path to capture and record errors will not be triggered. Instead, the caller can pass a reportErr closure to capture errors instead. May be nil.

func SplitFirstStatement

func SplitFirstStatement(sql string) (pos int, ok bool)

SplitFirstStatement returns the length of the prefix of the string up to and including the first semicolon that separates statements. If there is no semicolon, returns ok=false.

Types

type HelpMessage

type HelpMessage struct {
	// Command is set if the message is about a statement.
	Command string
	// Function is set if the message is about a built-in function.
	Function string

	// HelpMessageBody contains the details of the message.
	HelpMessageBody
}

HelpMessage describes a contextual help message.

func (*HelpMessage) Format

func (h *HelpMessage) Format(w io.Writer)

Format prints out details about the message onto the specified output stream.

func (*HelpMessage) String

func (h *HelpMessage) String() string

String implements the fmt.String interface.

type HelpMessageBody

type HelpMessageBody struct {
	Category         string
	ShortDescription string
	Text             string
	SeeAlso          string
}

HelpMessageBody defines the body of a help text. The messages are structured to facilitate future help navigation functionality.

type Parser

type Parser struct {
	// contains filtered or unexported fields
}

Parser wraps a scanner, parser and other utilities present in the parser package.

func (*Parser) Parse

func (p *Parser) Parse(sql string) (Statements, error)

Parse parses the sql and returns a list of statements.

func (*Parser) ParseWithInt

func (p *Parser) ParseWithInt(sql string, nakedIntType *types.T) (Statements, error)

ParseWithInt parses a sql statement string and returns a list of Statements. The INT token will result in the specified TInt type.

type Statement

type Statement struct {
	// AST is the root of the AST tree for the parsed statement.
	AST tree.Statement

	// SQL is the original SQL from which the statement was parsed. Note that this
	// is not appropriate for use in logging, as it may contain passwords and
	// other sensitive data.
	SQL string

	// NumPlaceholders indicates the number of arguments to the statement (which
	// are referenced through placeholders). This corresponds to the highest
	// argument position (i.e. the x in "$x") that appears in the query.
	//
	// Note: where there are "gaps" in the placeholder positions, this number is
	// based on the highest position encountered. For example, for `SELECT $3`,
	// NumPlaceholders is 3. These cases are malformed and will result in a
	// type-check error.
	NumPlaceholders int

	// NumAnnotations indicates the number of annotations in the tree. It is equal
	// to the maximum annotation index.
	NumAnnotations tree.AnnotationIdx
}

Statement is the result of parsing a single statement. It contains the AST node along with other information.

func ParseOne

func ParseOne(sql string) (Statement, error)

ParseOne parses a sql statement string, ensuring that it contains only a single statement, and returns that Statement. ParseOne will always interpret the INT and SERIAL types as 64-bit types, since this is used in various internal-execution paths where we might receive bits of SQL from other nodes. In general, we expect that all user-generated SQL has been run through the ParseWithInt() function.

type Statements

type Statements []Statement

Statements is a list of parsed statements.

func Parse

func Parse(sql string) (Statements, error)

Parse parses a sql statement string and returns a list of Statements.

func (Statements) String

func (stmts Statements) String() string

String returns the AST formatted as a string.

func (Statements) StringWithFlags

func (stmts Statements) StringWithFlags(flags tree.FmtFlags) string

StringWithFlags returns the AST formatted as a string (with the given flags).

type TokenString

type TokenString struct {
	TokenID int32
	Str     string
}

TokenString is the unit value returned by Tokens.

func Tokens

func Tokens(sql string) (tokens []TokenString, ok bool)

Tokens decomposes the input into lexical tokens.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL