lexer

package

v0.11.0 Latest Latest Go to latest Published: Jun 8, 2024 License: MIT Imports: 4 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/AntoninoAdornetto/issue-summoner

Links

Open Source Insights

Documentation ¶

Overview ¶

The goal for our lexer is not to create a compiler or intrepreter. It's primary purpose is to scan the raw source code as a series of characters and group them into tokens. The implementation will ignore many tokens that other scanner/lexers would not.

The types of tokens that we are concered about: Single line comment tokens (such as // for languages that have adopted c like comment syntax or # for python) Multi line comment tokens (such as /* for c adopted languages and ”' & """ for python) End of file tokens

We should check for string tokens, but we do not need to create the token or store the lexeme. The reason for checking strings is so we can prevent certain edge cases from happening. One example could be where a string contains characters that could be denoted as a comment. For C like languages that could be a string such as "/*" or "//". We don't want the lexer to create tokens for strings that may contain comment syntax.

Each language that is supported will need to satisfy the LexingManager interface and support tokenizing methods for Comments and Strings. This will allow each implementation to utilize the comment notation that is specific to a language.

Index ¶

Constants
Variables
func IsAdoptedFromC(ext string) bool
type CLexer
type Comment
type Lexer
- func NewLexer(src []byte, fileName string) (*Lexer, error)
- func (l *Lexer) AnalyzeTokens() ([]Token, error)
type LexingManager
- func NewLexingManager(ext string) (LexingManager, error)
type Token
- func (t *Token) ParseMultiLineCommentToken(annotation []byte, trim func(r rune) bool) Comment
- func (t *Token) ParseSingleLineCommentToken(annotation []byte, trim func(r rune) bool) Comment
type TokenType

Constants ¶

View Source

const (
	SINGLE_LINE_COMMENT = iota
	MULTI_LINE_COMMENT
	STRING
	EOF
)

Variables ¶

View Source

var (
	ASTERISK       byte = '*'
	BACK_TICK      byte = '`'
	BACKWARD_SLASH byte = '\\'
	FORWARD_SLASH  byte = '/'
	HASH           byte = '#'
	QUOTE          byte = '\''
	DOUBLE_QUOTE   byte = '"'
	NEWLINE        byte = '\n'
	TAB            byte = '\t'
	WHITESPACE     byte = ' '
)

Functions ¶

func IsAdoptedFromC ¶

func IsAdoptedFromC(ext string) bool

Types ¶

type CLexer ¶

type CLexer struct{}

func (*CLexer) AnalyzeToken ¶

func (cl *CLexer) AnalyzeToken(lex *Lexer) error

func (*CLexer) Comment ¶

func (cl *CLexer) Comment(lex *Lexer) error

func (*CLexer) MultiLineComment ¶

func (cl *CLexer) MultiLineComment(lex *Lexer) error

func (*CLexer) ParseCommentTokens ¶

func (cl *CLexer) ParseCommentTokens(lex *Lexer, annotation []byte) ([]Comment, error)

func (*CLexer) SingleLineComment ¶

func (cl *CLexer) SingleLineComment(lex *Lexer) error

func (*CLexer) String ¶

func (cl *CLexer) String(lex *Lexer, delim byte) error

type Comment ¶

type Comment struct {
	Title          []byte
	Description    []byte
	TokenIndex     int
	Source         []byte
	SourceFileName string
}

func (*Comment) Prepare ¶

func (c *Comment) Prepare(fileName string, index int)

func (*Comment) Push ¶ added in v0.10.3

func (c *Comment) Push(comments *[]Comment, fileName string, tokenIndex int)

func (*Comment) Validate ¶

func (c *Comment) Validate() bool

type Lexer ¶

type Lexer struct {
	Source   []byte
	FileName string
	Tokens   []Token
	Start    int
	Current  int
	Line     int
	Manager  LexingManager
}

func NewLexer ¶

func NewLexer(src []byte, fileName string) (*Lexer, error)

func (*Lexer) AnalyzeTokens ¶

func (l *Lexer) AnalyzeTokens() ([]Token, error)

type LexingManager ¶

type LexingManager interface {
	AnalyzeToken(lexer *Lexer) error
	String(lexer *Lexer, delim byte) error
	Comment(lexer *Lexer) error
	ParseCommentTokens(lexer *Lexer, annotation []byte) ([]Comment, error)
}

func NewLexingManager ¶

func NewLexingManager(ext string) (LexingManager, error)

type Token ¶

type Token struct {
	TokenType      TokenType
	Lexeme         []byte
	Line           int
	StartByteIndex int
	EndByteIndex   int
}

func (*Token) ParseMultiLineCommentToken ¶ added in v0.10.3

func (t *Token) ParseMultiLineCommentToken(annotation []byte, trim func(r rune) bool) Comment

func (*Token) ParseSingleLineCommentToken ¶ added in v0.10.3

func (t *Token) ParseSingleLineCommentToken(annotation []byte, trim func(r rune) bool) Comment

type TokenType ¶

type TokenType = int

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL