cl100kbase

package
v0.9.1 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 19, 2023 License: MIT Imports: 4 Imported by: 0

Documentation

Overview

Package cl100kbase registers the "cl100k_base" tokenizer with gotoken. To use this tokenizer:

import (
    "github.com/peterheb/gotoken"
    _ "github.com/peterheb/gotoken/cl100kbase"
)
...
tok, err := gotoken.GetTokenizer("cl100k_base")

This file was generated from the following data:

Index

Constants

View Source
const (
	EndOfText   = "<|endoftext|>"
	FIMPrefix   = "<|fim_prefix|>"
	FIMMiddle   = "<|fim_middle|>"
	FIMSuffix   = "<|fim_suffix|>"
	IMStart     = "<|im_start|>" // these are documented in the tiktoken README
	IMEnd       = "<|im_end|>"   // but aren't in the Python code
	EndOfPrompt = "<|endofprompt|>"
)

These special tokens are defined by this encoding.

Variables

This section is empty.

Functions

This section is empty.

Types

This section is empty.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL