command
Version:
v1.0.0
Opens a new window with list of versions in this module.
Published: Sep 20, 2024
License: Apache-2.0
Opens a new window with license information.
Imports: 7
Opens a new window with list of imports.
Imported by: 0
Opens a new window with list of known importers.
README
¶
BPE model
This demonstrates how to train a tokenizer from scratch using BPE model.
It trains a tokenizer for Esperanto language from scratch using data from
input
folder and saves vocab
and merges
into model
folder.
To run:
# run training
go run . -mode=train
# run test
go run . -mode=test
Documentation
¶
There is no documentation for this package.
Source Files
¶
Click to show internal directories.
Click to hide internal directories.