llamacpphtmld

command module

v1.1.0 (latest) Published: Apr 8, 2023 License: MIT Imports: 11 Imported by: 0

A web interface and API for the LLaMA large language AI model, based on the llama.cpp runtime.

All configuration should be supplied as environment variables:

LCH_MODEL_PATH=/srv/llama/ggml-vicuna-13b-4bit-rev1.bin \
	LCH_NET_BIND=:8090 \
	LCH_SIMULTANEOUS_REQUESTS=1 \
	./llamacpphtmld

Use the GOMAXPROCS environment variable to control how many threads the llama.cpp engine uses.

The generate endpoint live-streams new tokens into an existing conversation until the LLM stops naturally.

Usage:

	curl -v -X POST -d '{"Content": "The quick brown fox"}' 'http://localhost:8090/api/v1/generate'

You can optionally supply ConversationID and APIKey string parameters; the server does not currently use them.
You can optionally supply a MaxTokens integer parameter to cap the number of tokens the LLM generates.
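
As a rough Go equivalent of the curl call above, the sketch below posts the same JSON body and copies the response to standard output as it arrives. The generateRequest struct, the Content-Type header, the five-minute timeout, and the assumption that the response body can be read as a plain stream of text are illustrative choices, not confirmed details of the server.

```go
package main

import (
	"bytes"
	"context"
	"encoding/json"
	"fmt"
	"io"
	"net/http"
	"os"
	"time"
)

// generateRequest mirrors the JSON parameters named in the text.
// The Go type itself is hypothetical; it is not the server's own definition.
type generateRequest struct {
	Content        string `json:"Content"`
	ConversationID string `json:"ConversationID,omitempty"`
	APIKey         string `json:"APIKey,omitempty"`
	MaxTokens      int    `json:"MaxTokens,omitempty"`
}

// encodeRequest marshals the request into the JSON body sent to the server.
func encodeRequest(r generateRequest) ([]byte, error) {
	return json.Marshal(r)
}

// streamGenerate POSTs r to the generate endpoint under baseURL and copies
// the response body to w chunk by chunk as data becomes available.
func streamGenerate(ctx context.Context, baseURL string, r generateRequest, w io.Writer) error {
	body, err := encodeRequest(r)
	if err != nil {
		return err
	}
	req, err := http.NewRequestWithContext(ctx, http.MethodPost,
		baseURL+"/api/v1/generate", bytes.NewReader(body))
	if err != nil {
		return err
	}
	// Assumption: the server accepts a JSON content type for this body.
	req.Header.Set("Content-Type", "application/json")

	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return err
	}
	defer resp.Body.Close()

	if resp.StatusCode != http.StatusOK {
		return fmt.Errorf("unexpected status: %s", resp.Status)
	}
	_, err = io.Copy(w, resp.Body)
	return err
}

func main() {
	// Generation can be slow, so allow a generous (arbitrary) deadline.
	ctx, cancel := context.WithTimeout(context.Background(), 5*time.Minute)
	defer cancel()

	req := generateRequest{Content: "The quick brown fox", MaxTokens: 64}
	if err := streamGenerate(ctx, "http://localhost:8090", req, os.Stdout); err != nil {
		fmt.Fprintln(os.Stderr, "generate failed:", err)
	}
}
```

Cancelling the context is one way to stop reading early on the client side; whether the server also stops generating at that point is not stated in the text.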

MIT

New web interface style that is more mobile-friendly and shows API status messages
Add default example prompt
Use a longer n_ctx by default
