command
Version:
v0.4.19
Opens a new window with list of versions in this module.
Published: Jan 28, 2025
License: MIT
Opens a new window with license information.
Imports: 11
Opens a new window with list of imports.
Imported by: 0
Opens a new window with list of known importers.
README
¶
I want an eval tool for my geppetto prompts:
Input:
- eval dataset json file
- prompt template
Output:
dataset + template -> llm calls -> compute accuracy -> eval results
step 0
- create a glazed command for evals
- generate mock rows for eval results
- wrap as command line tool
step 1
-
load a eval data set from eval.json
- array of objects
- each object:
- input: hash[string]interface{}
- golden answer: interface{}
-
iterate over each entry in eval.json
-
load a prompt from complaint.yaml
-
interpolate the complaint.yaml command
Running the actual LLM inference
- run it
- load the API key, etc...
- create the chat step
- get the step result
- store the metadata in the result json
Postprocessing the LLM response
- store the answer
- store the LLM metadata
- store the date
- give it a unique UUID
go run ./cmd/eval --dataset eval.json --command complaint.yaml
step 2
- run a grading function against the LLM answer
- take a javascript script grading
- compute a accuracy score
go run ./cmd/eval --dataset eval.json --command complaint.yaml --scoring score.js
step 3
features
Documentation
¶
There is no documentation for this package.
Source Files
¶
Directories
¶
Click to show internal directories.
Click to hide internal directories.