Why Go
- Case Studies
  
  Common problems companies solve with Go
- Use Cases
  
  Stories about how and why companies use Go
- Security Policy
  
  How Go can help keep you secure by default
Learn
Docs
- Effective Go
  
  Tips for writing clear, performant, and idiomatic Go code
- Go User Manual
  
  A complete introduction to building software with Go
- Standard library
  
  Reference documentation for Go's standard library
- Release Notes
  
  Learn what's new in each Go release
Packages
Community
- Recorded Talks
  
  Videos from prior events
- Meetups
  
  Meet other local Go developers
- Conferences
  
  Learn and network with Go developers from around the world
- Go blog
  
  The Go project's official blog.
- Go project
  
  Get help and stay informed from Go
- Get connected

eval

command

v0.4.19 Latest Latest Go to latest Published: Jan 28, 2025 License: MIT Imports: 11 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/go-go-golems/pinocchio

README ¶

I want an eval tool for my geppetto prompts:

Input:

eval dataset json file
prompt template

Output:

set of eval metrics

dataset + template -> llm calls -> compute accuracy -> eval results

step 0

create a glazed command for evals
generate mock rows for eval results
wrap as command line tool

step 1

load a eval data set from eval.json
- array of objects
- each object:
  - input: hash[string]interface{}
  - golden answer: interface{}
iterate over each entry in eval.json
load a prompt from complaint.yaml
interpolate the complaint.yaml command

Running the actual LLM inference

run it
- load the API key, etc...
- create the chat step
- get the step result
- store the metadata in the result json

Postprocessing the LLM response

store the answer
- store the LLM metadata
- store the date
- give it a unique UUID

go run ./cmd/eval --dataset eval.json --command complaint.yaml

step 2

run a grading function against the LLM answer
- take a javascript script grading
compute a accuracy score

go run ./cmd/eval --dataset eval.json --command complaint.yaml --scoring score.js

step 3

REST API
web ui (braintrust inspired)
- make it cancellable when pressing Ctrl-C
- show full conversation when expanding
- rerun a single conversation and get streaming completion
- import/export datasets
- import/export/manage prompts
- log + monitoring of testruns
- streaming display of running datasets
- edit prompt and save new revisions
- switch between different versions and compare results and metrics and accuracy

features

caching of inference

Documentation ¶

The Go Gopher

There is no documentation for this package.

Source Files ¶

View all Source files

main.go

Directories ¶

Path	Synopsis
eval
serve templ: version: v0.2.793	templ: version: v0.2.793