# Grafana LLM App (Public Preview)
A Grafana plugin designed to centralize access to LLMs, providing authentication, proxying, streaming, and custom extensions. Installing this plugin will enable various pieces of LLM-based functionality throughout Grafana.
Note: The Grafana LLM App plugin is currently in public preview. Grafana Labs offers support on a best-effort basis, and there might be breaking changes before the feature is generally available.
## Install the plugin on Grafana Cloud
Prerequisite:
- Any Grafana Cloud environment (including the free tier)
Steps:
- In Grafana Cloud, click Administration > Plugins and data > Plugins in the side navigation menu.
- Browse or search for the LLM plugin and click to open it.
- On the Configuration tab, select "Enable OpenAI access via Grafana".
- Click to permit Grafana to share limited data with OpenAI's API (not for training, and only to provide these features).
- Click Save settings.
If you prefer, you may configure your own API authentication from supported LLM providers, including OpenAI and Azure. With this option, the LLM app securely stores API keys for you.
## Install the plugin directly
To install this plugin, use the `GF_INSTALL_PLUGINS` environment variable when running Grafana:

```bash
GF_INSTALL_PLUGINS=grafana-llm-app
```

or alternatively install it using the Grafana CLI:

```bash
grafana cli plugins install grafana-llm-app
```
The plugin can then be configured either in the UI or using provisioning, as shown below.
## Provisioning this plugin
To provision this plugin, set the following environment variable when running Grafana:

```bash
OPENAI_API_KEY=sk-...
```

and use the following provisioning file (e.g. in `/etc/grafana/provisioning/plugins/grafana-llm-app`, when running in Docker):
```yaml
apiVersion: 1

apps:
  - type: 'grafana-llm-app'
    disabled: false
    jsonData:
      openAI:
        url: https://api.openai.com
    secureJsonData:
      openAIKey: $OPENAI_API_KEY
```
### Using Azure OpenAI
To provision the plugin to use Azure OpenAI, use settings similar to this:
```yaml
apiVersion: 1

apps:
  - type: 'grafana-llm-app'
    disabled: false
    jsonData:
      openAI:
        provider: azure
        url: https://<resource>.openai.azure.com
        azureModelMapping:
          - ["gpt-3.5-turbo", "gpt-35-turbo"]
    secureJsonData:
      openAIKey: $OPENAI_API_KEY
```
where:

- `<resource>` is your Azure OpenAI resource name
- the `azureModelMapping` field contains `[model, deployment]` pairs, so that features know which Azure deployment to use in place of each model they request
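With this mapping in place, client code keeps requesting the OpenAI model name and the plugin resolves it to the mapped Azure deployment. A minimal sketch using the `llms` helpers described later in this README (the prompt content is illustrative):

```tsx
import { llms } from '@grafana/experimental';

// Request completions by OpenAI model name; with the provisioning above, the
// plugin substitutes the 'gpt-35-turbo' Azure deployment for 'gpt-3.5-turbo'.
export function explain(text: string) {
  return llms.openai.streamChatCompletions({
    model: 'gpt-3.5-turbo', // resolved server-side via azureModelMapping
    messages: [{ role: 'user', content: `Explain this: ${text}` }],
  });
}
```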
## Provisioning vector services
The vector services of the plugin allow some AI-based features (initially, the PromQL query advisor) to use semantic search to send better context to LLMs (and improve responses). Configuration is in roughly three parts:
- 'global' vector settings:
  - `enabled` - whether to enable or disable vector services overall
  - `model` - the name of the model to use to calculate embeddings for searches. This must match the model used when storing the data, or the embeddings will be meaningless.
- 'embedding' vector settings (`embed`):
  - `type` - the type of embedding service, either `openai` or `grafana/vectorapi` to use Grafana's own vector API (recommended if you're just starting out)
  - `grafanaVectorAPI`, if `type` is `grafana/vectorapi`, with keys:
    - `url` - the URL of the Grafana VectorAPI instance
    - `authType` - the type of authentication to use, either `no-auth` or `basic-auth`
    - `basicAuthUser` - the username to use if `authType` is `basic-auth`
- 'store' vector settings (`store`):
  - `type` - the type of vector store to connect to. We recommend starting out with `grafana/vectorapi` to use Grafana's own vector API for a quick start. We also support `qdrant` for Qdrant.
  - `grafanaVectorAPI`, if `type` is `grafana/vectorapi`, with keys:
    - `url` - the URL of the Grafana VectorAPI instance
    - `authType` - the type of authentication to use, either `no-auth` or `basic-auth`
    - `basicAuthUser` - the username to use if `authType` is `basic-auth`
  - `qdrant`, if `type` is `qdrant`, with keys:
    - `address` - the address of the Qdrant server. Note that this uses a gRPC connection.
    - `secure` - boolean, whether to use a secure connection. If you're using a secure connection, you can set the `qdrantApiKey` field in `secureJsonData` to provide an API key with each request.
Note:

- Currently Azure OpenAI is not supported as an embedder.
- The Grafana Vector API instances used for `embed` and `store` can optionally be different.
- If you want to enable the PromQL Query Advisor, set up the Grafana vector API - we'll walk you through loading the data you need for that feature. If you're interested in building your own vector-based features on the Grafana platform, we also support OpenAI embeddings and Qdrant.
### Grafana VectorAPI Store + Grafana VectorAPI Embedder example
```yaml
apps:
  - type: grafana-llm-app
    jsonData:
      openAI:
        provider: openai
        url: https://api.openai.com
        organizationId: $OPENAI_ORGANIZATION_ID
      # openAI:
      #   provider: azure
      #   url: https://<resource>.openai.azure.com
      #   azureModelMapping:
      #     - ["gpt-3.5-turbo", "gpt-35-turbo"]
      vector:
        enabled: true
        model: BAAI/bge-small-en-v1.5
        embed:
          type: grafana/vectorapi
          grafanaVectorAPI:
            url: <vectorapi-url> # e.g. http://localhost:8889
            authType: no-auth
            # authType: basic-auth
            # basicAuthUser: <user>
        store:
          type: grafana/vectorapi
          grafanaVectorAPI:
            url: <vectorapi-url> # e.g. http://localhost:8889
            authType: no-auth
            # authType: basic-auth
            # basicAuthUser: <user>
    secureJsonData:
      openAIKey: $OPENAI_API_KEY
      # openAIKey: $AZURE_OPENAI_API_KEY
      # vectorEmbedderBasicAuthPassword: $VECTOR_EMBEDDER_BASIC_AUTH_PASSWORD
      # vectorStoreBasicAuthPassword: $VECTOR_STORE_BASIC_AUTH_PASSWORD
```
### OpenAI Embedder + Qdrant Store example
```yaml
apiVersion: 1

apps:
  - type: grafana-llm-app
    jsonData:
      openAI:
        provider: openai
        url: https://api.openai.com
        organizationId: $OPENAI_ORGANIZATION_ID
      vector:
        enabled: true
        model: text-embedding-ada-002
        embed:
          type: openai
        store:
          type: qdrant
          qdrant:
            address: <qdrant-grpc-address> # e.g. localhost:6334
    secureJsonData:
      openAIKey: $OPENAI_API_KEY
```
- Note: the `openai` embed type automatically uses the settings from the `openAI` section.
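Once vector services are provisioned, frontend code can query them through the `llms.vector` helpers in `@grafana/experimental` (see the next section for adding the dependency). A minimal sketch - the request and result field names (`topK`, `payload`, `score`) and the collection name are assumptions here, so check the package's types for the exact shape:

```tsx
import { llms } from '@grafana/experimental';

// Search a provisioned vector collection for documents similar to `query`.
export async function findSimilar(query: string) {
  // Skip the search entirely if vector services are disabled.
  const enabled = await llms.vector.enabled();
  if (!enabled) {
    return [];
  }
  const results = await llms.vector.search({
    query,
    collection: 'my-collection', // hypothetical collection name
    topK: 5,
  });
  // Each result carries the stored payload plus a similarity score.
  return results.map((r) => ({ score: r.score, payload: r.payload }));
}
```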
## Adding LLM features to your plugin or Grafana core
To make use of this plugin when adding LLM-based features, you can use the helper functions in the `@grafana/experimental` package.

First, add the correct version of `@grafana/experimental` to your dependencies in `package.json`:
```json
{
  "dependencies": {
    "@grafana/experimental": "1.7.0"
  }
}
```
Then in your components you can use the `llms` object from `@grafana/experimental` like so:
```tsx
import React, { useState } from 'react';
import { useAsync } from 'react-use';
import { llms } from '@grafana/experimental';
import { Button, Input, Spinner } from '@grafana/ui';

const MyComponent = (): JSX.Element | null => {
  const [input, setInput] = useState('');
  const [message, setMessage] = useState('');
  const [reply, setReply] = useState('');

  const { loading, error } = useAsync(async () => {
    // Check that the LLM plugin is enabled and configured before making requests.
    const enabled = await llms.openai.enabled();
    if (!enabled) {
      return false;
    }
    if (message === '') {
      return;
    }
    // Stream the completions. Each element is the next stream chunk.
    const stream = llms.openai
      .streamChatCompletions({
        model: 'gpt-3.5-turbo',
        messages: [
          { role: 'system', content: 'You are a cynical assistant.' },
          { role: 'user', content: message },
        ],
      })
      // Accumulate the stream chunks into a single growing string.
      .pipe(llms.openai.accumulateContent());
    // Subscribe to the stream and update the state for each returned value.
    return stream.subscribe(setReply);
  }, [message]);

  if (error) {
    // TODO: handle errors.
    return null;
  }

  return (
    <div>
      <Input value={input} onChange={(e) => setInput(e.currentTarget.value)} placeholder="Enter a message" />
      <br />
      <Button type="submit" onClick={() => setMessage(input)}>
        Submit
      </Button>
      <br />
      <div>{loading ? <Spinner /> : reply}</div>
    </div>
  );
};
```
The `messages` parameter is the same as OpenAI's concept of messages.
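If you don't need to stream tokens as they arrive, the package also exposes a promise-based `chatCompletions` helper. A minimal sketch, assuming the response mirrors OpenAI's chat completions shape (`choices[0].message.content`):

```tsx
import { llms } from '@grafana/experimental';

// Request a complete (non-streamed) chat completion and return its text.
export async function askOnce(message: string): Promise<string> {
  const response = await llms.openai.chatCompletions({
    model: 'gpt-3.5-turbo',
    messages: [
      { role: 'system', content: 'You are a cynical assistant.' },
      { role: 'user', content: message },
    ],
  });
  return response.choices[0].message.content;
}
```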
The `.subscribe` method can take a few different forms. The callback form shown here is the most concise. Another form accepts an observer object with callbacks for specific events, e.g. `error` or `complete`, which can be useful if you want to perform specific UI actions, like hiding a loading indicator once the stream completes.
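For example, a sketch of the observer form (standard RxJS), assuming the same `stream` and `setReply` as in the component above; `setStreamDone` is a hypothetical state setter:

```tsx
const subscription = stream.subscribe({
  next: (reply) => setReply(reply), // called with each accumulated chunk
  error: (err) => console.error('stream error', err),
  complete: () => setStreamDone(true), // hypothetical: hide the loading indicator
});

// Remember to clean up when the component unmounts:
// subscription.unsubscribe();
```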
## Developing this plugin
### Backend
- Update Grafana plugin SDK for Go dependency to the latest minor version:

  ```bash
  go get -u github.com/grafana/grafana-plugin-sdk-go
  go mod tidy
  ```

- Build backend plugin binaries for Linux, Windows and Darwin:

  ```bash
  mage -v
  ```

- List all available Mage targets for additional commands:

  ```bash
  mage -l
  ```
### Frontend
- Install dependencies:

  ```bash
  npm install
  ```

- Build plugin in development mode and run in watch mode:

  ```bash
  npm run dev
  ```

- Build plugin in production mode:

  ```bash
  npm run build
  ```

- Run the tests (using Jest):

  ```bash
  # Runs the tests and watches for changes, requires git init first
  npm run test

  # Exits after running all the tests
  npm run test:ci
  ```

- Spin up a Grafana instance and run the plugin inside it (using Docker):

  ```bash
  npm run server
  ```

- Run the E2E tests (using Cypress):

  ```bash
  # Spins up a Grafana instance first that we test against
  npm run server

  # Starts the tests
  npm run e2e
  ```

- Run the linter:

  ```bash
  npm run lint

  # or

  npm run lint:fix
  ```
## Developing with the Example App
The LLM example app can be a quick way to test out changes to the LLM plugin.
To use the example app in conjunction with the LLM plugin:
- Clone the llm example app
- Update the following fields in `docker-compose.yaml` in the llm example app:
  - Comment out `GF_INSTALL_PLUGINS: grafana-llm-app`
  - Add the following volume: `<some-parent-path>/grafana-llm-app/dist:/var/lib/grafana/plugins/grafana-llm-app`
- Follow the instructions in the llm example app to run the app
## Release process
### Plugin Release
- Bump the version in `package.json` (e.g., 0.2.0 to 0.2.1)
- Add notes to the changelog describing the changes since the last release
- Merge the PR for a branch containing those changes into main
- Go to Drone and identify the build corresponding to the merge into main
- Promote that build to the 'publish' target
### llmclient Release
- Push a new tag to the repo (e.g., `git tag -a llmclient/v0.X.X -m "llmclient v0.X.X release"`)