WebAssembly Components providing a unified API for various LLM providers.
There are 12 published WASM files for each release:
| Name | Description |
|---|---|
| `golem-llm-anthropic.wasm` | LLM implementation for Anthropic AI, using custom Golem specific durability features |
| `golem-llm-ollama.wasm` | LLM implementation for Ollama, using custom Golem specific durability features |
| `golem-llm-grok.wasm` | LLM implementation for xAI (Grok), using custom Golem specific durability features |
| `golem-llm-openai.wasm` | LLM implementation for OpenAI, using custom Golem specific durability features |
| `golem-llm-openrouter.wasm` | LLM implementation for OpenRouter, using custom Golem specific durability features |
| `golem-llm-bedrock.wasm` | LLM implementation for Amazon Bedrock, using custom Golem specific durability features |
| `golem-llm-anthropic-portable.wasm` | LLM implementation for Anthropic AI, with no Golem specific dependencies |
| `golem-llm-ollama-portable.wasm` | LLM implementation for Ollama, with no Golem specific dependencies |
| `golem-llm-grok-portable.wasm` | LLM implementation for xAI (Grok), with no Golem specific dependencies |
| `golem-llm-openai-portable.wasm` | LLM implementation for OpenAI, with no Golem specific dependencies |
| `golem-llm-openrouter-portable.wasm` | LLM implementation for OpenRouter, with no Golem specific dependencies |
| `golem-llm-bedrock-portable.wasm` | LLM implementation for Amazon Bedrock, with no Golem specific dependencies |
Every component exports the same `golem:llm` interface, defined here.
The `-portable` versions only depend on `wasi:io`, `wasi:http` and `wasi:logging`.
The default versions also depend on Golem's host API to implement advanced durability-related features.
Each provider has to be configured through environment variables (typically an API key):
| Provider | Environment Variable |
|---|---|
| Anthropic | ANTHROPIC_API_KEY |
| Grok | XAI_API_KEY |
| OpenAI | OPENAI_API_KEY |
| OpenRouter | OPENROUTER_API_KEY |
| Ollama | GOLEM_OLLAMA_BASE_URL |
| Amazon Bedrock | AWS_ACCESS_KEY_ID, AWS_REGION, AWS_SECRET_ACCESS_KEY and AWS_SESSION_TOKEN (optional) |
Additionally, setting the `GOLEM_LLM_LOG=trace` environment variable enables trace logging for all the communication with the underlying LLM provider.
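
For example, when running on Golem these variables are typically passed when the worker is created. The sketch below uses a hypothetical `my:example` component with the Amazon Bedrock variables and placeholder credential values; see the test instructions below for a real invocation:

```shell
# Hypothetical component/worker name and placeholder AWS credentials
golem worker new my:example/bedrock-1 \
  --env AWS_ACCESS_KEY_ID=... \
  --env AWS_SECRET_ACCESS_KEY=... \
  --env AWS_REGION=us-east-1 \
  --env GOLEM_LLM_LOG=trace
```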
The easiest way to get started is to use one of the predefined templates Golem provides.
NOT AVAILABLE YET
The `golem-llm` WASM components can also be added to existing Golem applications as a binary dependency.
NOT AVAILABLE YET
Currently, it is necessary to manually add the `wac` tool call to the application manifest to link with the selected LLM implementation. The `test` directory of this repository shows an example of this.
The summary of the steps to be done, assuming the component was created with `golem-cli component new rust my:example`:

- Copy the `profiles` section from `common-rust/golem.yaml` to the component's `golem.yaml` file (for example in `components-rust/my-example/golem.yaml`) so it can be customized.
- Add a second build step after the `cargo component build` which calls `wac` to compose with the selected (and downloaded) `golem-llm` binary. See the example below.
- Modify the `componentWasm` field to point to the composed WASM file.
- Add the `golem-llm.wit` file (from this repository) to the application's root `wit/deps/golem:llm` directory.
- Import `golem-llm.wit` in your component's WIT file: `import golem:llm/llm@1.0.0;`
Example app manifest build section:

```yaml
components:
  my:example:
    profiles:
      debug:
        build:
          - command: cargo component build
            sources:
              - src
              - wit-generated
              - ../../common-rust
            targets:
              - ../../target/wasm32-wasip1/debug/my_example.wasm
          - command: wac plug --plug ../../golem_llm_openai.wasm ../../target/wasm32-wasip1/debug/my_example.wasm -o ../../target/wasm32-wasip1/debug/my_example_plugged.wasm
            sources:
              - ../../target/wasm32-wasip1/debug/my_example.wasm
              - ../../golem_llm_openai.wasm
            targets:
              - ../../target/wasm32-wasip1/debug/my_example_plugged.wasm
        sourceWit: wit
        generatedWit: wit-generated
        componentWasm: ../../target/wasm32-wasip1/debug/my_example_plugged.wasm
        linkedWasm: ../../golem-temp/components/my_example_debug.wasm
        clean:
          - src/bindings.rs
```

To use the LLM provider components in a WebAssembly project independent of Golem you need to do the following:
- Download one of the `-portable.wasm` versions
- Download the `golem-llm.wit` WIT package and import it
- Use `wac` to compose your component with the selected LLM implementation, as sketched below.
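
A minimal sketch of that composition step, reusing the `wac plug` invocation from the app manifest example above (the `my_component.wasm` input and output names are placeholders for your own artifacts):

```shell
# Compose your component with the portable OpenAI LLM implementation
# downloaded from a golem-llm release.
wac plug --plug golem-llm-openai-portable.wasm my_component.wasm -o my_component_plugged.wasm
```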
Take the test application as an example of using `golem-llm` from Rust. The implemented test functions demonstrate the following:
| Function Name | Description |
|---|---|
| `test1` | Simple text question and answer, no streaming |
| `test2` | Demonstrates using tools without streaming |
| `test3` | Simple text question and answer with streaming |
| `test4` | Tool usage with streaming |
| `test5` | Using an image in the prompt |
| `test6` | Demonstrates that the streaming response is continued in case of a crash (with Golem only) |
| `test7` | Using a source image by passing a byte array as base64 in the prompt |
To run the examples, first you need a running Golem instance. This can be Golem Cloud or the single-executable `golem` binary started with `golem server run`.

NOTE: `golem-llm` currently requires the latest (unstable) version of Golem. It will work with the next public stable release, 1.2.2.
Then build and deploy the test application. Select one of the following profiles to choose which provider to use:
| Profile Name | Description |
|---|---|
| `anthropic-debug` | Uses the Anthropic LLM implementation and compiles the code in debug profile |
| `anthropic-release` | Uses the Anthropic LLM implementation and compiles the code in release profile |
| `ollama-debug` | Uses the Ollama LLM implementation and compiles the code in debug profile |
| `ollama-release` | Uses the Ollama LLM implementation and compiles the code in release profile |
| `grok-debug` | Uses the Grok LLM implementation and compiles the code in debug profile |
| `grok-release` | Uses the Grok LLM implementation and compiles the code in release profile |
| `openai-debug` | Uses the OpenAI LLM implementation and compiles the code in debug profile |
| `openai-release` | Uses the OpenAI LLM implementation and compiles the code in release profile |
| `openrouter-debug` | Uses the OpenRouter LLM implementation and compiles the code in debug profile |
| `openrouter-release` | Uses the OpenRouter LLM implementation and compiles the code in release profile |
| `bedrock-debug` | Uses the Amazon Bedrock LLM implementation and compiles the code in debug profile |
| `bedrock-release` | Uses the Amazon Bedrock LLM implementation and compiles the code in release profile |
```shell
cd test
golem app build -b openai-debug
golem app deploy -b openai-debug
```

Depending on the provider selected, an environment variable has to be set for the worker to be started, containing the API key for the given provider:

```shell
golem worker new test:llm/debug --env OPENAI_API_KEY=xxx --env GOLEM_LLM_LOG=trace
```

Then you can invoke the test functions on this worker:

```shell
golem worker invoke test:llm/debug test1 --stream
```

This repository uses `cargo-make` to automate build tasks. Some of the important tasks are:
| Command | Description |
|---|---|
| `cargo make build` | Build all components with Golem bindings in Debug |
| `cargo make release-build` | Build all components with Golem bindings in Release |
| `cargo make build-portable` | Build all components with no Golem bindings in Debug |
| `cargo make release-build-portable` | Build all components with no Golem bindings in Release |
| `cargo make unit-tests` | Run all unit tests |
| `cargo make check` | Checks formatting and Clippy rules |
| `cargo make fix` | Fixes formatting and Clippy rules |
| `cargo make wit` | To be used after editing the `wit/golem-llm.wit` file - distributes the changes to all `wit` directories |
The `test` directory contains a Golem application for testing various features of the LLM components. Check the Golem documentation to learn how to install Golem and `golem-cli` to run these tests.