Examples¶

Runnable programs live under example/llm. Each resolves a model from the built-in catalog and reads its API key from the environment, so every run command is prefixed with the key it needs. All but model_switch use a single DeepSeek key.

The code for each example is collapsed below — click a block to expand it. A good reading order is top to bottom: request basics, then streaming and reasoning, then tools, conversations, and the lower-level controls.

basic¶

The smallest possible program, end to end. GetModel resolves an entry from the built-in catalog (a provider plus a model id); Prompt wraps a single string into a one-message Context; and Complete runs the whole request, blocking until the model finishes, and returns one AssistantMessage. With an empty StreamOptions{}, the API key is read from the provider's environment variable. Start here to confirm your key and network path work before adding anything else.

DEEPSEEK_API_KEY=… go run ./example/llm/basic

example/llm/basic/main.go

// Command basic sends a single prompt to a model and prints the reply.
//
// It is the smallest possible use of the llm package: resolve a model from the
// built-in catalog, build a one-message Context with Prompt, and let Complete
// run the request and return the final assistant message.
//
// The API key is read from the provider's environment variable when
// StreamOptions.APIKey is empty:
//
//  DEEPSEEK_API_KEY=sk-... go run ./example/llm/basic
package main

import (
    "context"
    "fmt"
    "log"

    "github.com/ktsoator/or/llm"
    _ "github.com/ktsoator/or/llm/openai" // register the OpenAI-compatible protocol (DeepSeek speaks it)
)

func main() {
    model := llm.GetModel("deepseek", "deepseek-v4-flash")

    msg, err := llm.Complete(context.Background(), model,
        llm.Prompt("Explain goroutines in one sentence."),
        llm.StreamOptions{})
    if err != nil {
        log.Fatal(err)
    }

    fmt.Println(msg.Text())
}

options¶

One step past basic. PromptWithSystem prepends a system message that sets tone and role; Temperature and MaxTokens bound how the model generates; and the returned message carries more than text — Usage (input and output tokens), Usage.Cost.Total (priced from the catalog), and StopReason (why generation ended). In production, read these on every response, not just when debugging.

DEEPSEEK_API_KEY=… go run ./example/llm/options

example/llm/options/main.go

// Command options sends a prompt with a system message and per-request options.
//
// It shows the next step after the basic example: shape the model's behavior
// with a system prompt, control generation with StreamOptions, and inspect
// response metadata such as token usage, cost, and stop reason.
//
// The API key is read from the provider's environment variable when
// StreamOptions.APIKey is empty:
//
//  DEEPSEEK_API_KEY=sk-... go run ./example/llm/options
package main

import (
    "context"
    "fmt"
    "log"

    "github.com/ktsoator/or/llm"
    _ "github.com/ktsoator/or/llm/openai" // register the OpenAI-compatible protocol (DeepSeek speaks it)
)

func ptr[T any](v T) *T {
    return &v
}

func main() {
    model := llm.GetModel("deepseek", "deepseek-v4-flash")

    input := llm.PromptWithSystem(
        "You are a concise Go expert.",
        "How should I choose between channels and mutexes?",
    )

    msg, err := llm.Complete(context.Background(), model, input, llm.StreamOptions{
        Temperature: ptr(0.2),
        MaxTokens:   500,
    })
    if err != nil {
        log.Fatal(err)
    }

    fmt.Println(msg.Text())
    fmt.Printf(
        "\ntokens in=%d out=%d cost=$%.5f stop=%s\n",
        msg.Usage.Input,
        msg.Usage.Output,
        msg.Usage.Cost.Total,
        msg.StopReason,
    )
}

streaming¶

The same request as basic, consumed incrementally. Stream returns a channel of Event; each EventTextDelta is a chunk of text to print as it arrives, and the stream ends with exactly one terminal EventDone (whose Message is the assembled final message) or EventError. This is the shape behind every streaming UI — Complete is literally this loop collapsed to return only the final message. Text, reasoning, and tool-call blocks each also emit start and end events when you need finer structure.

DEEPSEEK_API_KEY=… go run ./example/llm/streaming

example/llm/streaming/main.go

// Command streaming consumes a response as a live event stream instead of
// waiting for the final message.
//
// It shows the difference from the basic example: Stream returns a channel of
// Event values that report incremental progress. Here it prints each text delta
// as it arrives, then stops on the single terminal EventDone (or EventError).
// Complete is just this loop wrapped up when you only want the final message.
//
// The API key is read from the provider's environment variable when
// StreamOptions.APIKey is empty:
//
//  DEEPSEEK_API_KEY=sk-... go run ./example/llm/streaming
package main

import (
    "context"
    "fmt"
    "log"

    "github.com/ktsoator/or/llm"
    _ "github.com/ktsoator/or/llm/openai" // register the OpenAI-compatible protocol (DeepSeek speaks it)
)

func main() {
    model := llm.GetModel("deepseek", "deepseek-v4-flash")

    events, err := llm.Stream(context.Background(), model,
        llm.Prompt("Write a short haiku about Go."),
        llm.StreamOptions{})
    if err != nil {
        log.Fatal(err)
    }

    for event := range events {
        switch event.Type {
        case llm.EventTextDelta:
            fmt.Print(event.Delta) // incremental text, printed as it streams
        case llm.EventDone:
            fmt.Println() // terminal event: event.Message holds the final message
        case llm.EventError:
            log.Fatal(event.Err)
        }
    }
}

reasoning¶

Turns on thinking with the provider-neutral Reasoning level (off through xhigh). The model's thinking streams as EventThinking* events, kept separate from the EventText* answer, so you can show a "thinking…" panel, log it, or drop it entirely. Each adapter maps the level to that provider's native reasoning form and clamps it to what the model supports; a model without reasoning simply ignores the setting, so the same code stays safe across models.

DEEPSEEK_API_KEY=… go run ./example/llm/reasoning

example/llm/reasoning/main.go

// Command reasoning asks a reasoning-capable model to think before answering and
// streams the reasoning and the final answer as separate phases.
//
// It shows the provider-neutral Reasoning knob: a ModelThinkingLevel that each
// adapter maps to its own native form and clamps to what the model supports.
// Thinking arrives as EventThinking* events, distinct from the EventText* answer,
// so a caller can display or hide it. Non-reasoning models ignore the setting.
//
// The API key is read from the provider's environment variable when
// StreamOptions.APIKey is empty:
//
//  DEEPSEEK_API_KEY=sk-... go run ./example/llm/reasoning
package main

import (
    "context"
    "fmt"
    "log"

    "github.com/ktsoator/or/llm"
    _ "github.com/ktsoator/or/llm/openai" // register the OpenAI-compatible protocol (DeepSeek speaks it)
)

func main() {
    model := llm.GetModel("deepseek", "deepseek-v4-flash")

    events, err := llm.Stream(context.Background(), model,
        llm.Prompt("A farmer must cross a river with a wolf, a goat, and a cabbage. "+
            "The boat carries only one item at a time. How does he get all three across?"),
        llm.StreamOptions{
            Reasoning: llm.ModelThinkingHigh, // off, minimal, low, medium, high, xhigh
        })
    if err != nil {
        log.Fatal(err)
    }

    for event := range events {
        switch event.Type {
        case llm.EventThinkingStart:
            fmt.Println("--- thinking ---")
        case llm.EventThinkingDelta:
            fmt.Print(event.Delta)
        case llm.EventTextStart:
            fmt.Println("\n--- answer ---")
        case llm.EventTextDelta:
            fmt.Print(event.Delta)
        case llm.EventDone:
            fmt.Println()
        case llm.EventError:
            log.Fatal(event.Err)
        }
    }
}

tools¶

The example to study — a hand-written tool loop that lays bare what or/agent automates for you. The model can call a typed tool, read the result, and keep going until it produces a final text answer. Each turn:

MustTool[T] derives a JSON Schema from a Go struct once, up front.
Stream surfaces the model's thinking and answer live.
On EventDone, append the assistant message first, then inspect ToolCalls().
DecodeToolCall[T] validates each call; on failure, feed the error back as a ToolResult so the model can self-correct its arguments.
Otherwise run the tool and append its ToolResult, then loop.
No tool calls means the streamed answer was final — stop.

Wrap this loop with run state, steering, and persistence and you have an agent, which is exactly why the loop lives in the library's foundations rather than hidden behind them.

DEEPSEEK_API_KEY=… go run ./example/llm/tools

example/llm/tools/main.go

// Command tools runs a streaming tool loop with reasoning: the model thinks,
// optionally calls a typed tool, sees the result, and continues until it gives a
// final answer.
//
// It combines the pieces the agent package automates into one loop:
//   - MustTool derives a JSON Schema from a Go struct.
//   - Stream with Reasoning surfaces the model's thinking (EventThinking*) and
//     answer (EventText*) live, turn by turn.
//   - EventDone carries the final assistant message; DecodeToolCall validates and
//     decodes any tool calls, and ToolResult feeds each outcome back.
//
// Wrap this loop with state, steering, and persistence and you have an agent.
//
// The API key is read from the provider's environment variable when
// StreamOptions.APIKey is empty:
//
//  DEEPSEEK_API_KEY=sk-... go run ./example/llm/tools
package main

import (
    "context"
    "fmt"
    "log"

    "github.com/ktsoator/or/llm"
    _ "github.com/ktsoator/or/llm/openai" // register the OpenAI-compatible protocol (DeepSeek speaks it)
)

// WeatherArgs is the tool's argument schema. A json field without omitempty is
// required; jsonschema tags add descriptions and constraints to the schema.
type WeatherArgs struct {
    City string `json:"city" jsonschema:"description=City name,minLength=1"`
    Days int    `json:"days" jsonschema:"minimum=1,maximum=10"`
}

func main() {
    model := llm.GetModel("deepseek", "deepseek-v4-flash")

    weather := llm.MustTool[WeatherArgs]("get_weather", "Get a weather forecast for a city")

    input := llm.NewContext(llm.UserText("What should I pack for a trip to Beijing over the next 3 days?"))
    input.Tools = []llm.ToolDefinition{weather}

    for turn := 1; ; turn++ {
        fmt.Printf("\n===== turn %d =====\n", turn)

        events, err := llm.Stream(context.Background(), model, input, llm.StreamOptions{
            Reasoning: llm.ModelThinkingHigh,
        })
        if err != nil {
            log.Fatal(err)
        }

        var final llm.AssistantMessage
        for event := range events {
            switch event.Type {
            case llm.EventThinkingStart:
                fmt.Print("[thinking] ")
            case llm.EventThinkingDelta:
                fmt.Print(event.Delta)
            case llm.EventTextStart:
                fmt.Print("\n[answer] ")
            case llm.EventTextDelta:
                fmt.Print(event.Delta)
            case llm.EventDone:
                fmt.Println()
                final = *event.Message // the assembled assistant turn
            case llm.EventError:
                log.Fatal(event.Err)
            }
        }
        input.Messages = append(input.Messages, &final) // record the assistant turn

        calls := final.ToolCalls()
        if len(calls) == 0 {
            return // no tool calls: the streamed answer above was final
        }

        for _, call := range calls {
            args, err := llm.DecodeToolCall[WeatherArgs](weather, call)
            if err != nil {
                // Feed the error back so the model can correct its arguments.
                input.Messages = append(input.Messages,
                    llm.ToolResult(call.ID, call.Name, "invalid arguments: "+err.Error()))
                continue
            }

            // A real tool would do work here; this one returns a canned result.
            result := fmt.Sprintf("%s: sunny, around 25C for the next %d days", args.City, args.Days)
            fmt.Printf("[tool] get_weather(city=%q, days=%d) -> %s\n", args.City, args.Days, result)
            input.Messages = append(input.Messages, llm.ToolResult(call.ID, call.Name, result))
        }
    }
}

conversation¶

A multi-turn exchange, and the clearest demonstration of the library being stateless. The history is a []llm.Message you own; after each reply you append the assistant turn (as a pointer, so it keeps the type needed to replay it) and the next user message, then resend the whole slice. The library stores nothing server-side — the follow-up ("that pattern") only resolves because the earlier turns travel with the request. A SystemPrompt set on the Context applies every turn without being part of the history.

DEEPSEEK_API_KEY=… go run ./example/llm/conversation

example/llm/conversation/main.go

// Command conversation carries history across multiple turns.
//
// It shows the step beyond a one-shot Complete: keep the messages in a slice,
// append each reply and follow-up, and send the growing history back every
// turn. The library is stateless, so retaining and resending the history is how
// the model "remembers" earlier turns.
//
// The API key is read from the provider's environment variable when
// StreamOptions.APIKey is empty:
//
//  DEEPSEEK_API_KEY=sk-... go run ./example/llm/conversation
package main

import (
    "context"
    "fmt"
    "log"

    "github.com/ktsoator/or/llm"
    _ "github.com/ktsoator/or/llm/openai" // register the OpenAI-compatible protocol (DeepSeek speaks it)
)

func main() {
    ctx := context.Background()
    model := llm.GetModel("deepseek", "deepseek-v4-flash")

    // The conversation is just a slice of messages the caller owns.
    history := []llm.Message{
        llm.UserText("Name one classic Go concurrency pattern."),
    }

    // Turn 1: ask the first question.
    first := ask(ctx, model, history)
    fmt.Println("A1:", first.Text())

    // Append the reply, then a follow-up that relies on it ("that pattern").
    // Resending the whole history is what lets the model resolve the reference.
    history = append(history, &first)
    history = append(history, llm.UserText("Show a minimal code sketch of that pattern."))

    // Turn 2: the model answers with the earlier turn in context.
    second := ask(ctx, model, history)
    fmt.Println("\nA2:", second.Text())
}

// ask sends the current history with a shared system prompt and returns the
// final assistant message.
func ask(ctx context.Context, model llm.Model, history []llm.Message) llm.AssistantMessage {
    input := llm.Context{
        SystemPrompt: "You are a concise Go tutor. Keep answers short.",
        Messages:     history,
    }

    msg, err := llm.Complete(ctx, model, input, llm.StreamOptions{MaxTokens: 500})
    if err != nil {
        log.Fatal(err)
    }
    return msg
}

model_switch¶

One conversation carried across two different wire protocols — the library's core value in a single file. Turn 1 goes to DeepSeek (OpenAI-compatible Chat Completions); turn 2 sends the same, unchanged history to MiniMax CN (Anthropic-compatible Messages). Because two protocols are in play, both provider packages must be registered (blank imports) and each needs its own key. Before each request llm re-adapts the stored history for the target protocol — downgrading images, reconciling tool-call IDs, handling reasoning signatures — so you never rebuild the conversation by hand.

DEEPSEEK_API_KEY=… MINIMAX_CN_API_KEY=… go run ./example/llm/model_switch

example/llm/model_switch/main.go

// Command model_switch continues one conversation across two protocols.
//
// Turn 1 goes to DeepSeek, which speaks OpenAI-compatible Chat Completions.
// Turn 2 sends the same history — unchanged — to MiniMax on its China endpoint,
// which speaks Anthropic-compatible Messages. The caller does not rebuild the
// conversation: llm re-adapts the stored history for the target protocol on
// each request (downgrading images, reconciling tool-call IDs, and so on).
//
// Because the two turns use different protocols, both provider packages must be
// registered. Each needs its own key:
//
//  DEEPSEEK_API_KEY=sk-...   (DeepSeek, OpenAI-compatible)
//  MINIMAX_CN_API_KEY=...    (MiniMax CN, Anthropic-compatible)
//
//  DEEPSEEK_API_KEY=... MINIMAX_CN_API_KEY=... go run ./example/llm/model_switch
package main

import (
    "context"
    "fmt"
    "log"

    "github.com/ktsoator/or/llm"
    _ "github.com/ktsoator/or/llm/anthropic" // MiniMax CN speaks Anthropic-compatible Messages
    _ "github.com/ktsoator/or/llm/openai"    // DeepSeek speaks OpenAI-compatible Chat Completions
)

func main() {
    ctx := context.Background()

    deepseek := llm.GetModel("deepseek", "deepseek-v4-flash")
    minimax := llm.GetModel("minimax-cn", "MiniMax-M2.7")

    history := []llm.Message{
        llm.UserText("Suggest a name for a Go library that unifies LLM providers."),
    }

    // Turn 1 — DeepSeek (OpenAI-compatible).
    first := complete(ctx, deepseek, history)
    fmt.Printf("[%s] %s\n", deepseek.Provider, first.Text())

    // Carry the reply forward and ask a follow-up.
    history = append(history, &first)
    history = append(history, llm.UserText("Now critique that name in one sentence."))

    // Turn 2 — MiniMax CN (Anthropic-compatible). Same history slice, different
    // protocol; no manual conversion needed.
    second := complete(ctx, minimax, history)
    fmt.Printf("[%s] %s\n", minimax.Provider, second.Text())
}

func complete(ctx context.Context, model llm.Model, history []llm.Message) llm.AssistantMessage {
    msg, err := llm.Complete(ctx, model, llm.NewContext(history...), llm.StreamOptions{MaxTokens: 500})
    if err != nil {
        log.Fatal(err)
    }
    return msg
}

advanced¶

Two lower-level controls layered onto an otherwise normal request. OnRequest hands you the exact serialized request body just before it goes out (once per attempt, retries included) — useful for debugging, logging, or asserting on the wire format in tests. A protocol-specific ToolChoice, carried on ProtocolOptions, forces the model to call a tool this turn; it is validated against the target protocol before sending, so a mismatched option fails fast instead of reaching the provider. For an endpoint not in the catalog, build an llm.Model by hand and point its BaseURL at it.

DEEPSEEK_API_KEY=… go run ./example/llm/advanced

example/llm/advanced/main.go

// Command advanced shows two lower-level controls layered on a normal request:
//
//   - Observe the exact serialized HTTP request with the OnRequest hook. It fires
//     once per attempt, including retries.
//   - Force the model to call a tool with a protocol-specific ToolChoice, carried
//     on ProtocolOptions and validated against the target protocol before sending.
//
// The model comes from the built-in catalog. To reach an OpenAI-compatible
// endpoint that is not in the catalog, construct an llm.Model by hand instead and
// point its BaseURL at that endpoint (Provider still drives the API key lookup).
//
// The API key is read from the provider's environment variable when
// StreamOptions.APIKey is empty:
//
//  DEEPSEEK_API_KEY=sk-... go run ./example/llm/advanced
package main

import (
    "context"
    "fmt"
    "log"

    "github.com/ktsoator/or/llm"
    _ "github.com/ktsoator/or/llm/openai" // register the OpenAI-compatible protocol (DeepSeek speaks it)
)

// WeatherArgs is the tool's argument schema.
type WeatherArgs struct {
    City string `json:"city" jsonschema:"description=City name,minLength=1"`
    Days int    `json:"days" jsonschema:"minimum=1,maximum=10"`
}

func main() {
    model := llm.GetModel("deepseek", "deepseek-v4-flash")

    weather := llm.MustTool[WeatherArgs]("get_weather", "Get a weather forecast for a city")

    input := llm.NewContext(llm.UserText("What's the weather in Beijing for the next 3 days?"))
    input.Tools = []llm.ToolDefinition{weather}

    msg, err := llm.Complete(context.Background(), model, input, llm.StreamOptions{
        // Observe the exact request body sent to the provider.
        OnRequest: func(method, url string, body []byte) {
            fmt.Printf(">> %s %s\n%s\n\n", method, url, body)
        },
        // Protocol-specific option: force the model to call a tool this turn.
        ProtocolOptions: &llm.OpenAICompletionsStreamOptions{
            ToolChoice: llm.OpenAIToolChoiceRequired,
        },
    })
    if err != nil {
        log.Fatal(err)
    }

    for _, call := range msg.ToolCalls() {
        args, err := llm.DecodeToolCall[WeatherArgs](weather, call)
        if err != nil {
            log.Fatal(err)
        }
        fmt.Printf("model chose: get_weather(city=%q, days=%d)\n", args.City, args.Days)
    }
}