The SDK ships with zero hard dependencies on any specific vendor; you install and import only what you need.

Supported Providers

The SDK currently ships first-party integrations for OpenAI and Anthropic; the Provider Features table below summarizes what each supports.

Quick Start

from ai_sdk import openai, anthropic, generate_text

# OpenAI
openai_model = openai("gpt-4.1-mini")
res = generate_text(model=openai_model, prompt="Hello!")

# Anthropic
anthropic_model = anthropic("claude-3-haiku-20240307")
res = generate_text(model=anthropic_model, prompt="Hello!")

Provider Features

| Feature           | OpenAI          | Anthropic        |
|-------------------|-----------------|------------------|
| Text Generation   | ✅ Full support | ✅ Full support  |
| Streaming         | ✅ Real-time    | ✅ Real-time     |
| Tool Calling      | ✅ Native       | ✅ Compatible    |
| Structured Output | ✅ Native       | ✅ Compatible    |
| Embeddings        | ✅ Full support | ❌ Not available |
| Function Calling  | ✅ Native       | ✅ Compatible    |
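
Streaming works the same way for both providers. A minimal sketch, assuming the SDK exposes a stream_text helper that mirrors generate_text and yields text deltas (check your version for the exact name and return shape):
from ai_sdk import openai, stream_text  # stream_text is assumed; see note above

model = openai("gpt-4.1-mini")

# Print deltas as they arrive.
for delta in stream_text(model=model, prompt="Tell me a short joke"):
    print(delta, end="", flush=True)
print()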

Authentication

Environment Variables

Set your API keys as environment variables:
# OpenAI
export OPENAI_API_KEY="sk-..."

# Anthropic
export ANTHROPIC_API_KEY="sk-ant-..."
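
You can verify the key is visible to your process before constructing a model (plain standard-library check, nothing SDK-specific):
import os

if not os.environ.get("OPENAI_API_KEY"):
    raise RuntimeError("OPENAI_API_KEY is not set")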

Explicit API Keys

Pass API keys directly to the provider functions:
from ai_sdk import openai, anthropic

# OpenAI with explicit key
model = openai("gpt-4.1-mini", api_key="sk-...")

# Anthropic with explicit key
model = anthropic("claude-3-haiku-20240307", api_key="sk-ant-...")

Common Parameters

All providers support these common parameters:
| Parameter         | Type  | Default | Description                                |
|-------------------|-------|---------|--------------------------------------------|
| model             | str   | -       | Model identifier (required)                |
| api_key           | str   | None    | API key (uses env var if not provided)    |
| temperature       | float | 1.0     | Controls randomness (0.0 = deterministic) |
| max_tokens        | int   | None    | Maximum tokens to generate                 |
| top_p             | float | 1.0     | Nucleus sampling parameter                 |
| frequency_penalty | float | 0.0     | Reduces repetition                         |
| presence_penalty  | float | 0.0     | Encourages new topics                      |
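
These knobs behave the same across providers; a sketch combining several of them (values are illustrative):
from ai_sdk import anthropic, generate_text

model = anthropic(
    "claude-3-haiku-20240307",
    temperature=0.2,  # closer to deterministic, good for factual tasks
    max_tokens=500,   # cap the generation length
    top_p=0.9,        # nucleus sampling
)
res = generate_text(model=model, prompt="Summarize the water cycle in two sentences.")
print(res.text)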

Provider-Specific Features

OpenAI

  • Native structured output with response_format="json_object" (sketched after this list)
  • Function calling with full tool support
  • Embedding models with automatic batching
  • Vision models with multimodal support
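
A sketch of the structured-output path noted above; whether response_format is set per call (as here) or on the model may differ in your version:
from ai_sdk import openai, generate_text

model = openai("gpt-4.1-mini")
res = generate_text(
    model=model,
    prompt="Return a JSON object with keys 'city' and 'country' for Paris.",
    response_format="json_object",  # assumed to be forwarded to the provider
)
print(res.text)  # a JSON string you can json.loads()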

Anthropic

  • OpenAI-compatible API using a compatibility layer
  • Function calling via OpenAI SDK
  • Streaming support with real-time deltas
  • System prompts and message-based conversations
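
A sketch of a message-based conversation with a system prompt, assuming generate_text forwards system and messages as the LanguageModel interface below suggests:
from ai_sdk import anthropic, generate_text

model = anthropic("claude-3-haiku-20240307")
res = generate_text(
    model=model,
    system="You are a terse assistant.",
    messages=[{"role": "user", "content": "What is 2 + 2?"}],
)
print(res.text)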

Bring Your Own Provider

Implement the LanguageModel or EmbeddingModel interface – only a handful of methods are required:
from ai_sdk.providers.language_model import LanguageModel

class MyLLM(LanguageModel):
    def __init__(self, endpoint: str):
        self.endpoint = endpoint

    def generate_text(self, *, prompt=None, system=None, messages=None, **kw):
        # HTTP POST → return {"text": str, "finish_reason": "stop"}
        ...

    def stream_text(self, *, prompt=None, system=None, messages=None, **kw):
        # generator yielding str deltas as they arrive
        ...
Pass an instance to generate_text and you instantly get the full SDK experience (tool-calling, streaming, …) without writing another line of glue code.
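
For instance, with a hypothetical endpoint:
from ai_sdk import generate_text

model = MyLLM(endpoint="https://llm.example.com/v1/generate")  # hypothetical endpoint
res = generate_text(model=model, prompt="Hello!")
print(res.text)
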
We maintain zero coupling between helpers and providers, so community packages can live in separate repos.

Error Handling

All providers handle common errors gracefully:
from ai_sdk import openai, generate_text

try:
    model = openai("gpt-4.1-mini")
    res = generate_text(model=model, prompt="Hello!")
    print(res.text)
except Exception as e:
    print(f"Error: {e}")
Common error scenarios:
  • Invalid API key - Check your credentials
  • Rate limiting - Implement exponential backoff
  • Model not found - Verify model name
  • Token limit exceeded - Reduce input length
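
Absent typed exceptions (an assumption here; check your SDK version), you can branch on the message text to tell the first two scenarios apart:
from ai_sdk import openai, generate_text

model = openai("gpt-4.1-mini")
try:
    res = generate_text(model=model, prompt="Hello!")
    print(res.text)
except Exception as e:
    msg = str(e).lower()
    if "api key" in msg or "401" in msg:
        print("Invalid API key - check your credentials")
    elif "rate" in msg:
        print("Rate limited - retry with backoff (see Best Practices)")
    else:
        raise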

Best Practices

1. Use Environment Variables

Keep API keys secure:
# Good
model = openai("gpt-4.1-mini")  # Uses OPENAI_API_KEY env var

# Avoid
model = openai("gpt-4.1-mini", api_key="sk-...")  # Hardcoded key

2. Set Default Parameters

Configure once, use everywhere:
model = openai(
    "gpt-4.1-mini",
    temperature=0.7,
    max_tokens=1000,
    user="my-app/123"  # For analytics
)

3. Handle Rate Limits

Implement retry logic for production:
import time
from ai_sdk import openai, generate_text

def generate_with_retry(model, prompt, max_retries=3):
    for attempt in range(max_retries):
        try:
            return generate_text(model=model, prompt=prompt)
        except Exception as e:
            if "rate_limit" in str(e).lower() and attempt < max_retries - 1:
                time.sleep(2 ** attempt)  # Exponential backoff
                continue
            raise
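
Usage is identical to a direct call:
model = openai("gpt-4.1-mini")
print(generate_with_retry(model, "Hello!").text)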

4. Monitor Usage

Track token usage for cost management:
from ai_sdk import openai, generate_text

model = openai("gpt-4.1-mini")
res = generate_text(model=model, prompt="Hello!")

if res.usage:
    print(f"Tokens used: {res.usage.total_tokens}")
    print(f"Cost: ${res.usage.total_tokens * 0.00001:.4f}")  # Approximate