LLM Model Configuration
LLM (Large Language Model) settings control the AI model used for RAG chat responses in the Playground.
Available Providers
| Provider | Models | Notes |
|---|---|---|
| OpenAI | GPT-4o, GPT-4o Mini | Most popular, excellent quality |
| Anthropic | Claude Sonnet 4, Claude 3.5 Sonnet, Claude 3.5 Haiku | Long context, nuanced responses |
Configuration Options
- Provider: Select OpenAI or Anthropic
- API Key: Enter your provider's API key
- Model: Choose the specific LLM model
- Temperature: Adjust creativity (0.0 = focused, 2.0 = creative)
- Max Tokens: Limit response length
API Key Management
Each provider requires its own API key:
- OpenAI: Get your key from platform.openai.com
- Anthropic: Get your key from console.anthropic.com
The settings page shows a status banner indicating whether your API key is configured:
- Green checkmark: API key is configured and ready
- Yellow warning: API key required - enter your key to enable chat
Choosing an LLM Model
| Model | Context | Best For | Quality |
|---|---|---|---|
| GPT-4o | 128K tokens | Complex reasoning, detailed responses | Highest (OpenAI) |
| GPT-4o Mini | 128K tokens | Fast, cost-effective responses | High |
| Claude Sonnet 4 | 200K tokens | Long documents, nuanced analysis | Highest (Anthropic) |
| Claude 3.5 Haiku | 200K tokens | Fast responses, simple queries | Good |
Temperature Guide
| Temperature | Behavior | Best For |
|---|---|---|
| 0.0 - 0.3 | Focused, deterministic | Factual Q&A, technical docs |
| 0.4 - 0.7 | Balanced | General use (default: 0.7) |
| 0.8 - 1.2 | Creative, varied | Content generation |
| 1.3 - 2.0 | Highly creative | Brainstorming, exploration |