Supported Providers
Sorty supports multiple AI providers, from cloud-based services to on-device models:OpenAI
GPT-4, GPT-4 Turbo, GPT-3.5Supports vision mode for image analysis.
Anthropic
Claude 3.5 Sonnet, Claude 3 Opus/HaikuStrong reasoning, detailed explanations.
Apple Intelligence
On-device foundation modelsPrivate, no data leaves your Mac. Requires macOS 15.1+, M-series chip.
Ollama
Local LLMs (Llama, Mistral, etc.)Run models locally for complete privacy.
GitHub Copilot
GitHub Copilot LLM accessUse your existing Copilot subscription.
OpenRouter
Multi-provider gatewayAccess to 100+ models through one API.
All providers implement the
AIClientProtocol, making them interchangeable.Provider Factory
Sorty uses a factory pattern to instantiate the correct client:Configuration
OpenAI
Get API Key
Sign up at platform.openai.com and create an API key.
Configure in Sorty
Navigate to Settings → AI Provider and select OpenAI.
- API URL:
https://api.openai.com/v1 - API Key: Your OpenAI API key
- Model:
gpt-4-turbo-preview(recommended)
Recommended Models
Recommended Models
| Model | Speed | Quality | Vision | Cost |
|---|---|---|---|---|
| gpt-4-turbo-preview | Medium | Excellent | Yes | $$$ |
| gpt-4 | Slow | Excellent | Yes | $$$$ |
| gpt-3.5-turbo | Fast | Good | No | $ |
| gpt-4o | Fast | Excellent | Yes | $$$ |
Anthropic
Get API Key
Sign up at console.anthropic.com and create an API key.
Recommended Models
Recommended Models
| Model | Speed | Quality | Context Window |
|---|---|---|---|
| claude-3-5-sonnet-20241022 | Fast | Excellent | 200K tokens |
| claude-3-opus-20240229 | Medium | Best | 200K tokens |
| claude-3-haiku-20240307 | Very Fast | Good | 200K tokens |
Apple Intelligence
Enable Apple Intelligence
Go to System Settings → Apple Intelligence & Siri and enable Apple Intelligence.
Privacy Benefits
Privacy Benefits
- 100% On-Device: Nothing leaves your Mac
- No API Costs: Free to use
- No Network Required: Works offline
- No Rate Limits: Process unlimited files
- Smaller context window than cloud models
- May be slower than server-side GPUs
- No vision mode support (yet)
Ollama (Local Models)
Install Ollama
Download from ollama.ai and install.
Recommended Local Models
Recommended Local Models
| Model | Size | Quality | Speed | RAM Required |
|---|---|---|---|---|
| llama3.2 | 3B | Good | Fast | 8GB |
| mistral | 7B | Very Good | Medium | 16GB |
| qwen2.5 | 7B | Excellent | Medium | 16GB |
| llama3.1 | 8B | Excellent | Medium | 16GB |
| llama3.1:70b | 70B | Best | Slow | 64GB+ |
GitHub Copilot
Authenticate
Navigate to Settings → AI Provider and select GitHub Copilot.Click Authenticate to sign in with GitHub.
Requires an active GitHub Copilot subscription ($10/month for individuals).
OpenRouter
Get API Key
Sign up at openrouter.ai and create an API key.
Configure in Sorty
Navigate to Settings → AI Provider and select OpenRouter.
- API URL:
https://openrouter.ai/api/v1 - API Key: Your OpenRouter key
- Model: See available models
Popular Models on OpenRouter
Popular Models on OpenRouter
anthropic/claude-3.5-sonnetopenai/gpt-4-turbometa-llama/llama-3.1-70b-instructgoogle/gemini-pro-1.5mistralai/mistral-large
Streaming Support
All providers support streaming responses for live progress updates:How Streaming Works
How Streaming Works
- Client sends request with streaming enabled
- Server responds in chunks via Server-Sent Events (SSE)
- Delegate receives chunks and updates UI in real-time
- Progress insights are extracted from
>>prefixed lines - JSON output is parsed when complete
Error Handling
Sorty provides detailed, user-friendly error messages:- 403 Forbidden
- 404 Not Found
- 429 Rate Limited
- 500 Server Error
API keys are automatically redacted from error messages for security.
Vision Support
Compatible Providers
| Provider | Vision Support | Models |
|---|---|---|
| OpenAI | ✅ Yes | gpt-4-vision-preview, gpt-4-turbo, gpt-4o |
| Anthropic | ✅ Yes | claude-3-opus, claude-3-sonnet, claude-3.5-sonnet |
| Ollama | ⚠️ Limited | llava, bakllava (experimental) |
| Apple Intelligence | ❌ Not yet | - |
| GitHub Copilot | ❌ No | - |
How Vision Analysis Works
Deep Scan Support
Provider Compatibility
Why Some Providers Don't Support Deep Scan
Why Some Providers Don't Support Deep Scan
- Ollama: Local models have smaller context windows
- Apple Intelligence: API limitations
- GitHub Copilot: Designed for code completion, not document analysis
Timeout & Retry
Configure request timeouts in Settings → Advanced:API Key Security
API keys are stored securely in macOS Keychain:Related Features
File Organization
Learn how the AI organizes your files
Personas
Customize AI behavior for different workflows
The Learnings
Train the AI on your preferences