Fable 5 API Quickstart
Model ID claude-fable-5, standard Messages API, official SDKs work as-is. Point base_url at any compatible endpoint — zero code changes.
Three-minute setup
Full docs: docs.omniakey.com — Anthropic protocol on the bare host, OpenAI protocol at /v1/chat/completions; the same key also covers GPT-5.5 and Gemini 3.1.
Python
from anthropic import Anthropic
client = Anthropic(
api_key="YOUR_API_KEY",
base_url="https://api.omniakey.com", # drop this line for the official endpoint
)
message = client.messages.create(
model="claude-fable-5",
max_tokens=2048,
messages=[{"role": "user", "content": "Introduce Fable 5 in one sentence"}],
)
print(message.content[0].text)
Node.js
import Anthropic from "@anthropic-ai/sdk";
const client = new Anthropic({
apiKey: "YOUR_API_KEY",
baseURL: "https://api.omniakey.com", // drop for the official endpoint
});
const msg = await client.messages.create({
model: "claude-fable-5",
max_tokens: 2048,
messages: [{ role: "user", content: "Hello, Fable 5" }],
});
console.log(msg.content[0].text);
cURL
curl https://api.omniakey.com/v1/messages \
-H "x-api-key: $YOUR_API_KEY" \
-H "anthropic-version: 2023-06-01" \
-H "content-type: application/json" \
-d '{
"model": "claude-fable-5",
"max_tokens": 1024,
"messages": [{"role": "user", "content": "Hello"}]
}'
How to get a Fable 5 API key
Two routes, by audience:
| Official Claude Console | OmniaKey (aggregator) | |
|---|---|---|
| Best for | Teams with international cards | Instant start, no card |
| Prerequisites | Visa/Mastercard | No card required to sign up |
| Top-up | International credit card | USDT or card (Stripe) |
| Price | $10 / $50 | $3 / $15 limited-time (~70% off) |
Official: sign up at Claude Console, add a card, generate a key on the API Keys page. OmniaKey: sign up → top up (balance never expires; unused balance refundable within 30 days) → generate a key, then point base_url at it in the three-minute setup above.
Pricing at a glance
| Item | Price (per million tokens) |
|---|---|
| Input / Output | $10 / $50 |
| Cache write (5 min / 1 hr) | $12.50 / $20 |
| Cache hit / refresh | $1 |
| Batch (input / output) | $5 / $25 |
| US-only inference | 1.1× |
That's exactly 2× Opus 4.8 ($5/$25). Run your numbers with the cost guide.
Fallback billing: the gotcha to know
⚠️ Your request may not be answered by Fable 5
Queries touching cybersecurity, biology, chemistry, or large-scale distillation are intercepted by safety classifiers and answered by Opus 4.8 instead. The response carries a notice, and that request is billed at Opus 4.8 rates. Average trigger rate is under 5%, tuned conservatively (false positives happen).
- Log the fallback flag — monitor your downgrade rate, especially for security or bio products.
- Budget at a blended rate for high-sensitivity domains.
- Handle quality fallbacks — flows that depend on the 1M context should detect downgrades and decide: retry or accept.
Background on the guardrails — and why the unguarded Claude Mythos 5 isn't public — on our sister site.
Engineering notes
- Always stream. Time-to-first-token can reach ~80 seconds; non-streaming calls will trip default client timeouts. Set HTTP timeouts to the 10-minute range.
- Lean on prompt caching. Cache hits cost $1/M — long system prompts, RAG context, and few-shot examples belong in cache. It's the biggest cost lever.
- Reserve it for hard work. For short, well-defined tasks, swap the
modelstring to Opus 4.8 or Sonnet 4.6 — same code. - Compliance: Fable 5 requests carry a mandatory 30-day retention window (safety monitoring; not used for training).
Instant Fable 5 endpoint
OmniaKey: Fable 5 at $3/$15 (limited-time, ~70% off) · USDT or card · no card required to sign up
FAQ
How do I get a Fable 5 API key?
Official: sign up at Claude Console, add an international card, and generate a key. Alternative: OmniaKey issues a key right after signup — no card required, USDT or card top-up, limited-time $3/$15. Both keys work with the code examples above.
What are Fable 5's context window and max output?
A 1M-token context window and 128K-token max output per request. Pair long inputs with prompt caching ($1/M on hits) to keep costs sane.
Does the Fable 5 API have rate limits?
The official API raises rate limits by usage tier, so new accounts start low; third-party endpoints follow their own rules. Validate limits and fallback rates at low volume before going to production, and implement exponential-backoff retries.
Can I use Fable 5 in Cursor or other IDEs?
Yes — any tool that supports a custom OpenAI-compatible or Anthropic-protocol endpoint can run claude-fable-5. Step-by-step setup for Cursor, Cline, and Continue: Fable 5 in Cursor & IDEs.