Pricing variables
Treat each pricing variable as something you can measure when deciding whether to adopt the Claude Fable 5 API. The published Fable 5 API rate is $10 per million input tokens and $50 per million output tokens; batch processing and prompt caching can materially change the effective bill. Before relying on any pricing figure in production, write down the assumption, its source, its owner, and the acceptance test that would confirm it.
- What to verify: source, current status, and owner.
- What to measure: quality, latency, cost, retries, and review time.
- What to document: rollback path, fallback model, and user-facing behavior.
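As a starting point, the two published rates can be turned into a per-request estimate. The following is a minimal sketch, assuming list prices only (no batch or caching adjustments); the function name and the sample token counts are illustrative, not taken from any official SDK.

```python
# Rates quoted in the text above, in USD per million tokens (MTok).
INPUT_USD_PER_MTOK = 10.0
OUTPUT_USD_PER_MTOK = 50.0


def estimate_request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the list rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MTOK
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MTOK
    return input_cost + output_cost


# Hypothetical request: 20,000 input tokens and 1,500 output tokens.
print(f"${estimate_request_cost(20_000, 1_500):.4f}")  # prints $0.2750
```

Keeping the rates in named constants makes it easy to update them when the official pricing page changes.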
Input vs output costs
Input and output tokens are billed at different rates, so estimate them separately. At the published rate of $10 per million input tokens and $50 per million output tokens, an output token costs five times as much as an input token, which means response length and retries often drive the bill more than prompt size does. The table below lists the facts to confirm against the official documentation before you commit to an estimate.
| Fact to verify | Why it matters |
|---|---|
| claude-fable-5 | Use the current model ID in configuration and tests. |
| 1M context / 128K output | Large capacity does not remove the need for context discipline. |
| $10 input / $50 output per MTok | Output length and retries drive real cost. |
| Prompt cache and batch options | Reusable context and offline work can reduce effective cost. |
| Refusal and fallback behavior | Safety paths must be visible in logs, UI, and support workflows. |
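To see how output length and retries shift the bill, the sketch below splits a request's cost into its input and output parts. It assumes, as a simplification, that every attempt resends the full input and produces a full-length output; the function name and the sample numbers are hypothetical.

```python
# Rates quoted in the table above, in USD per million tokens (MTok).
INPUT_USD_PER_MTOK = 10.0
OUTPUT_USD_PER_MTOK = 50.0


def cost_breakdown(input_tokens: int, output_tokens: int, attempts: int = 1) -> dict:
    """Split the cost of a request into input and output parts.

    Simplifying assumption: each attempt (first try plus retries) is
    billed for the full input and a full-length output.
    """
    input_cost = attempts * input_tokens * INPUT_USD_PER_MTOK / 1_000_000
    output_cost = attempts * output_tokens * OUTPUT_USD_PER_MTOK / 1_000_000
    total = input_cost + output_cost
    return {
        "input_usd": input_cost,
        "output_usd": output_cost,
        "total_usd": total,
        "output_share": output_cost / total if total else 0.0,
    }


# Hypothetical request: 8,000 input tokens, 2,000 output tokens, one retry.
single = cost_breakdown(8_000, 2_000)
retried = cost_breakdown(8_000, 2_000, attempts=2)
print(f"{single['total_usd']:.2f} -> {retried['total_usd']:.2f}")
```

In this example the output is a quarter of the input by token count yet accounts for more than half of the cost, and a single retry doubles the total.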
Caching impact
Prompt caching changes the effective input cost when the same context is reused across requests. The published rate of $10 per million input tokens and $50 per million output tokens is the baseline; how much caching saves depends on the cache hit rate and on the prices for cache writes and cache hits, so confirm those prices in the official docs and measure hit rates on real traffic. Write down the assumed hit rate, its source, its owner, and the acceptance test before using it in production.
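The effect of caching on input cost can be sketched with a simple blended-rate model. This is an illustration under stated assumptions, not the provider's billing formula: the cache write and read multipliers are required parameters because their real values are not given here, and the values in the usage example are placeholders to be replaced with figures from the official docs.

```python
def effective_input_cost(
    prefix_tokens: int,
    fresh_tokens: int,
    requests: int,
    hit_rate: float,
    cache_write_multiplier: float,  # ASSUMPTION: supply from official docs
    cache_read_multiplier: float,   # ASSUMPTION: supply from official docs
    base_usd_per_mtok: float = 10.0,  # input rate quoted in the text
) -> float:
    """Estimate total input cost in USD when a shared prefix is cached.

    Model: on a cache hit the prefix is billed at the read multiplier,
    on a miss it is billed at the write multiplier. Tokens outside the
    prefix are always billed at the base rate.
    """
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be between 0 and 1")
    blended = hit_rate * cache_read_multiplier + (1 - hit_rate) * cache_write_multiplier
    prefix_cost = prefix_tokens * base_usd_per_mtok / 1_000_000 * blended
    fresh_cost = fresh_tokens * base_usd_per_mtok / 1_000_000
    return requests * (prefix_cost + fresh_cost)


# Placeholder multipliers (NOT published figures) and a hypothetical workload.
cached = effective_input_cost(50_000, 2_000, 1_000, 0.9, 1.25, 0.1)
uncached = effective_input_cost(50_000, 2_000, 1_000, 0.0, 1.0, 1.0)
print(f"cached: ${cached:.2f}, uncached: ${uncached:.2f}")
```

Setting both multipliers to 1.0 reproduces the uncached cost, which gives a baseline to compare measured hit rates against.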
Budget worksheet
A budget worksheet turns these variables into a number that someone owns. Start from the published rate of $10 per million input tokens and $50 per million output tokens, then add expected request volume, tokens per request, retries, and any batch processing or prompt caching adjustments, since these can materially change the effective bill. Record each assumption with its source, owner, and acceptance test before using the worksheet in production.
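One way to keep such a worksheet honest is to store each row as data and compute the monthly figure from it. The sketch below is a minimal example at list prices; the class name, field names, workflows, and volumes are all hypothetical, and retries are modeled as a simple average number of extra attempts per request.

```python
from dataclasses import dataclass

# Rates quoted in the text above, in USD per million tokens (MTok).
INPUT_USD_PER_MTOK = 10.0
OUTPUT_USD_PER_MTOK = 50.0


@dataclass
class WorksheetRow:
    workflow: str
    owner: str
    requests_per_month: int
    input_tokens: int       # average per request
    output_tokens: int      # average per request
    retry_rate: float       # average extra attempts per request, e.g. 0.05
    budget_usd: float       # monthly budget agreed with the owner

    def monthly_cost(self) -> float:
        """Estimated monthly spend at list prices, including retries."""
        per_request = (
            self.input_tokens * INPUT_USD_PER_MTOK
            + self.output_tokens * OUTPUT_USD_PER_MTOK
        ) / 1_000_000
        return self.requests_per_month * (1 + self.retry_rate) * per_request

    def over_budget(self) -> bool:
        return self.monthly_cost() > self.budget_usd


# Hypothetical rows for illustration only.
rows = [
    WorksheetRow("support-summaries", "team-a", 10_000, 3_000, 500, 0.05, 700.0),
    WorksheetRow("agent-runs", "team-b", 500, 200_000, 20_000, 0.20, 1_500.0),
]
for row in rows:
    status = "OVER" if row.over_budget() else "ok"
    print(f"{row.workflow}: ${row.monthly_cost():.2f} of ${row.budget_usd:.2f} ({status})")
```

Listing the owner and budget next to the token assumptions makes it clear who signs off when a row goes over.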
Operational checklist
- Confirm the current official docs for Claude Fable 5 API pricing before launch.
- Record the model ID, provider, region, and pinned version in configuration.
- Run at least five production-like test tasks before changing defaults.
- Log input tokens, output tokens, stop_reason, retries, latency, and final outcome.
- Keep a cheaper fallback route for routine work and a manual review path for refusals.
- Review cost after the first 50 to 100 real requests, not after a single demo.
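The logging item in the checklist above can be sketched as a small record type written out as one JSON line per request. This is an illustration only: the class, the field names beyond those listed in the checklist, and the sample values (including the stop_reason string) are assumptions, and the review threshold of 50 simply mirrors the lower bound mentioned above.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class RequestLog:
    model_id: str
    input_tokens: int
    output_tokens: int
    stop_reason: str
    retries: int
    latency_ms: float
    outcome: str  # e.g. "accepted", "refused", "fallback" (illustrative labels)


def to_log_line(record: RequestLog) -> str:
    """Serialize one request record as a single JSON line."""
    return json.dumps(asdict(record), sort_keys=True)


def ready_for_cost_review(records: list, minimum: int = 50) -> bool:
    """True once enough real requests have been logged to review cost."""
    return len(records) >= minimum


# Illustrative record; the values are made up for the example.
record = RequestLog("claude-fable-5", 4_200, 650, "end_turn", 0, 1830.5, "accepted")
print(to_log_line(record))
```

Writing one structured line per request keeps the later cost review a matter of aggregation rather than log parsing.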
Concrete next steps
- Estimate input and output tokens separately.
- Model cache writes, cache hits, batch jobs, retries, and fallback requests.
- Set per-workflow budgets before enabling long agent runs.
- Review spend by task outcome, not by prompt count.
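Reviewing spend by task outcome, as the last step suggests, can be sketched as a small aggregation over logged requests. The example assumes list prices and a hypothetical tuple layout of (outcome, input tokens, output tokens); the outcome labels are illustrative.

```python
from collections import defaultdict

# Rates quoted earlier in the text, in USD per million tokens (MTok).
INPUT_USD_PER_MTOK = 10.0
OUTPUT_USD_PER_MTOK = 50.0


def spend_by_outcome(records):
    """Group request count and estimated spend by outcome label.

    `records` is an iterable of (outcome, input_tokens, output_tokens).
    """
    totals = defaultdict(lambda: {"requests": 0, "usd": 0.0})
    for outcome, input_tokens, output_tokens in records:
        cost = (
            input_tokens * INPUT_USD_PER_MTOK
            + output_tokens * OUTPUT_USD_PER_MTOK
        ) / 1_000_000
        totals[outcome]["requests"] += 1
        totals[outcome]["usd"] += cost
    return dict(totals)


def cost_per_accepted(records) -> float:
    """Total spend across all requests divided by accepted outcomes."""
    totals = spend_by_outcome(records)
    accepted = totals.get("accepted", {"requests": 0})["requests"]
    if accepted == 0:
        raise ValueError("no accepted outcomes to divide by")
    return sum(t["usd"] for t in totals.values()) / accepted


# Hypothetical log entries.
sample = [
    ("accepted", 4_000, 800),
    ("accepted", 4_000, 1_000),
    ("refused", 4_000, 50),
    ("retried", 4_000, 900),
]
print(spend_by_outcome(sample))
print(f"${cost_per_accepted(sample):.5f} per accepted task")
```

Dividing total spend by accepted outcomes charges refusals and retries to the work that actually shipped, which is the number a per-workflow budget should track.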
Sources used
- platform.claude.com - referenced for current model, API, pricing, workflow, or integration details.