Fable Pricing Cost Breakdown: Inputs, Outputs, and Cache Hits

Fable pricing should be modeled from the shape of your workload. A long-context coding session has a different cost profile than a short analysis prompt.

Inputs

Input tokens are usually the largest controllable cost in code and research workflows. Reduce repeated context with file selection, summaries, retrieval, and prompt caching.

Outputs

Output tokens cost more than input tokens, so ask for concise patches, diffs, or structured answers when full prose is not needed.

Cache and batch

Use prompt caching when the same large context appears across turns. Use batch jobs when the work is offline and does not require interactive latency.

Bottom line

For Claude Fable 5, the right decision is practical: use it where stronger reasoning, longer context, and better coding judgment reduce total work. Measure the result against your own tasks before changing defaults.

Open the Fable pricing page