|
|
|
|
|
by joshdev
74 days ago
|
|
We discovered a bug in AWS Bedrock that is double counting cache writes when thinking/reasoning is enabled for the Anthropic models. It’s not clear to me if this is limited to just AWS Bedrock or all providers. AWS Support is aware. We’ve also observed a much higher cache miss rate in the past few weeks. Combine both together and your usage consumption can be greatly increased. |
|