Skip to main content
Technical Deep DiveD8 Reliability

CDN and Caching for Agent Readiness: Why Response Time Matters for D8 Reliability

D8 Reliability carries a 0.13 weight in the Agent Readiness Score — the second-highest dimension. CDN-backed APIs respond in under 100ms. Origin-only APIs hit 500ms or more. AgentHermes detects CDN headers, HTTP/2 support, and response time. The fix takes 15 minutes and costs nothing.

AH
AgentHermes Research
April 15, 202613 min read

Why AI Agents Care About Response Time

Humans tolerate 2-3 second page loads because they are looking at a screen and waiting. AI agents do not wait patiently. Every millisecond an agent waits for your API response is a millisecond it burns on compute. Agents operate under time budgets and cost budgets. A slow API is an expensive API — not because of pricing, but because of wasted inference time.

More critically, agents learn. If your API consistently responds in 800ms while a competitor responds in 60ms, the agent will prefer the faster endpoint for future requests. This is not theoretical — it is how LLM-based agents optimize their tool selection. Slow responses train agents to avoid you.

AgentHermes measures response time as part of D8 Reliability. The 0.13 weight makes D8 the second most impactful dimension after D2 API Quality (0.15). Response time is not the only D8 signal — uptime, status pages, and error rates also contribute — but it is the most immediate and the easiest to fix.

0.13
D8 weight
<100ms
CDN-backed response
500ms+
origin-only response
$0
Cloudflare free tier

Response Time Tiers and D8 Impact

AgentHermes categorizes response times into tiers. Edge-cached responses get full D8 credit. Responses over 2 seconds get zero credit and risk agent abandonment.

<50ms

Edge-cached

Full D8 credit

Vercel, Cloudflare Workers, cached API responses

50-100ms

CDN-backed

Near-full D8 credit

Supabase, Stripe, AWS API Gateway + CloudFront

100-300ms

Fast origin

Partial D8 credit

Well-configured VPS, Render, Railway

300-500ms

Slow origin

Minimal D8 credit

Shared hosting, unoptimized WordPress

500ms-2s

Unreliable

Near-zero D8 credit

Cold-start serverless, overloaded origin

>2s

Timeout risk

Zero D8 credit

Agents may abandon the request entirely

CDN Headers AgentHermes Detects

AgentHermes scans response headers to detect CDN infrastructure. The presence of these headers confirms that responses are being served from edge locations, not a single origin server.

Header
Provider
What It Means
D8 Impact
CF-Cache-Status
Cloudflare
HIT, MISS, DYNAMIC, BYPASS. AgentHermes detects this to confirm edge caching is active.
+2-3 pts D8
X-Cache
AWS CloudFront / Fastly
Hit from cloudfront or HIT. Standard header across multiple CDN providers.
+2-3 pts D8
Age
Any CDN
Seconds since the response was cached. Presence confirms caching is working. Age: 0 means fresh from origin.
+1 pt D8
X-Vercel-Cache
Vercel
HIT, STALE, MISS. Vercel edge network cache status. Vercel scores 70 partly because of this.
+2-3 pts D8
Cache-Control
Origin server
max-age, s-maxage, stale-while-revalidate. Tells CDNs and agents how long to cache responses.
+1-2 pts D8
Via
Any proxy/CDN
Indicates the request passed through a proxy or CDN. 1.1 varnish or 1.1 cloudfront.
+1 pt D8

Key insight: You do not need all of these headers. Any one CDN header confirming edge caching is enough to boost D8. Cloudflare free tier automatically adds CF-Cache-Status, which AgentHermes detects. That single header proves your responses are edge-cached.

Every Top Scorer Uses Edge CDNs

The correlation between CDN usage and high Agent Readiness Scores is near-perfect. Every business scoring Silver or above uses edge infrastructure.

Business
Score
CDN
Response Time
Tier
Resend
75
Cloudflare
~35ms
Gold
Vercel
70
Vercel Edge Network
~22ms
Silver
Supabase
69
Cloudflare + AWS
~45ms
Silver
Stripe
68
Cloudflare
~60ms
Silver
Avg local business
8
None
~800ms
Not Scored

The average local business responds in ~800ms from a single origin server — often shared hosting with no CDN, no HTTP/2, and no cache headers. That is 10-20x slower than the businesses scoring Silver and above. The gap is entirely fixable with free infrastructure.

The 15-Minute Fix: Cloudflare Free Tier

The single highest-impact, lowest-effort change for D8 Reliability. Here is the exact sequence.

1

Sign up for Cloudflare (free)

Create a free account at cloudflare.com. Add your domain. Cloudflare scans your DNS records automatically.

2

Update nameservers

Point your domain nameservers to Cloudflare. This is the only step that requires DNS access. Takes 5 minutes to propagate.

3

Enable proxying (orange cloud)

Toggle the proxy icon to orange for your API and website records. This routes traffic through Cloudflare edge, enabling CDN, HTTPS, HTTP/2, and DDoS protection.

4

Set cache rules

For static assets and public API responses: Cache-Control: public, max-age=3600, s-maxage=86400. For dynamic responses: Cache-Control: no-store. Cloudflare respects these headers.

5

Verify with AgentHermes

Run a scan at /audit. Check that CF-Cache-Status appears in the response headers. Your D8 score should increase 3-5 points immediately.

After this setup, your responses travel from the nearest Cloudflare edge location — one of 330+ data centers worldwide — instead of from a single origin server. Response times drop from 500ms+ to under 100ms for cached responses, and 150-200ms for dynamic responses that still benefit from edge TLS termination and connection pooling.

Beyond Basic CDN: What Separates Silver from Gold

HTTP/2 and HTTP/3

Multiplexed requests, header compression, and 0-RTT connection resumption. Agents making parallel API calls see 30-50% faster total completion time. Cloudflare enables HTTP/2 by default and HTTP/3 with one toggle.

stale-while-revalidate

The best cache-control directive for APIs. Serve stale cached content instantly while fetching a fresh copy in the background. Agents always get a fast response, and the data is never more than one request behind.

Edge compute (Workers)

Run API logic at the edge, not at origin. Cloudflare Workers, Vercel Edge Functions, and Deno Deploy execute code in 300+ locations. Sub-50ms response times for dynamic content. This is why Vercel scores 70.

Health endpoints

A /health or /status endpoint that returns 200 with uptime data. CDNs can monitor this and route around failures. AgentHermes checks for health endpoints as part of D8. Combined with CDN caching, this maximizes reliability.

The compounding effect: CDN alone improves D8. But CDN plus HTTP/2 plus edge compute plus health endpoints plus a status page pushes D8 close to maximum. Vercel and Supabase score 69-70 because they stack all five. Most businesses have zero of them.

Frequently Asked Questions

Do AI agents actually check response times?

Yes. AI agents operate under time and cost budgets. A slow API burns tokens while the agent waits, increases the chance of timeout errors, and degrades the user experience. Agents learn to prefer faster, more reliable endpoints. AgentHermes measures response time as part of D8 Reliability, which carries a 0.13 weight — the second-highest dimension after D2 API Quality.

Is Cloudflare free tier enough for agent readiness?

For most businesses, yes. Cloudflare free tier gives you a global CDN, automatic HTTPS, HTTP/2, DDoS protection, and cache headers — all of which improve D8 Reliability. The free tier handles millions of requests per month. Upgrade to Pro ($20/month) only if you need advanced caching rules, image optimization, or WAF rules.

Does HTTP/2 matter for agent readiness?

HTTP/2 enables multiplexed requests, header compression, and server push. Agents making multiple API calls benefit from multiplexing — they can send parallel requests over a single connection instead of opening multiple TCP connections. AgentHermes detects HTTP/2 support and it contributes to D8 scoring. Every major CDN enables HTTP/2 by default.

What about API responses that cannot be cached?

Dynamic API responses (user-specific data, real-time availability) should not be cached at the CDN layer. But the CDN still helps: TLS termination at the edge, HTTP/2, connection pooling to origin, and DDoS protection all reduce response time even for uncacheable responses. Set Cache-Control: no-store for truly dynamic responses and let the CDN optimize the transport layer.


How fast is your API to agents?

See your D8 Reliability score and response time in 60 seconds. If you are over 300ms, the Cloudflare free tier fix takes 15 minutes.


Share this article: