Pushing performance with HTTP's Cache-Control header

Oct 23, 2024

In this post I want to explore how we can use HTTP’s native caching features to reduce server load and response times as an application begins to scale. By having clients or intermediary caches store data we can save ourselves roundtrips to the server, decrease the amount of requests to the server, and descreas the amount of data sent over the network. We also must balance this with making sure clients have up-to-date data when they need it. So how does the server decide what gets cached, where, and for how long? That’s where response headers like Cache-Control come into play.

We’ll start by walking through an example request flow with the Cache-Control header and then build from there.

The Cache-Control Header

The flow starts with the client making a GET request for data. The server responds with a 200 OK status, the JSON data in the body of the request, and a Cache-Control header indicating the age of the cache, in this case 3600 milliseconds (1 hour).

GET /api/data

200 OK
Cache-Control: max-age=3600
Content-Type: application/json
{ "data": "content" }

The client stores in its local cache the data for the endpoint and the age of the cache.

When the client makes a GET request for the same endpoint and the cache has not yet expired, the response data is served locally from its cache. We avoid making a roundtrip request to the server, significantly reducing latency and bandwidth on the application. This storing and serving from cache is part of the HTTP protocol, and is enacted by the browser when it receives the Cache-Control header.

After a full hour passes the cache becomes stale. If the client makes another GET request to the same endpoint the browser forces a request to the server. The return data and the cache age is again used to re-populate the cache.

Request flow for the Cache-Control header

Optimizations with Last-Modified

This is great. We’ve significantly reduced load on the server. But what if we need to optimize further? Well, with the previous implementation once the cache expires the client will always fetch the data from the server when it needs it again, even if that data hasn’t changed. Could we check if the data has changed and only send it back if it has? The Last-Modified and ETag headers allow us to do just that.

We’ll start with Last-Modified. When the client initially requests the data the server sends back a Last-Modified header with the datetime of when the resource last changed. The client holds this in cache for the endpoint.

Now once the cache expires and the client needs the data again it will make a conditional request sending the If-Modified-Since header holding the value of Last-Modified stored for that endpoint.

The server will refetch the resource and compare its updated_at field with the value of the If-Modified-Since header. If the resource hasn’t been updated since the last time it was cached, the server sends back a 304 Not Modified with no data in the payload. The client resets the cache age for that endpoint and continue pulling from local cache.

Only if the resource has been modified the server sends the updated data in the JSON of the response and the updated datetime in the Last-Modified header. The client can then reset the data in cache along with the new datetime for Last-Modified.

See the full flow mapped out.

Request flow for the If-Modified-Since header 1

The ETag header

The ETag (entity tag) header provides an alternative way to determine if a resource has been modified. The flow is very similar to using Last-Modified. Instead of sending the datetime of when a resource was last modified, the server generates a unique identifier, often a hash, from the data itself and sends this in an ETag header for the client to store in cache.

When the cache expires and the client requests the data again, it checks if the data has changed by sending an If-None-Match header in the request with the value of the ETag.

The server fetches the data again and regenerates the ETag. If it matches the If-None-Match header sent over, we know the data hasn’t changed and the server sends a 304 Not Modified. If the data has changed the server will send the updated data.

Further resources

https://stackoverflow.com/questions/24542959/how-does-a-etag-work-in-expressjs

Table of contents

The Cache-Control Header

Optimizations with Last-Modified

The ETag header

Further resources