Improving REST API Performance Using CloudFront

Recently we started working with a healthcare company whose slow API response times were hurting the user experience. The customer is a US-based healthcare company addressing the lack of healthcare interoperability with its patient-centered mobile platform, through which patients can engage, communicate and participate in their healthcare across providers that may be using different PM/EHR/EMR products.
The product includes a SaaS platform for healthcare providers to share and communicate patient health information, along with a cutting-edge point-of-service patient check-in app for waiting rooms.
Because the product is complex, one requirement from the product team was to solve this issue without changing much existing code. The client had already evaluated a couple of paid tools that worked well but were very costly, so a second challenge was to find an open source tool that could serve the purpose. We therefore had a two-pronged challenge: improve performance and reduce cost.
We did some research and found tools like Solr, which initially seemed like a good fit. When we were close to setting it up, though, we decided to collect detailed timings from the APIs, and what we found made Solr largely irrelevant. Some of the stats from different APIs looked like this:

Before CloudFront:

API       Total Time   Database Call   % of time that can be optimised using Solr
PCP       2192 ms      662 ms          30%
Allergy   3758 ms      475 ms          12%
After looking at the low-level timings, we found that the database call was not the biggest culprit. Calculations happening on the backend were causing the major delays, along with the SSL handshake and a few other things. So we started rethinking our approach to the problem.
A few months earlier I had seen the Slack case study on AWS describing how they improved their API performance using CloudFront. That prompted me to go back and check the AWS documentation, and I started playing around with CloudFront for custom origins. Until then, we had only used CloudFront alongside S3 for static website hosting, which had worked well. After configuring CloudFront for one API we achieved an improvement of almost 233% in API response time without changing a single line of backend code. The results after using CloudFront are shown below.
API       Before CF   After CF   % improvement
PCP       2192 ms     14 ms      197%
Allergy   3758 ms     16 ms      233%
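The write-up does not say how the percentage column was calculated, so it can help to look at the raw timings directly. The sketch below is only an illustration: it takes the before and after values from the table at face value and computes two common measures, the percentage reduction in response time and the speed-up factor. The function names are made up for this example.

```python
def percent_reduction(before_ms: float, after_ms: float) -> float:
    """Share of the original response time that was removed, in percent."""
    return (before_ms - after_ms) / before_ms * 100


def speedup_factor(before_ms: float, after_ms: float) -> float:
    """How many times faster the response is after the change."""
    return before_ms / after_ms


# Timings copied from the table above: (before CloudFront, after CloudFront).
timings = {"PCP": (2192, 14), "Allergy": (3758, 16)}

for api, (before, after) in timings.items():
    print(
        f"{api}: {percent_reduction(before, after):.1f}% less time, "
        f"about {speedup_factor(before, after):.0f}x faster"
    )
```

By either measure, a cached response served from CloudFront takes a small fraction of the time of an uncached call to the backend.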
So far so good, but how do you configure CloudFront to achieve such results? The steps below can help anyone configure CloudFront in front of existing APIs.

Create CloudFront Distribution:

  1. In the CloudFront console, choose Create Distribution.
  2. On the first page of the Create Distribution Wizard, in the Web section, choose Get Started.
  3. For Origin Domain Name, enter your current API URL (or base URL e.g
  4. Leave Origin Path as it is.
  5. For Origin Protocol Policy, choose HTTPS Only.
    Note: API Gateway does not support unencrypted (HTTP) endpoints. For more information, see Amazon API Gateway FAQs.
  6. To forward custom headers to your origin, enter one or more custom headers for Origin Custom Headers.
    Note: There are several custom headers that CloudFront can’t forward to your origin.
  7. Choose Create Distribution.
  8. Wait for your distribution to deploy. This might take about 30 minutes. When its Status appears as Deployed in the console, the distribution is ready.
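For readers who prefer scripting to the console, the origin settings from the steps above map onto the origin entry of a CloudFront DistributionConfig, the structure that boto3's create_distribution call accepts. The following is a minimal sketch of that one entry only, not a complete distribution config. The origin ID, hostname and header used in the usage line are placeholders, and the helper name is invented for this example.

```python
def build_custom_origin(origin_id, domain_name, custom_headers=None):
    """Build the origin entry for a custom (non-S3) HTTPS origin.

    domain_name is the hostname only (no scheme or path), matching the
    Origin Domain Name field in the console.
    """
    headers = custom_headers or {}
    header_block = {"Quantity": len(headers)}
    if headers:
        # Optional custom headers that CloudFront adds to origin requests.
        header_block["Items"] = [
            {"HeaderName": name, "HeaderValue": value}
            for name, value in headers.items()
        ]
    return {
        "Id": origin_id,
        "DomainName": domain_name,
        "OriginPath": "",  # step 4: left as it is
        "CustomHeaders": header_block,
        "CustomOriginConfig": {
            "HTTPPort": 80,
            "HTTPSPort": 443,
            "OriginProtocolPolicy": "https-only",  # step 5: HTTPS Only
        },
    }


# Placeholder values, for illustration only.
origin = build_custom_origin("api-origin", "api.example.com", {"X-Example": "1"})
```

A full DistributionConfig also needs a caller reference, a default cache behaviour and a few other fields, which are left out here.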

Configure Caching using Behaviours:

Assuming the CloudFront distribution is deployed, it's time to configure behaviours for caching. Continuing the example above, I will configure two APIs, /search/pcp and /search/allergy, to cache content based on our needs. Here are the steps:
  1. Click on the CloudFront distribution.
  2. Go to the Behaviours tab and click Create Behaviour.
  3. Enter the path pattern; for the PCP API it was /search/pcp.
  4. I will skip the other settings, which are self-explanatory, and move to the main ones. In the Object Caching section select Customize, then enter a Minimum TTL of 0 and set both Maximum TTL and Default TTL to 31536000 seconds (one year).
  5. Under Query String Forwarding and Caching, select Forward all, cache based on whitelist. In the text box, enter the query parameters you want CloudFront to cache on (that is, the parameters whose values change the backend API response), one per line. In my case these were Keyword, Limit, Page, PracticeId and Zipcode.
  6. Save the behaviour.
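The same behaviour can be written down as a cache behaviour entry in the shape CloudFront's API uses for these console settings (the ForwardedValues form). This is a sketch under assumptions: the viewer protocol policy and cookie forwarding are not mentioned in the steps above and are chosen here only to make the entry complete, and the origin ID is a placeholder.

```python
ONE_YEAR_SECONDS = 31536000


def build_cache_behavior(path_pattern, origin_id, cache_keys):
    """Build a cache behaviour that forwards all query strings to the
    origin but varies the cached object only on the whitelisted ones."""
    keys = list(cache_keys)
    return {
        "PathPattern": path_pattern,
        "TargetOriginId": origin_id,
        "ViewerProtocolPolicy": "https-only",  # assumption, not from the article
        "MinTTL": 0,
        "DefaultTTL": ONE_YEAR_SECONDS,
        "MaxTTL": ONE_YEAR_SECONDS,
        "ForwardedValues": {
            "QueryString": True,  # forward all query strings
            "Cookies": {"Forward": "none"},  # assumption, not from the article
            # "cache based on whitelist": only these parameters vary the cache
            "QueryStringCacheKeys": {"Quantity": len(keys), "Items": keys},
        },
    }


pcp_behavior = build_cache_behavior(
    "/search/pcp",
    "api-origin",  # placeholder origin ID
    ["Keyword", "Limit", "Page", "PracticeId", "Zipcode"],
)
```

With a Minimum TTL of 0, the backend can still shorten the cache lifetime of individual responses through its own caching headers.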
Once the distribution is ready, copy the CloudFront URL and hit the API. Note the response time, then hit the same API again and compare. You should see a big difference in response time, which means caching is working. Repeat the steps above to add more behaviours.
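The before and after comparison can also be scripted. The sketch below times a request and reads the X-Cache response header, which CloudFront sets to values such as "Miss from cloudfront" or "Hit from cloudfront". The URL in the usage comment is a placeholder and the helper names are invented for this example.

```python
import time
import urllib.request


def fetch_headers(url):
    """Perform a GET request and return the response headers."""
    with urllib.request.urlopen(url) as resp:
        resp.read()  # read the full body so the timing covers the whole response
        return resp.headers


def timed_call(fetch):
    """Run fetch() and return (elapsed milliseconds, X-Cache header value)."""
    start = time.perf_counter()
    headers = fetch()
    elapsed_ms = (time.perf_counter() - start) * 1000
    return elapsed_ms, headers.get("X-Cache", "")


def is_cache_hit(x_cache):
    """True when the response was served straight from the edge cache."""
    return x_cache.lower().startswith("hit")


# Usage (placeholder URL; replace with your own distribution and query):
# url = "https://dxxxxxxxxxxxx.cloudfront.net/search/pcp?Keyword=smith"
# for attempt in (1, 2):
#     ms, x_cache = timed_call(lambda: fetch_headers(url))
#     print(attempt, f"{ms:.0f} ms", x_cache, is_cache_hit(x_cache))
```

The first request would normally report a miss and the second a hit, with a much lower elapsed time.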


  • Handling Selective Caching

One challenge during implementation was that we needed to cache a response only for certain request parameter values. Depending on the request parameter value, our backend hits either our own database or a third-party API, and we don't want to cache results that come from the third party. We handled this by having the API return max-age=1 in the Cache-Control header of those responses: whenever CloudFront sees max-age=1 it caches the response for only 1 second, overriding the default settings. The backend now controls what is cached and when.
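On the backend, that decision can be as small as choosing which caching header to attach to the response. A minimal sketch, assuming the backend already knows whether a request was served from its own database or from the third party; the function name and flag are hypothetical, and the actual backend code is not shown in this post.

```python
def caching_headers(used_third_party: bool) -> dict:
    """Return extra response headers that steer CloudFront's caching.

    Third-party results get max-age=1, so CloudFront keeps them for one
    second only. Responses from our own database carry no override, so
    the TTLs configured on the behaviour apply.
    """
    if used_third_party:
        return {"Cache-Control": "max-age=1"}
    return {}


# Merge into whatever headers the API already returns.
response_headers = {"Content-Type": "application/json"}
response_headers.update(caching_headers(used_third_party=True))
```

This relies on the behaviour's Minimum TTL being 0; with a higher minimum, CloudFront would keep the object for at least that long regardless of the header.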

  • Automating Cache Updates

One requirement was to update the cache whenever the backend database is updated, so we wrote an AWS Lambda function that invalidates specific cache entries using the CloudFront invalidation API. We exposed this Lambda function through Amazon API Gateway so that our backend can call the API and invalidate specific cache entries depending on which database values were updated. I will cover this in an upcoming blog post.
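Until that post is available, here is a rough sketch of what such a function could look like using boto3's create_invalidation call. It is not the code we deployed. The event shape (an API Gateway proxy event with a JSON body containing "paths"), the DISTRIBUTION_ID environment variable and the function names are all assumptions made for this example.

```python
import json
import os
import time


def build_invalidation_batch(paths):
    """Build the InvalidationBatch structure expected by CloudFront."""
    items = list(paths)
    return {
        "Paths": {"Quantity": len(items), "Items": items},
        # Must be unique per request; a nanosecond timestamp is enough here.
        "CallerReference": str(time.time_ns()),
    }


def invalidate_paths(distribution_id, paths, client=None):
    """Ask CloudFront to invalidate the given paths; return the invalidation ID."""
    if client is None:
        import boto3  # imported lazily so the helpers can be tested without AWS

        client = boto3.client("cloudfront")
    response = client.create_invalidation(
        DistributionId=distribution_id,
        InvalidationBatch=build_invalidation_batch(paths),
    )
    return response["Invalidation"]["Id"]


def lambda_handler(event, context, client=None):
    """Entry point: expects a JSON body such as {"paths": ["/search/pcp*"]}."""
    body = json.loads(event.get("body") or "{}")
    paths = body.get("paths", [])
    if not paths:
        return {"statusCode": 400, "body": json.dumps({"error": "no paths given"})}
    invalidation_id = invalidate_paths(os.environ["DISTRIBUTION_ID"], paths, client)
    return {"statusCode": 200, "body": json.dumps({"invalidationId": invalidation_id})}
```

Because the cached objects vary by query string, a trailing wildcard such as /search/pcp* is a convenient way to cover every cached variant of an endpoint in one path.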


The new architecture cut response times to less than half of the original, and the faster responses delighted end users. All of this was done without changing any aspect of the core application and without any downtime.
In the end, the customer was happy to see savings in both time and cost.



About Author

Pravin Uttarwar

Pravin has more than 12 years of experience in the tech industry and is a high-energy individual who loves to use out-of-the-box thinking to solve problems. He not only brings technical expertise to the table but also wears a C-level hat, benefiting any project with cost savings and adding more value to business strategy.