
What is Canary Deployment? When and How To Use It

Geshan Manandhar

What is canary deployment?

Canary deployment is a software deployment technique where a new feature or version is released to a small subset of users in production before it is released to a larger subset or to all users. It’s also sometimes called a phased rollout or incremental release. By design, it reduces risk by exposing new features only to defined subsets of users and gradually ramping up from there. In addition to reducing the risk of accidentally releasing buggy code, it provides a path to test a new version of a feature in production and see how users respond. In this post, we will discuss when and how to use canary deployment, its benefits, and its relationship with feature flags. Let’s get rolling!

When to use a canary release

Until the 1980s, coal miners in the UK, Australia, Canada, and the US used canaries as an early warning system for harmful gases like carbon monoxide and methane. These birds would show visible distress in the presence of gas, alerting the miners of danger before they could recognise it themselves.

Today, no canaries are harmed and canary releases help engineers safely release new features and updates. As teams grow more wary of “big bang” releases, for good reason, canary deployments have been popularised and have many use cases.

In a canary deployment, the early sub-segment of users that you expose to a new feature can give you a warning if something isn’t right. That canary sub-segment can be, say, 1% of your customer base chosen randomly. The subset could also be a segment of your users who have self-identified as Beta testers or could follow some other logic that you apply. The idea is that you will expose the new version to chosen users before ramping up to a larger section of your whole user base. If everything goes well with the initial deployment, the percentage of users exposed to the feature can be increased gradually, all while monitoring the logs, errors, and overall health of the software.
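To make the idea of a randomly chosen 1% concrete, here is a minimal sketch of one common way to pick a stable canary group: hash each user ID into a value between 0 and 100 and compare it with the rollout percentage. The function names, the salt, and the beta tester set are assumptions made for illustration; this is not how any particular feature flag product is implemented.

```python
import hashlib


def bucket_for(user_id: str, salt: str = "new-checkout") -> float:
    """Map a user to a stable value in [0, 100) by hashing the salt and user ID."""
    digest = hashlib.sha256(f"{salt}:{user_id}".encode("utf-8")).hexdigest()
    # Take the first 8 hex characters (32 bits) and scale them to a percentage.
    return int(digest[:8], 16) / 0x100000000 * 100


def in_canary(user_id: str, percent: float,
              beta_testers: frozenset = frozenset()) -> bool:
    """True for self-identified beta testers and for users inside the rollout percentage."""
    if user_id in beta_testers:
        return True
    return bucket_for(user_id) < percent
```

Because the bucket depends only on the salt and the user ID, a user who is in the canary at 1% stays in it when the percentage is raised to 5%, so nobody flips back and forth between versions during the ramp-up.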

As for when to use a canary deployment, we see it as a must for any change in the critical path. It can also be used for non-critical features or to A/B test a new idea as an experiment. The biggest benefit of canary releases is realised when the stakes are high, as in the example below.

Example of Payment gateway for an e-commerce company

Let’s say an e-commerce company is using Braintree for processing all its payments. For flexible payment options and to integrate with other payment providers, the company decides to switch to Stripe as the payment gateway. To make this switch, our hypothetical company should use a canary release to minimise the risk. Bear in mind that most, if not all, of the income for an e-commerce company comes from the checkout and collecting money from customers at checkout. It’s the most vital part of the critical path, and any glitch during checkout or at the payment gateway level is a real disaster for this e-commerce venture.

Let’s suppose the company has thousands, if not millions, of customers and thousands of orders a day. Once the new payment gateway integration has been developed to a production-ready state, the first release should go to the team responsible for its development.

A canary release will allow the code to be deployed to production, but it will be released only to the team that developed the Stripe integration. This separates deployment from release and frees the team to test in production. Of course, at this point, and for the foreseeable future, both the Braintree and Stripe gateways will work on the checkout page. After this first release, and once any bugs have been ironed out, the next stage could be to release it to all the staff of the company, identified by their email address or the internal IP of the office.
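The first two stages described here can be sketched as a simple targeting check. Everything in this example is hypothetical: the email addresses, the staff domain, the office network range, and the stage names are placeholders chosen for illustration.

```python
import ipaddress
from dataclasses import dataclass

# Hypothetical values for illustration only.
DEV_TEAM = {"alice@example.com", "bob@example.com"}
STAFF_DOMAIN = "@example.com"
OFFICE_NETWORK = ipaddress.ip_network("10.20.0.0/16")


@dataclass(frozen=True)
class Visitor:
    email: str
    ip: str


def sees_stripe(visitor: Visitor, stage: str) -> bool:
    """Decide whether a visitor gets the new gateway at a given release stage."""
    if visitor.email in DEV_TEAM:
        return True  # the developing team is included at every stage
    if stage == "dev-team":
        return False
    if stage == "staff":
        is_staff_email = visitor.email.endswith(STAFF_DOMAIN)
        is_office_ip = ipaddress.ip_address(visitor.ip) in OFFICE_NETWORK
        return is_staff_email or is_office_ip
    raise ValueError(f"unknown stage: {stage}")
```

Everyone who does not match the current stage keeps paying through Braintree, which is what keeps the blast radius of an early bug limited to people inside the company.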

From 1% to 5% then eventually to 100% customers using Stripe

If the second release is successful, the next step can be to release the new payment gateway to 1% of the customer base and see how this works out. At this stage, 1% of the customers will use Stripe as the payment gateway and 99% will still be using the previous integration with Braintree. Gradually the balance can be shifted to 50/50, where 50% of the customers use Stripe and the other half use Braintree.

Similarly, any bugs can be fixed and will still only affect the users using Stripe. Depending on the company’s appetite for risk, as well as the reasons for switching to a new payment gateway, this may take days, weeks, or months. Eventually, 100% of the traffic/customers will be routed to the Stripe payment gateway and after weeks/months of inactivity, the Braintree integration can be removed from the code. At this point, the Stripe integration will be generally bug-free and ready for prime time even from an infrastructure point of view.
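The gradual shift in traffic can be thought of as a schedule with a health check between steps. The sketch below is one possible way to express that; the step values, the failure-rate threshold, and the minimum sample size are assumptions that each team would set according to its own appetite for risk.

```python
RAMP_STEPS = [1, 5, 25, 50, 100]  # percent of customers on Stripe; hypothetical schedule


def next_percentage(current: int, failed_payments: int, total_payments: int,
                    max_failure_rate: float = 0.02, min_sample: int = 200) -> int:
    """Return the rollout percentage to use next, given what was observed at `current`."""
    if total_payments < min_sample:
        return current  # not enough data yet, so hold at the current step
    if failed_payments / total_payments > max_failure_rate:
        return 0  # pull customers off the new gateway while the bug is fixed
    later_steps = [step for step in RAMP_STEPS if step > current]
    return later_steps[0] if later_steps else current
```

For example, `next_percentage(1, failed_payments=1, total_payments=500)` moves the rollout on to 5%, while a 10% failure rate at any step sends it back to 0%.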

The above is just one example, but canary releases can be used in a multitude of other scenarios.

For instance, Facebook uses canary deployment for mobile app releases. Recently, when we started seeing time insights on Google Calendar to show the breakdown of time spent on meetings, some of my friends saw the feature a week or so ahead of me. As you can see, even billion-dollar companies utilise canary deployments to release at the scale of millions of users if not billions of users.

Regardless of the scale, the idea of canary deployment is very powerful. Even if you have hundreds or thousands of users, testing out new features in production with a small set of real users can be very beneficial. In the next section, we’ll look at some of the drawbacks of canary deployments.

Downsides of canary deployments—and how to resolve them with feature flags

Canary deployments, despite their many virtues, can introduce complexity to your setup. Luckily, this complexity is easily mitigated by running canary deployments with feature flags (and feature flag software). Let’s look at a few of the potential drawbacks.

Additional infrastructure & increased complexity

Canary deployments usually necessitate additional infrastructure to manage the code and can introduce more complexity to your codebase as well. But this infrastructure is provided (and made simple) with feature flag tools like Flagsmith. You can run a canary deployment in a straightforward UI and change things like user percentages without needing to redeploy.

Management overhead and technical team overhead

Canary deployments generally require more management, particularly by engineering teams who might need to set up the deployment for product teams.

With a feature flag tool, the deployment becomes simpler to manage and can be managed by non-technical teams and product managers without needing engineering support. Essentially, the person managing the canary deployment doesn’t need to also manage the code.

Benefits of canary deployment

Without a doubt, canary deployment minimises the risk of releasing a new version of your software. It has other benefits too. Some notable ones are:

Capacity testing

We can generally predict the resources needed for a new system and estimate the range of traffic it will get. This is important when we want to introduce a new microservice that replaces an existing older system. With a canary release, we can divert 1% of the production traffic, for example, and test our assumptions about the resources the service will need. Any performance issue or bottleneck can be identified earlier and therefore solved faster. With the safety valve of a canary release, we can turn the traffic on the new system back to 0% while issues are being fixed.
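As a rough illustration of testing a capacity assumption, the sketch below extrapolates the CPU measured on the canary to full traffic. It assumes resource usage scales linearly with traffic, which often does not hold in practice, and the figures and the 30% headroom are made up for the example.

```python
import math


def projected_instances(canary_cpu_cores: float, canary_percent: float,
                        cores_per_instance: float, headroom: float = 0.3) -> int:
    """Estimate the instances needed at 100% traffic from a canary measurement.

    Assumes CPU usage grows linearly with traffic; treat the result as a
    starting point to verify at each ramp-up step, not as a guarantee.
    """
    full_traffic_cores = canary_cpu_cores / (canary_percent / 100)
    return math.ceil(full_traffic_cores * (1 + headroom) / cores_per_instance)


# 0.6 cores used at 1% of traffic, on 4-core instances
print(projected_instances(0.6, canary_percent=1, cores_per_instance=4))  # prints 20
```

If the number measured at 5% or 25% drifts away from this projection, that is exactly the kind of bottleneck the canary is meant to surface early.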

Early feedback

Even though we thoroughly test new systems or features in staging, it cannot be said with certainty how the feature will behave under the full production traffic load. There might be edge cases that are only discovered when the feature is used in a production environment. This is where a canary release helps: we get critical early feedback from a very small subset of the production traffic without affecting the rest. With early feedback, we can change the feature if need be and make it even better for the next set of users who will get it.

Easy rollback

Canary deployment is also about having a safety net to roll back to a working system while the new feature or version is being introduced. In the payment gateway example, let’s say we saw a major bug in the Stripe implementation after releasing the new gateway to 1% of the traffic. The rollback would be very easy: we simply update the feature flag so that, instead of 1% of customers, only the segment of software engineers who developed the feature sees it. This ease and high level of control over a new feature or version of the software is the power of canary releases.

A/B testing

A/B testing, though only recently widely adopted in software, is a roughly 100-year-old method for comparing two versions of something to find out which one performs better. In software teams utilising canary deployment, we naturally serve two different versions of the software to users as we introduce the new or updated feature. This gives us an opportunity to review users’ reactions to the update and learn whether it works better. Generally speaking, A/B testing is done for a specific period of time with a control group and a variation group, and the data collected decides which version to choose. Canary releases are a great way to incorporate A/B testing into your releases and get the most out of them.
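One common way to decide between the control and the variation is a two-proportion z-test on conversion rates. The sketch below uses only the standard library; the choice of this particular test and the sample numbers are our assumptions, and a real experiment should also fix its duration and sample size in advance.

```python
import math


def conversion_z_score(control_conversions: int, control_users: int,
                       variant_conversions: int, variant_users: int) -> float:
    """Two-proportion z-score; positive values mean the variant converts better.

    Not defined when nobody or everybody converted (the standard error is zero).
    """
    p_control = control_conversions / control_users
    p_variant = variant_conversions / variant_users
    pooled = (control_conversions + variant_conversions) / (control_users + variant_users)
    std_error = math.sqrt(pooled * (1 - pooled) * (1 / control_users + 1 / variant_users))
    return (p_variant - p_control) / std_error


# Hypothetical numbers: 10.0% conversion on the old version, 12.0% on the new one.
z = conversion_z_score(1000, 10_000, 1200, 10_000)
print("significant at the 5% level" if abs(z) > 1.96 else "no clear winner yet")
```

An absolute z-score above 1.96 corresponds to the conventional 5% significance level for a two-sided test.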

There are other benefits but the above-mentioned are the main ones. Now that you know all the reasons to use a canary deployment, let’s discuss how to put the concept into practice.

How to use a canary deployment

Without delving too deep into the technical details, canary releases can mainly be done in two ways. Below, we discuss both ways to enable canary deployment—first from the infrastructure level and second from the code level.

Canary deployments from the infrastructure level

The first way to manage a canary deployment is to control the traffic flow to the new version or feature at the infrastructure level. One product that lets you do this without going into the nitty-gritty of load balancers or service meshes is Google Cloud Run. It gives the user the ability to route x% of traffic to a newly deployed version. It also allows splitting traffic with tags for multiple versions, truly handling canary releases at the infrastructure level without the hassle of understanding the details underneath. AWS Elastic Beanstalk also supports traffic splitting, as detailed in its docs.

Obviously, unless we have a great DevOps/SRE team that enables us to do this with our existing infrastructure, this is more of a pipe dream than a reality. We will actually need to dive deep into load balancing and blue-green deployment, rolling deployment, and the like to get this into practice. This brings us to the next way to do it, from pure code and feature flags.

Canary releases enabled by code and feature flags

The second way to enable canary deployment and releases is to have the feature code always deployed and control the reach of that new feature code to a certain set of users with code and conditions. This is exactly where feature flags shine. Using feature flags, we don’t need a large SRE team to enable us to do canary or phased rollouts. Software engineers can do it themselves with the clever use of feature flags and a feature flag platform like Flagsmith. Combining the release with segments and identities adds that needed level of options and flexibility for canary releases.

With optimal use of feature flags or multivariate flags, we can send 5% of the customers to Stripe and 95% to Braintree as we saw in the above example. Below is a sample screenshot of an example multivariate flag of the case with 5% Stripe and 95% Braintree usage:

Furthermore, the change in the canary size and who can make the change becomes ultra-easy. With feature flags, it is simply going to an interface and changing the values from 1 and 99 to 5 and 95 and now 5% of the traffic can see the new version or feature, as shown above. This particular change can be done by someone nontechnical like a product manager. This is much easier compared to the earlier discussed infrastructure option where each change in the canary size might need a new deployment or change to the infrastructure, which is far riskier.
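To show what a percentage-weighted multivariate flag does conceptually, here is a minimal sketch that assigns each user to a payment gateway according to the configured weights. It illustrates the general technique only and is not Flagsmith's implementation; the function name, the salt, and the variant names are assumptions.

```python
import hashlib


def pick_variant(user_id: str, weights: dict[str, int],
                 salt: str = "payment_gateway") -> str:
    """Assign a user to a variant; `weights` are whole percentages adding up to 100."""
    if sum(weights.values()) != 100:
        raise ValueError("weights must add up to 100")
    digest = hashlib.sha256(f"{salt}:{user_id}".encode("utf-8")).hexdigest()
    bucket = int(digest[:8], 16) % 100  # stable integer from 0 to 99 per user
    upper_bound = 0
    for variant, weight in weights.items():
        upper_bound += weight
        if bucket < upper_bound:
            return variant
    raise AssertionError("unreachable when weights add up to 100")


# Changing the canary size is a data change, not a code change or a deployment.
print(pick_variant("customer-42", {"stripe": 5, "braintree": 95}))
```

Since the weights are plain data, moving from 1 and 99 to 5 and 95 only changes the dictionary passed in. As long as the variant order stays the same, customers already on Stripe at 1% remain on Stripe at 5%.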

As with other things in software engineering, it is always good to follow best practices for feature flags. Depending on your language and framework of choice, you can try out feature flags on Flagsmith with Node.js, React.js, Flutter, and even iOS. Please check out all of our SDKs on our GitHub profile.

Conclusion

Canary deployments and releases provide us with fine-grained control over which users we release a new feature or version to. They reduce the probability of a potentially buggy feature reaching your whole customer base and add confidence for your business, as the new feature is first made available to only a small subset of users. Utilise the safety chute provided by canary releases and be confident in releasing experiments, and even not-fully-baked features, to a very small percentage of users to get early and valuable feedback.

Canary releases can help you release safely and with more agility, letting you collect valuable real-world feedback before making any big changes. Always release software responsibly with user impact in mind!

About the author

Product Minded Engineer | Blogger | Speaker
