Don't be fooled!
There is a fundamental part of the Patroni architecture that is often grossly overlooked or misunderstood: the role and sizing of the Distributed Consensus Store (DCS). In the context of Patroni, this is typically etcd.
Patroni uses etcd to elect the primary, register cluster members, and ensure that only one node believes it is the leader at any given time. etcd can only provide these guarantees while a majority of its members agree, a concept known as a quorum. If you come from the mindset that the number of etcd nodes you need is simply the number of Postgres nodes divided by two, plus one, you have been profoundly misled.
etcd nodes = ( postgres nodes / 2 ) + 1
This rule of thumb is a common source of confusion and instability! The size of your etcd cluster is independent of your Postgres node count and is governed only by the need to maintain a reliable quorum for etcd itself.
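To see how arbitrary the rule is, here is a minimal Python sketch of it (the function name is mine, purely for illustration). Notice that it even produces even-sized etcd clusters of 4 and 6 nodes, which tolerate no more failures than clusters of 3 and 5 would:

    import math

    def misguided_etcd_count(postgres_nodes: int) -> int:
        """The flawed rule: derive the etcd size from the Postgres node count."""
        return math.ceil(postgres_nodes / 2) + 1

    for pg in (2, 3, 5, 9):
        # Prints 2 -> 2, 3 -> 3, 5 -> 4, 9 -> 6; the 4- and 6-node results
        # waste a node, since an even member adds no extra fault tolerance.
        print(f"{pg} Postgres nodes -> {misguided_etcd_count(pg)} etcd nodes")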
Understanding why, and how to size your etcd cluster correctly, is essential for true high availability. Read on for the proper methodology.
Why Your Patroni Node Count Doesn't Determine Your etcd Quorum
The core misunderstanding is failing to distinguish between the Patroni cluster (your data layer) and the etcd cluster (your consensus layer).
Patroni/Postgres Nodes (Data): these hold your actual databases. Their count is driven by your replication, read-scaling, and recovery requirements.
etcd Nodes (Consensus): these hold the cluster state and the leader lock. Their count is driven solely by how many simultaneous etcd failures you need to tolerate.
The availability of your Patroni cluster relies entirely on the availability of its etcd quorum. If the etcd cluster loses its quorum, Patroni cannot safely elect a new primary or switch roles, even if the underlying Postgres data nodes are healthy.
On a side note, this heavy dependency on etcd, and on the Patroni layer managing Postgres, is why I favor pgPool in some cases.
The Correct etcd Quorum Sizing Rule
The sizing of the etcd cluster is based on the concept of fault tolerance, defined by the number of simultaneous etcd node failures you want to survive.
Let's take the common misconception with a scenario where you have a Postgres cluster of 3 database servers managed by Patroni. Most likely, you placed the etcd service on each of the Postgres database servers, and you probably think that you just need 3 etcd nodes. Why not use the Postgres servers to host them? After all, the etcd footprint is fairly light. No big deal.
ceiling of ( 3 / 2 ) + 1 = 2 + 1 = 3
Well, if more than one of your Postgres servers were to go down, you would be in a crisis trying to figure out why the last database server of the 3 has stopped accepting writes: with etcd quorum gone, Patroni cannot renew the leader lock and demotes the surviving node.
The fact is, you have to take into account how many etcd node failures you are willing to tolerate in order to do a proper calculation.
If you have etcd running on the 3 database servers, and 2 of the database servers go down, you have just lost 2 of your etcd nodes, leaving you with just 1. Well, 1 out of 3 won't cut it for a quorum, which requires a majority of 2.
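The arithmetic behind that failure is worth a quick sketch. Quorum for an etcd cluster of N members is the majority, floor(N / 2) + 1, which is the check etcd performs internally; the variable names below are mine, purely for illustration:

    def quorum(cluster_size: int) -> int:
        """Majority etcd requires to keep serving: floor(N / 2) + 1."""
        return cluster_size // 2 + 1

    etcd_nodes = 3
    failures = 2
    survivors = etcd_nodes - failures  # 1 node left standing

    print(f"quorum needed: {quorum(etcd_nodes)}")  # 2
    print(f"survivors:     {survivors}")           # 1
    # With 1 survivor against a required majority of 2, writes stop.
    print("quorum held" if survivors >= quorum(etcd_nodes) else "quorum LOST")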
To survive 2 failures, you need to have a system where the remaining nodes can still form a majority.
The correct formula for the number of etcd nodes (N) needed to tolerate a given number of simultaneous failures (F) is as follows:
N = ( 2 * F ) + 1
If F = 2 (two failures), then N = (2 * 2) + 1 = 5
You need enough nodes in your etcd cluster so that even when two are taken away, the survivors still meet the minimum quorum.
Let's break it down. With N = 5, the quorum is floor(5 / 2) + 1 = 3. Losing 2 nodes leaves 3 survivors, which still meets quorum, so etcd keeps serving and Patroni can still fail over. The 3-node cluster in the same situation is left with 1 survivor against a quorum of 2, and everything stops.
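Here is a minimal sketch tying both rules together (helper names are mine); for every tolerance level, the survivors still meet quorum:

    def nodes_needed(failures: int) -> int:
        """N = (2 * F) + 1: the smallest cluster that tolerates F failures."""
        return 2 * failures + 1

    def quorum(cluster_size: int) -> int:
        """Majority etcd requires to keep serving: floor(N / 2) + 1."""
        return cluster_size // 2 + 1

    for f in (1, 2, 3):
        n = nodes_needed(f)
        # tolerate 1 -> 3 nodes, tolerate 2 -> 5 nodes, tolerate 3 -> 7 nodes
        print(f"tolerate {f} failure(s): {n} etcd nodes, "
              f"quorum {quorum(n)}, survivors after {f} failures: {n - f}")

Note that the result is always an odd number; adding an even member raises the quorum without raising the fault tolerance, which is why 5 independent etcd nodes is the standard answer for surviving 2 failures.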