the swarm is hung while nodes try to contact one another at their old IP addresses.
Dynamic IP addresses are OK for worker nodes.
## Add manager nodes for fault tolerance
You should maintain an odd number of managers in the swarm to support manager
node failures. Having an odd number of managers means that if the network is
partitioned into two sets, there is a higher chance that the quorum of managers
remains available to process requests. Keeping the quorum is not guaranteed if
the network is split into more than two partitions.
| Swarm Size | Majority | Fault Tolerance |
|:------------:|:----------:|:-----------------:|
| 1 | 1 | 0 |
| 2 | 2 | 0 |
| **3** | 2 | **1** |
| 4 | 3 | 1 |
| **5** | 3 | **2** |
| 6 | 4 | 2 |
| **7** | 4 | **3** |
| 8 | 5 | 3 |
| **9** | 5 | **4** |
For example, in a swarm with *5 nodes*, if you lose *3 nodes*, you don't have a
quorum. Therefore you can't add or remove nodes until you recover one of the
unavailable manager nodes or recover the swarm with disaster recovery
commands. See [Recover from disaster](#recover-from-disaster).
While it is possible to scale a swarm down to a single manager node, it is
impossible to demote the last manager node. This ensures you maintain access to
the swarm and that the swarm can still process requests. Scaling down to a
single manager is an unsafe operation and is not recommended. If
the last node leaves the swarm unexpectedly during the demote operation, the
swarm becomes unavailable until you reboot the node or restart with
`--force-new-cluster`.
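If you do end up in that state, a minimal sketch of restarting with
`--force-new-cluster` looks like the following, run on the affected node
(`<NODE-IP>` is a placeholder for that node's address):

```console
$ docker swarm init --force-new-cluster --advertise-addr <NODE-IP>
```

This re-initializes a single-manager swarm from the existing state. See
[Recover from disaster](#recover-from-disaster) for details.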
You manage swarm membership with the `docker swarm` and `docker node`
subsystems. Refer to [Add nodes to a swarm](join-nodes.md) for more information
on how to add worker nodes and promote a worker node to be a manager.
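For example, you can promote an existing worker to a manager, or print the join
token that new managers need, with the following commands (`<NODE>` is a
placeholder for the hostname or ID of a node already in the swarm):

```console
$ docker node promote <NODE>
$ docker swarm join-token manager
```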
### Distribute manager nodes
In addition to maintaining an odd number of manager nodes, pay attention to
datacenter topology when placing managers. For optimal fault tolerance, distribute
manager nodes across a minimum of 3 availability zones to support failures of an
entire set of machines or common maintenance scenarios. If you suffer a failure
in any of those zones, the swarm should maintain the quorum of manager nodes
available to process requests and rebalance workloads.
| Swarm manager nodes | Distribution across 3 availability zones |
|:-------------------:|:--------------------------------------:|
| 3 | 1-1-1 |
| 5 | 2-2-1 |
| 7 | 3-2-2 |
| 9 | 3-3-3 |
### Run manager-only nodes
By default, manager nodes also act as worker nodes. This means the scheduler
can assign tasks to a manager node. For small and non-critical swarms,
assigning tasks to managers is relatively low-risk as long as you schedule
services using resource constraints for CPU and memory.
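For example, a service can be created with explicit reservations and limits so
that tasks scheduled on a manager cannot consume all of its resources (the
service name, image, and values here are illustrative):

```console
$ docker service create \
  --name my-service \
  --reserve-cpu 0.5 --limit-cpu 1 \
  --reserve-memory 256M --limit-memory 512M \
  nginx:alpine
```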
However, because manager nodes use the Raft consensus algorithm to replicate data
in a consistent way, they are sensitive to resource starvation. You should
isolate managers in your swarm from processes that might block swarm
operations like swarm heartbeat or leader elections.
To avoid interference with manager node operation, you can drain manager nodes
to make them unavailable as worker nodes:
```console
$ docker node update --availability drain <NODE>
```
When you drain a node, the scheduler reassigns any tasks running on the node to
other available worker nodes in the swarm. It also prevents the scheduler from
assigning tasks to the node.
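When the node is ready to accept tasks again, return it to an active state:

```console
$ docker node update --availability active <NODE>
```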
## Add worker nodes for load balancing
[Add nodes to the swarm](join-nodes.md) to balance your swarm's