Packt+ | Advance your knowledge in tech

You're reading from The DevOps 2.2 Toolkit Self-Sufficient Docker Clusters

Product type Paperback

Published in Mar 2018

Publisher Packt

ISBN-13 9781788991278

Length 360 pages

Edition 1st Edition

Tools

Docker

Concepts

DevOps

Author (1):

Viktor Farcic

View More author details

Table of Contents (23) Chapters

Title Page

Dedication

Contributor

Packt Upsell

Preface

1. Introduction to Self-Adapting and Self-Healing Systems FREE CHAPTER

2. Choosing a Solution for Metrics Storage and Query

3. Deploying and Configuring Prometheus

4. Scraping Metrics

5. Defining Cluster-Wide Alerts

6. Alerting Humans

7. Alerting the System

The four quadrants of a dynamic and self-sufficient system

8. Self-Healing Applied to Services

9. Self-Adaptation Applied to Services

10. Painting the Big Picture – The Self-Sufficient System Thus Far

11. Instrumenting Services

12. Self-Adaptation Applied to Instrumented Services

13. Setting Up a Production Cluster

14. Self-Healing Applied to Infrastructure

15. Self-Adaptation Applied to Infrastructure

16. Blueprint of a Self-Sufficient System

1. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Exploring fault tolerance

Since we are exploring self-healing (not self-adaptation), there's no need to deploy all the stacks we used thus far. A single service will be enough to explore what happens when a node goes down. Our cluster, formed out of t2.micro instances, would not support much more anyways.

docker service create --name test \  
    --replicas 10 alpine sleep 1000000

We created a service with ten replicas. Let's confirm that they are spread across the three nodes of the cluster:

docker service ps test

The output is as follows (IDs are removed for brevity):

Let's exit the cluster before we move onto a discussion how to simulate a failure of a node.

exit

We'll simulate failure of an instance by terminating it. We'll do that by executing aws ec2 terminate-instances command that requires --instance-ids argument. So, the first line of business is to figure out how to find ID of one of the nodes.

We already saw that we could use aws ec2 describe-instances command to get information about...