You're reading from The DevOps 2.2 Toolkit Self-Sufficient Docker Clusters

Product type Paperback

Published in Mar 2018

Publisher Packt

ISBN-13 9781788991278

Length 360 pages

Edition 1st Edition

Tools

Docker

Concepts

DevOps

Author (1):

Viktor Farcic

Table of Contents (23) Chapters

Title Page

Dedication

Contributor

Packt Upsell

Preface

1. Introduction to Self-Adapting and Self-Healing Systems

2. Choosing a Solution for Metrics Storage and Query

3. Deploying and Configuring Prometheus

4. Scraping Metrics

5. Defining Cluster-Wide Alerts

6. Alerting Humans

7. Alerting the System

The four quadrants of a dynamic and self-sufficient system

8. Self-Healing Applied to Services

9. Self-Adaptation Applied to Services

10. Painting the Big Picture – The Self-Sufficient System Thus Far

11. Instrumenting Services

12. Self-Adaptation Applied to Instrumented Services

13. Setting Up a Production Cluster

14. Self-Healing Applied to Infrastructure

15. Self-Adaptation Applied to Infrastructure

16. Blueprint of a Self-Sufficient System

1. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Chapter 15. Self-Adaptation Applied to Infrastructure

Our goal is within reach. We adopted schedulers (Docker Swarm in this case) that provide self-healing applied to services. We saw how Docker for AWS accomplishes a similar goal at the infrastructure level. We used Prometheus, Alertmanager, and Jenkins to build a system that automatically adapts services to ever-changing conditions. The metrics we're storing in Prometheus are a combination of those gathered through exporters and those we added to our services through instrumentation. The only thing we're missing is self-adaptation applied to infrastructure. If we manage to build it, we'll close the circle and witness a self-sufficient system capable of running with (almost) no human intervention.

The logic behind self-adaptation applied to infrastructure is not much different from the one we used with services. We need metrics, alerts, and scripts that will adapt cluster capacity whenever conditions change.
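To make the "metrics, alerts, and scripts" loop a bit more concrete, here is a minimal sketch of the decision such a script might make once an alert fires. It is not the implementation used later in the chapter. The `ScalingPolicy` class, the `desired_node_count` function, the thresholds, and the node limits are all hypothetical, and the sketch assumes the metric is the fraction of cluster memory in use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalingPolicy:
    # All values below are hypothetical and chosen only for illustration.
    scale_up_threshold: float = 0.8    # memory utilization above which a node is added
    scale_down_threshold: float = 0.4  # memory utilization below which a node is removed
    min_nodes: int = 3                 # never shrink the cluster below this size
    max_nodes: int = 10                # never grow the cluster above this size


def desired_node_count(current_nodes: int,
                       memory_utilization: float,
                       policy: ScalingPolicy = ScalingPolicy()) -> int:
    """Return the node count the cluster should have after one adaptation step.

    memory_utilization is assumed to be used/total memory across the cluster,
    expressed as a value between 0.0 and 1.0.
    """
    if memory_utilization > policy.scale_up_threshold:
        target = current_nodes + 1
    elif memory_utilization < policy.scale_down_threshold:
        target = current_nodes - 1
    else:
        target = current_nodes
    # Clamp the result so the cluster always stays within the allowed range.
    return max(policy.min_nodes, min(policy.max_nodes, target))
```

In a real setup, the metric would come from a monitoring query, the comparison against thresholds would live in an alert definition, and only the final step (changing the number of nodes) would be left to a script. Keeping the decision in one small, pure function makes it easy to test before wiring it to anything that can change infrastructure.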

We already have all the...