使用docker-compose快速搭建prometheus
docker-compose.yml配置如下
version: '3.7'networks:
monitor:
driver: bridgeservices:
prometheus:
image: prom/prometheus
container_name: prometheus
hostname: prometheus
restart: always
volumes:
- /data/conf/prometheus/prometheus.yml:/etc/prometheus/prometheus.yml
- /data/conf/prometheus/node_down.yml:/etc/prometheus/node_down.yml
ports:
- "9090:9090"
networks:
- monitoralertmanager:
image: prom/alertmanager
container_name: alertmanager
hostname: alertmanager
restart: always
volumes:
- /data/conf/prometheus/alertmanager.yml:/etc/alertmanager/alertmanager.yml
ports:
- "9093:9093"
networks:
- monitorgrafana:
image: grafana/grafana
container_name: grafana
hostname: grafana
restart: always
ports:
- "3000:3000"
networks:
- monitornode-exporter:
image: quay.io/prometheus/node-exporter
container_name: node-exporter
hostname: node-exporter
restart: always
ports:
- "9100:9100"
networks:
- monitorcadvisor:
image: google/cadvisor:latest
container_name: cadvisor
hostname: cadvisor
restart: always
volumes:
- /:/rootfs:ro
- /var/run:/var/run:rw
- /sys:/sys:ro
- /data/docker_containers/:/var/lib/docker:ro
ports:
- "8080:8080"
networks:
- monitor
特别注意下面这个映射关系,/data/docker_containers/这个目录,是我修改了的docker存储目录
/data/docker_containers/:/var/lib/docker:ro
centos7在/usr/lib/systemd/system/docker.service文件里
ExecStart=/usr/bin/dockerd --graph=/data/docker_containers -H fd:// --containerd=/run/containerd/containerd.sock
prometheus.yml文件内容:
# my global config
global:
scrape_interval:15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
# scrape_timeout is set to the global default (10s).# Alertmanager configuration
alerting:
alertmanagers:
- static_configs:
- targets: ['172.1.5.220:9093']
# - alertmanager:9093# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.
rule_files:
- "node_down.yml"
# - "first_rules.yml"
# - "second_rules.yml"# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
# The job name is added as a label `job=` to any timeseries scraped from this config.
- job_name: 'prometheus'
static_configs:
- targets: ['172.1.5.220:9090']- job_name: 'cadvisor'
static_configs:
- targets: ['172.1.5.220:8080']- job_name: 'node'
scrape_interval: 8s
static_configs:
- targets: ['172.1.5.220:9100']
node_down.yml文件内容:
groups:
- name: node_down
rules:
- alert: InstanceDown
expr: up == 0
for: 1m
labels:
user: test
annotations:
summary: "Instance {{ $labels.instance }} down"
description: "{{ $labels.instance }} of job {{ $labels.job }} has been down for more than 1 minutes."
alertmanager.yml文件内容,即配置自己的邮件服务器,和接收人邮箱,各位对应修改即可
global:
smtp_smarthost: 'smtp.xxx.com:25'
smtp_from: 'user@xxx.com'
smtp_auth_username: 'user@xxx.com'
smtp_auth_password: 'pwd123456'
smtp_require_tls: falseroute:
group_by: ['alertname']
group_wait: 10s
group_interval: 10s
repeat_interval: 10m
receiver: live-monitoringreceivers:
- name: 'live-monitoring'
email_configs:
- to: 'xxx@qq.com'
把容器跑起来后,通过http://172.1.5.220:9090/targets
即可查看状态
文章图片
状态图
从图中可以看出当前为UP状态(健康状态),一开始,全是DOWN状态的,跟防火墙设置有关
[root@172-1-5-220 ~]# setenforce 0
[root@172-1-5-220 ~]# firewall-cmd --zone=public --list-ports[root@172-1-5-220 ~]# firewall-cmd --zone=public --add-port=9100/tcp --permanent
success
[root@172-1-5-220 ~]# firewall-cmd --zone=public --add-port=9090/tcp --permanent
success
[root@172-1-5-220 ~]# firewall-cmd --zone=public --add-port=9093/tcp --permanent
success
[root@172-1-5-220 ~]# firewall-cmd --zone=public --add-port=3000/tcp --permanent
success
[root@172-1-5-220 ~]# firewall-cmd --zone=public --add-port=8080/tcp --permanent
success
[root@172-1-5-220 ~]# firewall-cmd --reload
success
文章图片
防火墙操作过程 接下来,登录Grafana,添加prometheus数据源
文章图片
添加Prometheus数据源
文章图片
添加Prometheus数据源
文章图片
导入看板
导入看板后,我们点击左上角Granfana图标,再点击Home,再选择我们导入的Prometheus 2.0 Stats就可以看到效果了
文章图片
选择我们导入的看板
文章图片
整体效果 我们来试一下另一个更好看点的看板
文章图片
选择导入
文章图片
输入看板的id号
文章图片
导入看板
文章图片
新的看板 【使用docker-compose快速搭建prometheus】简单的安装到此为止,有了环境,接下来就可以好好学习Prometheus了,哈哈
推荐阅读
- 由浅入深理解AOP
- 【译】20个更有效地使用谷歌搜索的技巧
- mybatisplus如何在xml的连表查询中使用queryWrapper
- MybatisPlus|MybatisPlus LambdaQueryWrapper使用int默认值的坑及解决
- MybatisPlus使用queryWrapper如何实现复杂查询
- iOS中的Block
- Linux下面如何查看tomcat已经使用多少线程
- 使用composer自动加载类文件
- android|android studio中ndk的使用
- 使用协程爬取网页,计算网页数据大小