Difference between revisions of "Monitoring"

From UVOO Tech Wiki
Jump to navigation Jump to search
 
(One intermediate revision by the same user not shown)
Line 2: Line 2:
 
- https://en.wikipedia.org/wiki/Zenoss_Core
 
- https://en.wikipedia.org/wiki/Zenoss_Core
 
- https://en.wikipedia.org/wiki/OpenNMS
 
- https://en.wikipedia.org/wiki/OpenNMS
 +
- https://www.manageengine.com/network-monitoring/
  
 
For Axia Codex monitoring we chose Prometheus. We chose it because
 
For Axia Codex monitoring we chose Prometheus. We chose it because
Line 27: Line 28:
  
 
```
 
```
 
 
[Unit]
 
[Unit]
 
Description=Prometheus collected
 
Description=Prometheus collected

Latest revision as of 20:43, 15 September 2020

Solutions

For Axia Codex monitoring we chose Prometheus. We chose it because it was a simple standalone application and had libraries that integrated with node for recording custom stats.

https://prometheus.io/

Node.js client:

https://github.com/siimon/prom-client

Once you get it setup you can add alerting. You need to setup the alert manager program

https://daenney.github.io/2018/04/21/setting-up-alertmanager

Then add a receiver, such as an email server for sending alerts. Here is the telegram bot we used:

https://github.com/inCaller/prometheus_bot

All 3 of those services should be installed with systemd.

[Unit]
Description=Prometheus collected
After=network.target

[Service]
Type=simple
User=prometheus
Restart=on-failure
WorkingDirectory=/opt/prometheus-2.8.1.linux-amd64/
ExecStart=/opt/prometheus-2.8.1.linux-amd64/prometheus --config.file=config.yml --storage.tsdb.path=/home/prometheus/data2

[Install]
WantedBy=multi-user.target
[Unit]
Description=Prometheus Alert Manager
After=network.target

[Service]
Type=simple
User=prometheus
Restart=on-failure
WorkingDirectory=/opt/alertmanager-0.16.2.linux-amd64
ExecStart=/opt/alertmanager-0.16.2.linux-amd64/alertmanager --config.file=config.yaml --storage.path=/home/prometheus/alert-data

[Install]
WantedBy=multi-user.target
[Unit]
Description=Prometheus Telegram
After=network.target

[Service]
Type=simple
User=prometheus
Restart=on-failure
WorkingDirectory=/opt/prometheus_bot-master
ExecStart=/opt/prometheus_bot-master/prometheus_bot -c config.yaml

[Install]
WantedBy=multi-user.target

Axia Codex

The monitoring system for Axia Codex is here:

http://prometheus.example.io:9090/