devel branch merge #14

Open
wants to merge 79 commits into
base: master

kloczek
Owner

@kloczek kloczek commented Jul 3, 2018

This PR has been created to give visibility into what will be merged from the current "devel" branch.
The status of this PR also shows any conflicts which need to be sorted out before the merge.

When everything prepared on "devel" is ready, this PR will be merged.

kloczek and others added 30 commits June 20, 2018 18:16
Graphs:
- new HOST::items which presents the number of host items vs. the number of unsupported items
Items:
- new items with current counters of total host items (supported/unsupported/enabled/disabled)
(I must remember to change all those URLs after the merge to master)
non-standard port.
Macros: {$SVC_SSH_PORT}
- interfaces: use IF-MIB::ifAdminStatus instead of IF-MIB::ifOperStatus when checking whether an interface is up or down
- interfaces: add IF-MIB::ifIndex to the LLD iterator and map it to the {#IFINDEX} macro
- interfaces: add [{#IFINDEX}] to the names of all item, trigger and graph prototypes to display the physical port number in those names
- create item description
{$PHPFPM_HOST} -> {$SVC_PHP_FPM_HOST}
{$PHPFPM_PORT} -> {$SVC_PHP_FPM_PORT}
{$PHPFPM_STATUS_URI} -> {$SVC_PHP_FPM_STATS_URI}
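For anyone carrying local copies of the old macro names, the rename above can be sketched as a small text rewrite. This is only an illustration: the rename map comes from the commit message, while the function name `rename_macros` and the sample string in the usage note are made up for the example and are not part of the templates.

```python
import re

# Old -> new macro names, as listed in the commit message above.
MACRO_RENAMES = {
    "{$PHPFPM_HOST}": "{$SVC_PHP_FPM_HOST}",
    "{$PHPFPM_PORT}": "{$SVC_PHP_FPM_PORT}",
    "{$PHPFPM_STATUS_URI}": "{$SVC_PHP_FPM_STATS_URI}",
}


def rename_macros(text, renames=MACRO_RENAMES):
    """Return `text` with every old macro replaced by its new name.

    Hypothetical helper: macros are matched as complete `{$NAME}` tokens,
    so other macros that merely share a prefix are left untouched.
    """
    pattern = re.compile("|".join(re.escape(old) for old in renames))
    return pattern.sub(lambda match: renames[match.group(0)], text)
```

Calling `rename_macros("{$PHPFPM_HOST}:{$PHPFPM_PORT}")` would give `{$SVC_PHP_FPM_HOST}:{$SVC_PHP_FPM_PORT}`; text without the old macros is returned unchanged.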
even if the ps command shows working php-fpm processes.
If this trigger is active it may also mean that a macro used by the template
needs to be corrected (best in the host template macros).
- Graphs:
  - delete SVC::zabbix_server::items/wcache::buffer::pfree graph
  - new SVC::zabbix_server::hosts graph with number of monitored hosts
  - new SVC::zabbix_server::items/triggers graph with total number of items vs. triggers
  - new SVC::zabbix_server::items::nvps graphs with zabbix stack NVPS
- Screens:
  - reorganized SVC::zabbix_server::status screen to put in rows:
    - "process busy %" and "internal process busy %"
    - "caches usage %" and "vcache::hits/misses"
    - "processing queue" and flame like graph with "queued items"
    - "items total/unsupported" and "items::nvps"
    - "items/triggers" and "hosts"
  - Triggers:
    - added missing "SYS::Host is down" trigger with all necessary dependencies
 - Triggers:
   - "SYS::Host is down": corrected description
 - LLDs:
   - graphs improvements
  - Triggers:
    - added SYS::Host is down
    - added CPU::idle {ITEM.VALUE}
OS Linux:
 - Remove duplicated triggers on LLD VOL
Just apply changes in the template format generated by Zabbix 4.2.x
without changing anything in items, triggers etc.
Macro {$HTTP_SERVICE_PROCESS} has been added to make the template easy to use on Debian/Ubuntu.
Default value is "http". On Ubuntu/Debian that value needs to be changed to "apache2".
The resolution of all graphs in screens has been changed to 1200x200, or to 600x200
for screens with two columns of graphs. That resolution is better suited for FHD.
Added to the "VOL:" LLD a filter which excludes all volumes mounted under /var/lib/lxc. If the OS Linux
template is used inside each LXC container it will provide proper, non-duplicated
monitoring of the LXC container volumes.
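The effect of that filter can be sketched roughly as below. This is a plain-Python illustration and not the LLD filter expression used in the template: the function names are hypothetical, and treating "under /var/lib/lxc" as a path-prefix test is an assumption.

```python
def is_monitored_volume(mount_point, excluded_root="/var/lib/lxc"):
    """Return False for the excluded root and for anything mounted below it.

    Assumption: "mounted under" means a path-component prefix match.
    """
    root = excluded_root.rstrip("/")
    return not (mount_point == root or mount_point.startswith(root + "/"))


def filter_volumes(mount_points):
    """Keep only the mount points which the host-level template should discover."""
    return [m for m in mount_points if is_monitored_volume(m)]
```

With this sketch, `filter_volumes(["/", "/var/lib/lxc/c1/rootfs"])` keeps only `/`; the container volume is then expected to be picked up by the template applied inside the container.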
…VALUE} LLD trigger.

Alarms are now raised at <1%, <5%, <10% and, newly, <20% free volume space.
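As a rough illustration of how those four levels nest, the sketch below returns the tightest threshold that a given free-space percentage has crossed. The threshold values come from the commit message; the function name and the idea of reporting only the tightest level are assumptions made for the example, not a description of the trigger dependencies in the template.

```python
# Free-space thresholds in percent, in ascending order (from the commit message).
FREE_SPACE_THRESHOLDS = (1, 5, 10, 20)


def crossed_threshold(pfree):
    """Return the tightest threshold that `pfree` is below, or None if none is."""
    for limit in FREE_SPACE_THRESHOLDS:
        if pfree < limit:
            return limit
    return None
```

For example, a volume with 7.2% free space crosses the 10% level, and one with exactly 20% free crosses none.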
… less than 5% or 15% CPU

idle time for at least 1 hour, and the alarm is cleared when for the next hour there is no other
flapping above 5% or 15%.
…ices ZFS

vdevs partitions and corrected regexp to remove all SCSI partitions
Remove control characters.
Fixed sd* SCSI devices partitions.
Add info that now this LLD is able to handle ZFS vdevs.
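The kind of filtering described for sd* devices might look like the sketch below. The pattern is a hypothetical stand-in and not the regexp from the template: it assumes that a SCSI partition is named `sd`, then letters, then a partition number, and it does not attempt to cover the ZFS vdev case.

```python
import re

# Hypothetical pattern: "sd" + drive letters + partition number, e.g. sda1, sdab12.
SCSI_PARTITION = re.compile(r"^sd[a-z]+\d+$")


def whole_disks(devices):
    """Drop SCSI partitions and keep every other device name."""
    return [d for d in devices if not SCSI_PARTITION.match(d)]
```

Given `["sda", "sda1", "sdb", "sdb12"]`, the sketch keeps `sda` and `sdb` only.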
Added new MEM:: metrics which are now available in zabbix 4.2.x (added by my patch
https://support.zabbix.com/browse/ZBX-13233):
- MEM::active
- MEM::anon
- MEM::inactive
- MEM::slab
Added new graph MEM::active vs inactive.
Added to MEM screen MEM::active vs inactive graph.

Closing https://github.com/kloczek/zabbix-templates/projects/1#card-22419180.
Forgot to mention in the previous commit description that the
"Discard unchanged" preprocessing filter cannot be used for now
https://support.zabbix.com/browse/ZBX-16456 so this is a kind of workaround
for that issue.
When the issue with the "Discard unchanged" filter on the presentation layer is sorted out
I'm going to change the sampling of those metrics to 10 or 20 min. Even with that,
less data will be written to the database, while any live changes of memory
settings in containers/namespaces will be visible almost instantly in the monitoring data.
…story period=1h

As the "Discard unchanged" preprocessing filter cannot be used for now
https://support.zabbix.com/browse/ZBX-16456 this is only a temporary
workaround until that issue is sorted out.
…ible.

Template has been tested on Ubuntu Xenial and Bionic.
CPU::idle trigger now uses macros:
- {$OS_CPU_IDLE}: threshold below which an alarm about low
  CPU::idle time is raised
- {$OS_CPU_IDLE_HIST}: hysteresis time for CPU::idle after which the alarm
  starts; it recovers after an additional period defined in {$OS_CPU_IDLE_HIST}
If those macros are not defined in system macros, alarms about low CPU::idle time will not be created
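The hysteresis behaviour described above can be sketched as a small state machine. This is a simplified model and not the trigger expression from the template: it assumes the window is counted in samples rather than in time, that the alarm starts only when every sample in the window is below the threshold, and that it clears only when every sample in the window is back at or above it. The names `alarm_states`, `threshold` and `window_samples` are hypothetical stand-ins for {$OS_CPU_IDLE} and {$OS_CPU_IDLE_HIST}.

```python
from collections import deque


def alarm_states(values, threshold, window_samples):
    """Return the alarm state (True/False) after each CPU idle sample.

    The alarm starts when the whole window stays below `threshold` and
    clears when the whole window stays at or above it; anything in
    between keeps the previous state, which suppresses flapping.
    """
    recent = deque(maxlen=window_samples)
    active = False
    states = []
    for value in values:
        recent.append(value)
        if len(recent) == window_samples:  # decide only on a full window
            if not active and max(recent) < threshold:
                active = True
            elif active and min(recent) >= threshold:
                active = False
        states.append(active)
    return states
```

With a threshold of 5 and a window of 3 samples, a single dip below 5 does not start the alarm, and a single recovery above 5 does not clear it.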
- Items:
  - PROC::mysqld
    - change hardcoded MySQL process name to the macro {$SVC_MYSQL_PROC}
    - add missing "processes" unit
  - "version": add "Discard unchanged with heartbeat" filter which allows
    changing the sampling rate from 1d to 5min with a heartbeat period of 7d
- Triggers:
  - "SYS::processes::{$SVC_MYSQL_PROC} is not running" severity changed
    to disaster
- Use Markdown markup in the template description
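The reason a heartbeat filter allows much more frequent sampling without more stored data can be sketched as follows. This is a simplified model of the idea and not Zabbix's implementation: a value is kept when it differs from the last kept one, or when at least `heartbeat` seconds have passed since the last kept one. The function name and the sample data in the usage note are made up for the example.

```python
def discard_unchanged_with_heartbeat(samples, heartbeat):
    """Return the (timestamp, value) pairs which would be stored.

    `samples` must be sorted by timestamp; `heartbeat` is in the same
    time unit as the timestamps.
    """
    kept = []
    last_value = None
    last_stored = None
    for ts, value in samples:
        changed = value != last_value
        heartbeat_due = last_stored is not None and ts - last_stored >= heartbeat
        if last_stored is None or changed or heartbeat_due:
            kept.append((ts, value))
            last_value = value
            last_stored = ts
    return kept
```

Sampling a version string every 300 seconds with a 900 second heartbeat would store the first value, one repeat per heartbeat, and every change as soon as it is seen.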
No trigger should be active when the "SYS::Host is down" trigger is active.
Add proper dependency for all triggers which still do not depend on "SYS::Host is down"
Add proper dependency for all triggers which still do not depend on "SYS::Host is down"
and {$NGINX_PORT} to {$SVC_NGINX_PORT}.
Just to keep consistency with other templates.
First rule for triggers like:
 - Version has changed
 - Configuration parameter has changed
Assign them the "Not classified" severity.
…rsion change

Service MySQL:
 - Triggers:
    - change "SVC::MySQL::version has changed to {ITEM.LASTVALUE}" to "Not classified"
Service Zabbix Agent:
 - Triggers:
    - change "SVC::zabbix_agent::version has changed to {ITEM.LASTVALUE}" to "Not classified"
Complete swap space monitoring using LLD, so system swap monitoring
is added only when the monitored system uses swap device(s).
During the implementation of the swap monitoring I found some
difficulties, which are described in:
https://support.zabbix.com/browse/ZBX-16821
This is an excellent example of how to add similar things which add
a set of items/triggers/graphs, or not, depending on the state of a
single resource.

Monitoring of the swap consists of:
- "SWAP:" new LLD
  It adds prototypes:
  - items: "in", "total", "out" and "used" with discard unchanged filter
  - "SWAP" and "SWAP::in/out" graphs
  - trigger: "SWAP::low space ({$ITEM.VALUE}% used)"
    the trigger is activated when more than "{$SWAP_HIGH}" percent of the swap
    is used, with the default value "{$SWAP_HIGH}=80"
- Screens:
  - "SWAP" added with "SWAP" and "SWAP::in/out" graphs
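The trigger condition described above can be sketched in a few lines. This is only an illustration of the arithmetic, not the trigger expression from the template: the function name is hypothetical, and the default of 80 mirrors the "{$SWAP_HIGH}=80" default mentioned in the commit message. Returning False for a host without swap is an assumption that matches the idea that the LLD creates nothing in that case.

```python
def swap_low_space(total_bytes, used_bytes, swap_high=80.0):
    """Return True when more than `swap_high` percent of the swap is used."""
    if total_bytes <= 0:
        # No swap device: nothing to monitor, so nothing to alarm on.
        return False
    used_percent = 100.0 * used_bytes / total_bytes
    return used_percent > swap_high
```

With the default threshold, 810 of 1000 bytes used raises the alarm, while exactly 800 of 1000 does not, because the condition is "more than".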

TODO: add similar modification for L2ARC monitoring and MySQL slave databases
connections.
2 participants