1. ebb071c Fix incorrect state in Loadbalancer monitoring by JustHumanz · 3 weeks ago
  2. 26a6748 [stable/zed] Enhance `MySQLDown` alert (#2291) by vexxhost-bot · 9 weeks ago
  3. 4a0ca84 [ATMOSPHERE-368] [stable/zed] Add the NodeTimeSkewDetected alert (#2179) by vexxhost-bot · 4 months ago
  4. 2a500b4 [ATMOSPHERE-580] [stable/zed] Update NovaServiceGroupDown rule and Added failing tests (#2107) by vexxhost-bot · 4 months ago
  5. 1ff9767 [ATMOSPHERE-527] [stable/zed] Improve NeutronNetworkOutOfIPs alarm (#2064) by vexxhost-bot · 5 months ago
  6. 78279a1 [ATMOSPHERE-520][stable/zed] Remove NodeNonLTSKernel alert (#2045) by vexxhost-bot · 5 months ago
  7. 9559cf3 [stable/zed] [ATMOSPHERE-503] fix: remove softnet squeeze rules in kube-prometheus-stack (#2039) by vexxhost-bot · 5 months ago
  8. 1454d86 [ATMOSPHERE-512][stable/zed] Disable CephPGImbalance (#2037) by vexxhost-bot · 5 months ago
  9. 62597e6 [ATMOSPHERE-432] Fix goldpinger grafana dashboard threshold for nodes… (#1845) by Yaguang Tang · 7 months ago
  10. e47a25b [stable/zed] Add TLS to node exporter (#1780) by Yaguang Tang · 7 months ago
  11. ffdb3c6 [ATMOSPHERE-401] Add CommonName for monitoring stack (#1764) by vexxhost-bot · 7 months ago
  12. ffbf3dd [ATMOSPHERE-342] zed enable softirq monitoring (#1734) by Yaguang Tang · 8 months ago
  13. 1a63115 [ATMOSPHERE-315] Add support of ceph dashboard in grafana (#1709) by vexxhost-bot · 8 months ago
  14. 49e6e00 [ATMOSPHERE-305] fix: set variables for cluster issuer name for keycl… (#1702) by Oleksandr K. · 8 months ago
  15. 88f72a2 [stable/zed] Add Goldpinger + node-exporter-full (#1640) (#1684) by Oleksandr K. · 8 months ago
  16. 8bde924 [stable/zed] Change promethues to use pvc for data store (#1667) by vexxhost-bot · 8 months ago
  17. 154e6b4 [stable/zed] fix: nova capacity alert (#1596) by vexxhost-bot · 8 months ago
  18. 41408df [stable/zed] Add support to collect keycloak application metrics to prometheus (#1562) by vexxhost-bot · 8 months ago
  19. 0d48254 [stable/zed] Fix `libvirt_exporter` missing `namespaceSelector: (#1527) by vexxhost-bot · 8 months ago
  20. 732bdcd [stable/zed] grafana: Allow user lookups by email (#1515) by vexxhost-bot · 8 months ago
  21. 257a178 [stable/zed] ceph: Add CephHealthDetail alerts (#1502) by vexxhost-bot · 8 months ago
  22. c1b3f24 Fix JSONNET rendiner for alerts by Mohammed Naser · 9 months ago
  23. e4bb7fc [stable/zed] Add build request failure monitoring [ATMOSPHERE-249] (#1433) by Mohammed Naser · 9 months ago
  24. ebcd7d7 [stable/zed] Improve CI reliability (#1413) by Mohammed Naser · 9 months ago
  25. 9ae1303 [stable/zed] fix: use openstack_helm_ingress_secret_name when set for monitoring (#1407) by vexxhost-bot · 9 months ago
  26. 18adb06 [stable/zed] fix: add CA mounts in the Prometheus oauth2 container (#1335) by vexxhost-bot · 9 months ago
  27. a546734 [stable/zed] Switch docs to Sphinx (#1166) (#1169) by Mohammed Naser · 11 months ago
  28. 2bc6f4b [stable/zed] Add monitoring for stuck VMs (#1133) by vexxhost-bot · 11 months ago
  29. d206f5d feat: Add openstack db exporter (#1039) by Rico Lin · 12 months ago
  30. 37ebfde fix: fix CI with aritubee not define issue (#989) by Rico Lin · 1 year, 1 month ago
  31. 91e2fa0 feat(monitoring): expose prom/am via sso (#987) by Mohammed Naser · 1 year, 1 month ago
  32. 0b59744 feat: increase EL compatibility (#963) by Tadas Sutkaitis · 1 year, 1 month ago
  33. 8dc7add fix(keycloak): add no_log and disable become by Mohammed Naser · 1 year, 2 months ago
  34. 2e937c9 fix: added monitoring for high 500s count by Mohammed Naser · 1 year, 3 months ago
  35. 93c165d chore: update doc for kube-prom-stack ingresses (#713) by Oleksandr Kozachenko · 1 year, 4 months ago
  36. 947a84a feat(libvirt): Enable exporter ootb (#573) by Oleksandr Kozachenko · 1 year, 5 months ago
  37. 2beb903 fix(monitoring): fire IpmiCollectorDown after 15m by Mohammed Naser · 1 year, 5 months ago
  38. 6589394 fix(monitoring): drop ethtool exporter (#572) by Mohammed Naser · 1 year, 6 months ago
  39. b009349 feat: Add keycloak (#510) by Oleksandr Kozachenko · 1 year, 6 months ago
  40. 5b49cbb feat(monitoring): refactor (#555) by Mohammed Naser · 1 year, 7 months ago
  41. 4a761bb fix: added NodeNetworkMulticast by Mohammed Naser · 1 year, 9 months ago
  42. 7ae2b65 fix: ignore vxlan- in node exporter by Mohammed Naser · 1 year, 9 months ago
  43. 610ff8c add alerts for node softnet by ricolin · 1 year, 9 months ago
  44. dce06d4 fix: ignore osa interfaces by Mohammed Naser · 1 year, 9 months ago
  45. 403a42a fix: Add NodeNonLTSKernel alert (#404) by Rico Lin · 1 year, 10 months ago
  46. 3e5885e Update main.yml by Mohammed Naser · 1 year, 11 months ago
  47. 55100d5 Add missing default for grafana host by ricolin · 1 year, 11 months ago
  48. d778add Correct grafana variable name by ricolin · 1 year, 11 months ago
  49. cc14968 feat: unify all monitoring via grafana by Mohammed Naser · 1 year, 11 months ago
  50. f0314a8 fix: implement isolated clusters by Mohammed Naser · 1 year, 11 months ago
  51. 574d650 fix: use updated vexxhost.k8s by Mohammed Naser · 2 years ago
  52. 6b7acca fix: tune net.core.netdev_budget by Mohammed Naser · 2 years ago
  53. 7538f02 chore: refactor to v.k8s.upload_helm_chart by Mohammed Naser · 2 years ago
  54. 31171f4 chore: refactor to vexxhost.k8s.docker_image by Mohammed Naser · 2 years ago
  55. 9118f67 feat(monitoring): add metrics for ingress-nginx by Mohammed Naser · 2 years ago
  56. 7500421 fix: misc monitoring updates by Mohammed Naser · 2 years ago
  57. 40eb429 doc: fix typo in grafana by Mohammed Naser · 2 years, 1 month ago
  58. 8a2c8fb feat: add logging via vector + loki by Mohammed Naser · 2 years, 1 month ago
  59. 36f1de2 docs: clean-up opsgenie integration by Mohammed Naser · 2 years, 1 month ago
  60. e119d8b docs(monitoring): fix opsgenie by Mohammed Naser · 2 years, 1 month ago
  61. 53c04a3 doc: update monitoring docs by Mohammed Naser · 2 years, 2 months ago
  62. 273d3ca chore: move monitoring to offline install by Mohammed Naser · 2 years, 2 months ago
  63. 8b5c306 fix: use atmosphere_images for an image manifest by Mohammed Naser · 2 years, 2 months ago
  64. 7d3c797 feat(monitoring): add to operator by Mohammed Naser · 2 years, 4 months ago
  65. 6ed255b build: fix galaxy publishing by Mohammed Naser · 2 years, 6 months ago
  66. b8d3432 fix: stop waiting for kube-prometheus-stack by Mohammed Naser · 2 years, 6 months ago
  67. 09b3b54 chore: refactor servicemonitors into kube-prometheus-stack by Mohammed Naser · 2 years, 6 months ago
  68. 08c6224 chore: refactor *monitors to kube-prometheus-stack by Mohammed Naser · 2 years, 6 months ago
  69. 64da5c6 feat: clean-up more code for helm repos by Mohammed Naser · 2 years, 6 months ago
  70. 2a8ce6a fix(metrics): don't wait for entire helmrelease, just deployment by Mohammed Naser · 2 years, 6 months ago
  71. 6bf6535 ci: move ansible-lint to pre-commit by Mohammed Naser · 2 years, 6 months ago
  72. c8e1a45 Add Flux CD for Helm deployment by Mohammed Naser · 2 years, 7 months ago
  73. ba40eb3 Add exception for gre_sys by ricolin · 2 years, 8 months ago
  74. bff9371 Add persistence to AlertManager by Mohammed Naser · 2 years, 8 months ago
  75. d92c5f7 Add exception for tbr instances by Mohammed Naser · 2 years, 8 months ago
  76. 0ae4144 Drop CephNodeDiskspaceWarning by Mohammed Naser · 2 years, 9 months ago
  77. 3a15345 monitoring: upgrade kube-prometheus-stack by Mohammed Naser · 2 years, 9 months ago
  78. 55cc241 monitoring: disable noisy alerts by Mohammed Naser · 2 years, 10 months ago
  79. 6cd7291 Fix webhook errors for monitoring by Mohammed Naser · 2 years, 10 months ago
  80. f3dffa8 Fix nodeSelector for services by Mohammed Naser · 2 years, 10 months ago
  81. 49e80bd Added ability to run overrides for monitoring by Mohammed Naser · 2 years, 11 months ago
  82. 511c3fa Add ansible-lint job by Mohammed Naser · 3 years ago
  83. b7b97d6 Added OpenStack services by Mohammed Naser · 3 years ago