Aug 4 2024

dereckson added projects to T2002: Export metrics for MySQL and MariaDB: Monitoring and reporting, Nasqueron Docker deployment squad.
Aug 4 2024, 17:31 · Nasqueron Docker deployment squad, Monitoring and reporting
dereckson renamed T1999: Export metrics for php-fpm from Metrics for php-fpm to Export metrics for php-fpm.
Aug 4 2024, 17:07 · Operations sprints (Ignite Alkane Propulsion), freebsd-port-wanted, Alkane, PHP 8.x support, Monitoring and reporting
dereckson added hashtags to Monitoring and reporting: #observability, #prometheus.
Aug 4 2024, 17:06
dereckson moved T1983: Enable telemetry on Vault from Backlog to Prometheus on the Monitoring and reporting board.
Aug 4 2024, 17:05 · Vault, Monitoring and reporting
dereckson moved T1999: Export metrics for php-fpm from Backlog to Prometheus on the Monitoring and reporting board.
Aug 4 2024, 17:05 · Operations sprints (Ignite Alkane Propulsion), freebsd-port-wanted, Alkane, PHP 8.x support, Monitoring and reporting
dereckson moved T2000: Export metrics for PostgreSQL from Backlog to Prometheus on the Monitoring and reporting board.
Aug 4 2024, 17:05 · Servers, Monitoring and reporting
dereckson triaged T2000: Export metrics for PostgreSQL as Normal priority.
Aug 4 2024, 17:04 · Servers, Monitoring and reporting
dereckson created T1999: Export metrics for php-fpm.
Aug 4 2024, 16:50 · Operations sprints (Ignite Alkane Propulsion), freebsd-port-wanted, Alkane, PHP 8.x support, Monitoring and reporting
dereckson closed T1392: Evaluate Prometheus as Resolved.

Graduated and adopted. It's easy to deploy, easy to configure, easy to scrape.

Aug 4 2024, 16:36 · Monitoring and reporting, Product evaluation
dereckson closed T1623: Deploy Prometheus to gain observability, a subtask of T1392: Evaluate Prometheus, as Resolved.
Aug 4 2024, 16:34 · Monitoring and reporting, Product evaluation
dereckson closed T1623: Deploy Prometheus to gain observability as Resolved.
Aug 4 2024, 16:34 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson claimed T1623: Deploy Prometheus to gain observability.

Prometheus is available, regardless of the initial goal to offer a service mesh on Kubernetes.

Aug 4 2024, 16:34 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson moved T1990: Export metrics for Postfix from Backlog to Prometheus on the Monitoring and reporting board.
Aug 4 2024, 16:32 · Mail, Monitoring and reporting
dereckson added a project to T650: Deploy PCP on Docker engines: Monitoring and reporting.
Aug 4 2024, 16:32 · Monitoring and reporting, Servers
dereckson closed T651: Deploy Grafana as Resolved.
Aug 4 2024, 16:30 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson added a comment to T651: Deploy Grafana.

Documentation added in https://agora.nasqueron.org/Operations_grimoire/Grafana and links to other dashboards added to relevant places.

Aug 4 2024, 16:30 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson placed T1693: Evaluate Sensu for monitoring up for grabs.

Just a small note: this product is becoming more and more open core, so we're less in favour of this one "specifically".

Aug 4 2024, 09:54 · Servers, Monitoring and reporting, Product evaluation

Aug 3 2024

dereckson moved T1693: Evaluate Sensu for monitoring from In progress to Backlog on the User-Dereckson board.
Aug 3 2024, 19:50 · Servers, Monitoring and reporting, Product evaluation

Jul 30 2024

dereckson assigned T1990: Export metrics for Postfix to DorianWinty.

Author seems to report issues with the exporter and uses mtail.

Jul 30 2024, 21:30 · Mail, Monitoring and reporting
DorianWinty added a revision to T1990: Export metrics for Postfix: D3386: Scrape Postfix metrics into Prometheus.
Jul 30 2024, 21:00 · Mail, Monitoring and reporting
DorianWinty added a parent task for T1990: Export metrics for Postfix: T1930: Postfix Provisioning.
Jul 30 2024, 20:59 · Mail, Monitoring and reporting

Jul 29 2024

dereckson triaged T1990: Export metrics for Postfix as Normal priority.
Jul 29 2024, 20:33 · Mail, Monitoring and reporting

Jul 27 2024

dereckson added a revision to T1623: Deploy Prometheus to gain observability: D3381: Configure Docker metrics service in firewalld.
Jul 27 2024, 17:07 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson added a revision to T651: Deploy Grafana: D3380: Set correct Grafana URL.
Jul 27 2024, 13:57 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson added a revision to T651: Deploy Grafana: D3379: Move Grafana plugins directory to default location.
Jul 27 2024, 13:51 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers

Jul 26 2024

dereckson removed a project from T651: Deploy Grafana: Nasqueron Docker deployment squad.

Not deployed to Docker but on bare metal.

Jul 26 2024, 23:09 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson moved T651: Deploy Grafana from Backlog - Monitoring / misc to Working on on the Operations sprints (Ignite Alkane Propulsion) board.
Jul 26 2024, 23:09 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson moved T651: Deploy Grafana from Backlog to Working on on the Servers board.
Jul 26 2024, 23:09 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson added a comment to T651: Deploy Grafana.

Deployed at https://grafana.nasqueron.org/

Jul 26 2024, 19:38 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson added a revision to T651: Deploy Grafana: D3377: Deploy Grafana.
Jul 26 2024, 19:38 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson added a comment to T1496: Deploy StatsD and Graphite.

With a transformation, we can multiply the field by 10 to solve that issue: https://grafana.nasqueron.org/d/fdsy3oogbchs0f/graphite-quux-sandbox?orgId=1&tab=transform&from=1722018733917&to=1722022333917

Jul 26 2024, 19:32 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson added a comment to T1496: Deploy StatsD and Graphite.

The graphite Docker image provides both pieces of software.

Jul 26 2024, 19:24 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson renamed T1496: Deploy StatsD and Graphite from Deploy graphite to Deploy StatsD and Graphite.
Jul 26 2024, 19:13 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson merged task T1495: Deploy StatsD into T1496: Deploy StatsD and Graphite.
Jul 26 2024, 19:13 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson merged T1495: Deploy StatsD into T1496: Deploy StatsD and Graphite.
Jul 26 2024, 19:13 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting

Jul 25 2024

dereckson added a comment to T651: Deploy Grafana.

DNS: grafana. CNAME www-dev.nasqueron.org

Jul 25 2024, 18:49 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson added a comment to T651: Deploy Grafana.

The deployment can use sqlite3 as long as it remains performant, as we want our monitoring tools to be resilient.

Jul 25 2024, 18:49 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson raised the priority of T651: Deploy Grafana from Low to Normal.
Jul 25 2024, 18:12 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson claimed T651: Deploy Grafana.

This task was created in 2016 to publish metrics from PCP (Performance Co-Pilot) on RHEL-like servers, especially our Docker engines.

Jul 25 2024, 18:12 · Monitoring and reporting, Operations sprints (Ignite Alkane Propulsion), Servers
dereckson added a comment to T1623: Deploy Prometheus to gain observability.

RabbitMQ exporters have been added to NetBox under the tag observability -> https://netbox.nasqueron.org/ipam/services/?tag=observability 🔒

Jul 25 2024, 18:04 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson added a subtask for T1623: Deploy Prometheus to gain observability: T1987: Dovecot Metrics.
Jul 25 2024, 18:03 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers

Jul 24 2024

dereckson updated the task description for T1983: Enable telemetry on Vault.
Jul 24 2024, 00:00 · Vault, Monitoring and reporting

Jul 23 2024

dereckson triaged T1983: Enable telemetry on Vault as Low priority.
Jul 23 2024, 23:57 · Vault, Monitoring and reporting
dereckson added a revision to T1623: Deploy Prometheus to gain observability: D3373: Enable rabbitmq_prometheus plugin.
Jul 23 2024, 23:15 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson added a revision to T1623: Deploy Prometheus to gain observability: D3372: Collect Docker metrics with Prometheus.
Jul 23 2024, 23:05 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson added a revision to T1623: Deploy Prometheus to gain observability: D3370: Deploy Prometheus on WindRiver.
Jul 23 2024, 22:28 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson removed a subtask for T1980: ZFS collector doesn't work everywhere: T1981: Upgrade to FreeBSD 14.1.
Jul 23 2024, 21:15 · Monitoring and reporting, Servers
dereckson closed T1980: ZFS collector doesn't work everywhere as Resolved.

I suspect version 1.6.1 (currently in packages) is compatible with FreeBSD 13, while version 1.8.2 is compatible with FreeBSD 14.

Jul 23 2024, 21:15 · Monitoring and reporting, Servers
dereckson moved T1978: Document monitoring checks from Backlog to Checks on the Monitoring and reporting board.
Jul 23 2024, 20:58 · Salt, Monitoring and reporting
dereckson moved T1945: Deploy a simple Nagios or Naemon to have a reference implementation from Backlog to Checks on the Monitoring and reporting board.
Jul 23 2024, 20:58 · Monitoring and reporting
dereckson moved T1623: Deploy Prometheus to gain observability from Backlog to Prometheus on the Monitoring and reporting board.
Jul 23 2024, 20:57 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson moved T1980: ZFS collector doesn't work everywhere from Backlog to Prometheus on the Monitoring and reporting board.
Jul 23 2024, 20:57 · Monitoring and reporting, Servers
dereckson moved T1392: Evaluate Prometheus from Backlog to Prometheus on the Monitoring and reporting board.
Jul 23 2024, 20:57 · Monitoring and reporting, Product evaluation
dereckson triaged T1980: ZFS collector doesn't work everywhere as Low priority.
Jul 23 2024, 20:55 · Monitoring and reporting, Servers
dereckson added a revision to T1623: Deploy Prometheus to gain observability: D3369: Deploy Prometheus Node Exporter.
Jul 23 2024, 18:30 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson added a comment to T1623: Deploy Prometheus to gain observability.

Independently of the 2020 plan for a service mesh, we're going to deploy Prometheus right now to gain observability on currently deployed services.

Jul 23 2024, 18:05 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers
dereckson added a comment to T1392: Evaluate Prometheus.

Prometheus has gained traction, and time series are becoming more and more standard.

Jul 23 2024, 18:01 · Monitoring and reporting, Product evaluation
dereckson renamed T1392: Evaluate Prometheus from Deploy Prometheus to Evaluate Prometheus.
Jul 23 2024, 17:57 · Monitoring and reporting, Product evaluation
dereckson added a project to T1392: Evaluate Prometheus: Monitoring and reporting.
Jul 23 2024, 17:57 · Monitoring and reporting, Product evaluation
dereckson added a comment to T1978: Document monitoring checks.

In D2648, the NRPE directory was set to dirs.share + "/monitoring/checks/nrpe", which resolves on FreeBSD to the /usr/local/share/monitoring/checks/nrpe directory.

Jul 23 2024, 17:30 · Salt, Monitoring and reporting

Jul 21 2024

dereckson triaged T1978: Document monitoring checks as High priority.
Jul 21 2024, 13:39 · Salt, Monitoring and reporting

Jan 22 2024

dereckson added a comment to T1945: Deploy a simple Nagios or Naemon to have a reference implementation.

Both naemon-core and naemon-livestatus deployed successfully on WindRiver. Checks run as expected.

Jan 22 2024, 00:34 · Monitoring and reporting
dereckson added a revision to T1945: Deploy a simple Nagios or Naemon to have a reference implementation: D3298: New port to test: net-mgmt/naemon-livestatus.
Jan 22 2024, 00:32 · Monitoring and reporting

Jan 21 2024

dereckson added a revision to T1945: Deploy a simple Nagios or Naemon to have a reference implementation: D3297: Deploy Naemon on WindRiver.
Jan 21 2024, 23:52 · Monitoring and reporting

Jan 16 2024

dereckson added a revision to T1945: Deploy a simple Nagios or Naemon to have a reference implementation: D3293: New port to test: net-mgmt/naemon-core.
Jan 16 2024, 23:40 · Monitoring and reporting
dereckson triaged T1945: Deploy a simple Nagios or Naemon to have a reference implementation as Normal priority.
Jan 16 2024, 23:40 · Monitoring and reporting

May 25 2023

dereckson triaged T1878: Allow to run queries for reporting as Wishlist priority.
May 25 2023, 04:23 · Monitoring and reporting, security, DBA, Servers
dereckson moved T1878: Allow to run queries for reporting from Backlog to Services / Features on the DBA board.
May 25 2023, 04:23 · Monitoring and reporting, security, DBA, Servers

May 24 2023

dereckson moved T1693: Evaluate Sensu for monitoring from Current focus to Backlog on the Product evaluation board.
May 24 2023, 22:37 · Servers, Monitoring and reporting, Product evaluation

May 20 2023

dereckson updated the task description for T1878: Allow to run queries for reporting.
May 20 2023, 15:45 · Monitoring and reporting, security, DBA, Servers
dereckson added a comment to T1878: Allow to run queries for reporting.

At a minimum, having somewhere (a reports repository?) to write those report queries would already be useful, so we don't lose them.

May 20 2023, 15:43 · Monitoring and reporting, security, DBA, Servers
dereckson created T1878: Allow to run queries for reporting.
May 20 2023, 15:42 · Monitoring and reporting, security, DBA, Servers

May 18 2023

dereckson moved T1623: Deploy Prometheus to gain observability from Backlog to Not for this sprint on the Operations sprints (Consolidate them all) board.
May 18 2023, 11:49 · Monitoring and reporting, Operations sprints (Consolidate them all), Servers

May 6 2023

dereckson moved T1740: Monitor container disk usage from Backlog to Backlog - Monitoring / misc on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 18:00 · Operations sprints (Ignite Alkane Propulsion), Monitoring and reporting, Nasqueron Docker deployment squad, Servers
dereckson moved T1495: Deploy StatsD from Backlog to Backlog - Monitoring / misc on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 18:00 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1704: Monitor HTTP back-end from Docker containers from Backlog to Backlog - Monitoring / misc on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 18:00 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1496: Deploy StatsD and Graphite from Backlog to Backlog - Monitoring / misc on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 18:00 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1705: Monitor a container is up from Backlog to Backlog - Monitoring / misc on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 18:00 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1496: Deploy StatsD and Graphite from Backlog - Docker to Backlog on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:56 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1495: Deploy StatsD from Backlog - Docker to Backlog on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:56 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1740: Monitor container disk usage from Backlog - Docker to Backlog on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:56 · Operations sprints (Ignite Alkane Propulsion), Monitoring and reporting, Nasqueron Docker deployment squad, Servers
dereckson moved T1705: Monitor a container is up from Backlog - Docker to Backlog on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:56 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1704: Monitor HTTP back-end from Docker containers from Backlog - Docker to Backlog on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:56 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1704: Monitor HTTP back-end from Docker containers from Backlog to Backlog - Docker on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:55 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1495: Deploy StatsD from Backlog to Backlog - Docker on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:55 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1496: Deploy StatsD and Graphite from Backlog to Backlog - Docker on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:55 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1740: Monitor container disk usage from Backlog to Backlog - Docker on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:55 · Operations sprints (Ignite Alkane Propulsion), Monitoring and reporting, Nasqueron Docker deployment squad, Servers
dereckson moved T1705: Monitor a container is up from Backlog to Backlog - Docker on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:55 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1816: Automate Kafka cluster healing from Backlog to Backlog - Docker on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 15:55 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson moved T1809: Propagate containers-related events from Backlog to Blocked on the Operations sprints (Ignite Alkane Propulsion) board.
May 6 2023, 13:07 · Operations sprints (Ignite Alkane Propulsion), Monitoring and reporting, Nasqueron Docker deployment squad, Servers
dereckson added a project to T1809: Propagate containers-related events: Operations sprints (Ignite Alkane Propulsion).
May 6 2023, 12:54 · Operations sprints (Ignite Alkane Propulsion), Monitoring and reporting, Nasqueron Docker deployment squad, Servers

May 1 2023

dereckson moved T1790: Trace eggdrop errors from Backlog to Eggdrop dev on the IRC board.
May 1 2023, 20:36 · IRC, Monitoring and reporting, Dæghrefn

Apr 2 2023

dereckson closed T1818: Sentry database was forcefully updated during a PostgreSQL deployment as Resolved by committing rOPSc3de759c1b33: Use nasqueron/postgres-sentry image.
Apr 2 2023, 15:03 · Nasqueron Docker deployment squad, Monitoring and reporting, Salt
dereckson added a revision to T1818: Sentry database was forcefully updated during a PostgreSQL deployment: D2969: Use nasqueron/postgres-sentry image.
Apr 2 2023, 15:03 · Nasqueron Docker deployment squad, Monitoring and reporting, Salt
dereckson edited projects for T1818: Sentry database was forcefully updated during a PostgreSQL deployment, added: Nasqueron Docker deployment squad; removed Docker images.

This deployment created the incident:

Apr 2 2023, 14:52 · Nasqueron Docker deployment squad, Monitoring and reporting, Salt
dereckson moved T1818: Sentry database was forcefully updated during a PostgreSQL deployment from Backlog to Sentry on the Monitoring and reporting board.
Apr 2 2023, 14:41 · Nasqueron Docker deployment squad, Monitoring and reporting, Salt
dereckson moved T1818: Sentry database was forcefully updated during a PostgreSQL deployment from Backlog to Bug and issues on the Salt board.
Apr 2 2023, 14:41 · Nasqueron Docker deployment squad, Monitoring and reporting, Salt
dereckson triaged T1818: Sentry database was forcefully updated during a PostgreSQL deployment as Unbreak Now! priority.
Apr 2 2023, 14:41 · Nasqueron Docker deployment squad, Monitoring and reporting, Salt
dereckson updated the task description for T1816: Automate Kafka cluster healing.
Apr 2 2023, 11:38 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting
dereckson updated the task description for T1816: Automate Kafka cluster healing.
Apr 2 2023, 11:29 · Operations sprints (Ignite Alkane Propulsion), Nasqueron Docker deployment squad, Monitoring and reporting