Page MenuHomeSoftware Heritage

Metrics/monitoringTag
ActivePublic

Members

  • This project does not have any members.
  • View All

Watchers

  • This project does not have any watchers.
  • View All

Recent Activity

Apr 21 2020

olasd closed T1270: Investigate an application monitoring tool to automate error detection in our workers as Resolved.

I'm pretty sure this is done now ;p

Apr 21 2020, 11:36 AM · Metrics/monitoring, Development environment

Feb 15 2020

vlorentz moved T2175: Deploy swh-icinga-plugins from Backlog to deployed on the Sprint 2019/12 (Monitor and Conquer) board.
Feb 15 2020, 8:18 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring
vlorentz moved T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services from Backlog to deployed on the Sprint 2019/12 (Monitor and Conquer) board.
Feb 15 2020, 8:18 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration

Jan 27 2020

vlorentz added a comment to T1365: Archive coverage metrics in prometheus.

https://grafana.softwareheritage.org/d/3SAW_JEmk/software-heritage-archive-counters

Jan 27 2020, 4:44 PM · Metrics/monitoring, Restricted Project
vlorentz closed T1365: Archive coverage metrics in prometheus, a subtask of T1364: Have production metrics in prometheus or kibana, as Resolved.
Jan 27 2020, 4:44 PM · Metrics/monitoring, Restricted Project
vlorentz closed T1365: Archive coverage metrics in prometheus as Resolved.
Jan 27 2020, 4:44 PM · Metrics/monitoring, Restricted Project

Jan 23 2020

ardumont closed T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services as Resolved.

Deployed.

Jan 23 2020, 12:09 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration
ardumont added a parent task for T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services: T2238: Configure Sentry environments.
Jan 23 2020, 11:13 AM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration

Jan 22 2020

ardumont added a revision to T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services: D2576: sentry: Define setup for swh services (servers, workers, ...).
Jan 22 2020, 6:50 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration
vlorentz added a project to T2228: Metrics and monitoring: Metrics/monitoring.
Jan 22 2020, 4:27 PM · Metrics/monitoring, Restricted Project
ardumont claimed T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services.

Adapting the puppet manifest so we can discriminate issues per environment in sentry.

Jan 22 2020, 4:13 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration
ardumont closed T2175: Deploy swh-icinga-plugins, a subtask of T1011: Enable continuous monitoring of deposit, as Resolved.
Jan 22 2020, 3:29 PM · Metrics/monitoring, SWORD deposit
ardumont closed T2175: Deploy swh-icinga-plugins as Resolved.
Jan 22 2020, 3:29 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring
ardumont added a comment to T2175: Deploy swh-icinga-plugins.

Vault check deployed!

Jan 22 2020, 3:28 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring
ardumont added a comment to T2175: Deploy swh-icinga-plugins.

Deposit check deployed!

Jan 22 2020, 2:12 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring
ardumont added a comment to T2175: Deploy swh-icinga-plugins.

debian package this

Jan 22 2020, 2:12 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring
vlorentz updated the task description for T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services.
Jan 22 2020, 2:11 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration
vlorentz renamed T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services from Set SWH_MAIN_PACKAGE for all services to Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services.
Jan 22 2020, 2:10 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration

Jan 20 2020

ardumont added a comment to T2175: Deploy swh-icinga-plugins.

debian package this

Jan 20 2020, 12:04 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring

Jan 17 2020

ardumont claimed T2175: Deploy swh-icinga-plugins.

As far as i could tell so far:

  • debian package this
  • update puppet configuration to add the checks [1]
Jan 17 2020, 5:56 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring

Jan 15 2020

vlorentz renamed T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services from Set SWH_MAIN_PACKAGE for all SWH services to Set SWH_MAIN_PACKAGE for all services.
Jan 15 2020, 2:59 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration
vlorentz triaged T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services as Normal priority.
Jan 15 2020, 2:59 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration
vlorentz updated subscribers of T2180: Configure Jenkins to publish releases to Sentry.
Jan 15 2020, 2:58 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring
vlorentz created T2181: Set SWH_MAIN_PACKAGE and SWH_SENTRY_ENVIRONMENT for all services.
Jan 15 2020, 2:58 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer), System administration
vlorentz updated the task description for T2180: Configure Jenkins to publish releases to Sentry.
Jan 15 2020, 2:56 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring
vlorentz triaged T2180: Configure Jenkins to publish releases to Sentry as Normal priority.
Jan 15 2020, 2:56 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring
vlorentz added a project to T2175: Deploy swh-icinga-plugins: Sprint 2019/12 (Monitor and Conquer).
Jan 15 2020, 1:37 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring

Jan 13 2020

vlorentz closed T2118: Deposit: End to End monitoring, a subtask of T2175: Deploy swh-icinga-plugins, as Resolved.
Jan 13 2020, 3:24 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring
vlorentz closed T2118: Deposit: End to End monitoring, a subtask of T1011: Enable continuous monitoring of deposit, as Resolved.
Jan 13 2020, 3:24 PM · Metrics/monitoring, SWORD deposit
vlorentz closed T2126: Production Vault end to end testing, a subtask of T2175: Deploy swh-icinga-plugins, as Resolved.
Jan 13 2020, 3:24 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring
vlorentz added subtasks for T2175: Deploy swh-icinga-plugins: T2118: Deposit: End to End monitoring, T2126: Production Vault end to end testing.
Jan 13 2020, 3:23 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring
vlorentz triaged T2175: Deploy swh-icinga-plugins as Normal priority.
Jan 13 2020, 3:23 PM · Sprint 2019/12 (Monitor and Conquer), System administration, Metrics/monitoring

Jan 6 2020

olasd closed T1202: swh services: Monitor swh-worker@.service's status as Resolved.

I guess https://grafana.softwareheritage.org/d/Gyww7RfWz/workers-overview?orgId=1 implements this.

Jan 6 2020, 4:28 PM · Metrics/monitoring, System administration

Dec 19 2019

olasd moved T2133: Scheduler listener/runner: add statsd probes from done to deployed on the Sprint 2019/12 (Monitor and Conquer) board.
Dec 19 2019, 2:07 PM · Metrics/monitoring, Scheduling utilities, Sprint 2019/12 (Monitor and Conquer)
olasd moved T1359: Add sentry support in every swh running service from done to deployed on the Sprint 2019/12 (Monitor and Conquer) board.
Dec 19 2019, 2:06 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
olasd moved T1358: Setup a sentry service from done to deployed on the Sprint 2019/12 (Monitor and Conquer) board.
Dec 19 2019, 2:06 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
olasd moved T1358: Setup a sentry service from in progress to done on the Sprint 2019/12 (Monitor and Conquer) board.
Dec 19 2019, 2:06 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
olasd closed T1358: Setup a sentry service as Resolved.

Sentry is now available at https://sentry.softwareheritage.org/.

Dec 19 2019, 10:19 AM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
zack closed T1359: Add sentry support in every swh running service as Resolved.

(marking as done as it was moved to the done column on the sprint board, please reopen if not ok)

Dec 19 2019, 10:06 AM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
zack closed T1359: Add sentry support in every swh running service , a subtask of T1358: Setup a sentry service, as Resolved.
Dec 19 2019, 10:06 AM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration

Dec 16 2019

vlorentz changed the status of T2118: Deposit: End to End monitoring, a subtask of T1011: Enable continuous monitoring of deposit, from Open to Work in Progress.
Dec 16 2019, 4:09 PM · Metrics/monitoring, SWORD deposit
vlorentz moved T1359: Add sentry support in every swh running service from in progress to done on the Sprint 2019/12 (Monitor and Conquer) board.
Dec 16 2019, 3:57 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration

Dec 11 2019

vlorentz closed T2142: Document how to use Sentry with the docker dev environment, a subtask of T1359: Add sentry support in every swh running service , as Resolved.
Dec 11 2019, 3:42 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
vlorentz closed T2142: Document how to use Sentry with the docker dev environment as Resolved.
Dec 11 2019, 3:42 PM · Docker environment, Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring
vlorentz added revisions to T1359: Add sentry support in every swh running service : D2428: Add sentry integration to the JS code., D2426: Initialize Sentry on Celery worker startup., D2423: Add sentry integration to swh-web, D2411: Make the CLI initialize sentry-sdk based on CLI options/envvars., D2418: Add gunicorn config script to initialize sentry-sdk based on envvars., D2420: Import gunicorn config from swh-core..
Dec 11 2019, 3:41 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
vlorentz claimed T1359: Add sentry support in every swh running service .
Dec 11 2019, 3:40 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
vlorentz moved T1359: Add sentry support in every swh running service from Backlog to in progress on the Sprint 2019/12 (Monitor and Conquer) board.
Dec 11 2019, 3:40 PM · Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring, System administration
vlorentz moved T2142: Document how to use Sentry with the docker dev environment from in progress to deployed on the Sprint 2019/12 (Monitor and Conquer) board.
Dec 11 2019, 3:40 PM · Docker environment, Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring
vlorentz added a comment to T2142: Document how to use Sentry with the docker dev environment.

Resolved by D2424.

Dec 11 2019, 3:39 PM · Docker environment, Sprint 2019/12 (Monitor and Conquer), Metrics/monitoring

Dec 10 2019

olasd added a comment to T2128: Monitor journal consumer lag.

Packaged and deployed the consumer group exporter on getty for both kafka clusters.

Dec 10 2019, 8:10 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer)