Page MenuHomeSoftware Heritage

MonitoringTag
ActivePublic

Members

  • This project does not have any members.
  • View All

Watchers

  • This project does not have any watchers.
  • View All

Details

Description

Related to grafana, prometheus, icinga, sentry, ...

Recent Activity

Fri, Sep 17

ardumont added a parent task for T3583: check icinga alert for svn save-code-now: T3458: save code now: Requests are not getting updated from time to time.
Fri, Sep 17, 9:34 AM · Scheduling utilities, Save Code Now, Monitoring
ardumont updated the task description for T3583: check icinga alert for svn save-code-now.
Fri, Sep 17, 9:33 AM · Scheduling utilities, Save Code Now, Monitoring

Tue, Sep 14

ardumont closed T3538: Send scheduler metrics to prometheus as Resolved.
Tue, Sep 14, 11:00 AM · System administration, Monitoring, Scheduling utilities
ardumont moved T3538: Send scheduler metrics to prometheus from in-progress to deployed/landed on the System administration board.
Tue, Sep 14, 11:00 AM · System administration, Monitoring, Scheduling utilities

Wed, Sep 8

vlorentz added a revision to T3376: Visualize metadata of a deposit in the admin (moderation) view: D6190: Add link to extrinsic metadata API from the browse view.
Wed, Sep 8, 3:36 PM · Monitoring, SWORD deposit, Web app

Fri, Sep 3

ardumont added a parent task for T3538: Send scheduler metrics to prometheus: T2345: Improve handling of recurrent loading tasks in scheduler.
Fri, Sep 3, 5:17 PM · System administration, Monitoring, Scheduling utilities
ardumont changed the status of T3538: Send scheduler metrics to prometheus from Open to Work in Progress.
Fri, Sep 3, 5:16 PM · System administration, Monitoring, Scheduling utilities
anlambert added a comment to T3375: Add column 'client' in moderation view.

Would you have time during September to help on this task? (it is a roadmap task, btw)

Fri, Sep 3, 3:16 PM · Monitoring, SWORD deposit, Web app
moranegg updated subscribers of T3375: Add column 'client' in moderation view.

@anlambert we have discussed this task this morning with @ardumont and @vlorentz.
I want to start working on the improvements of the deposit admin view to open it up for deposit clients.
Would you have time during September to help on this task? (it is a roadmap task, btw)

Fri, Sep 3, 12:24 PM · Monitoring, SWORD deposit, Web app
vlorentz claimed T3376: Visualize metadata of a deposit in the admin (moderation) view.

Easiest option would be to add a link to the API endpoint.

Fri, Sep 3, 12:07 PM · Monitoring, SWORD deposit, Web app
ardumont triaged T3548: Monitor and raise alert when save code now requests seem unusually long as Normal priority.
Fri, Sep 3, 11:27 AM · System administration, Save Code Now, Monitoring
ardumont added a revision to T3538: Send scheduler metrics to prometheus: D6177: Send scheduler metrics to prometheus.
Fri, Sep 3, 9:34 AM · System administration, Monitoring, Scheduling utilities
ardumont moved T3538: Send scheduler metrics to prometheus from Backlog to Weekly backlog on the System administration board.
Fri, Sep 3, 9:02 AM · System administration, Monitoring, Scheduling utilities
ardumont added a project to T3538: Send scheduler metrics to prometheus: System administration.
Fri, Sep 3, 9:02 AM · System administration, Monitoring, Scheduling utilities

Mon, Aug 30

ardumont triaged T3538: Send scheduler metrics to prometheus as Normal priority.
Mon, Aug 30, 10:48 AM · System administration, Monitoring, Scheduling utilities

Aug 6 2021

vsellier closed T2912: Next generation archive counters as Resolved.

The cleanup of the old counters is done so it can be closed

Aug 6 2021, 6:32 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3417: Cleanup the old counters environment as Resolved.
Aug 6 2021, 6:31 PM · System administration, Monitoring
vsellier closed T3417: Cleanup the old counters environment, a subtask of T2912: Next generation archive counters, as Resolved.
Aug 6 2021, 6:31 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier added a comment to T3417: Cleanup the old counters environment.
  • D6064 landed
  • manual cleanup:
    • the apache vhost was removed by puppet
    • /var/www/stats.export.softwareheritage.org directory removed
    • the crontab was removed by puppet
    • /usr/local/bin/export_archive_counters.py file removed
    • /usr/local/share/swh-data directory removed
  • the refresh of the database counter is now scheduled each monday at 6:29 AM
postgres@belvedere:~$ crontab -l | grep counter
29 6  *   *  mon     /usr/bin/chronic /usr/bin/flock -xn /srv/softwareheritage/postgres/swh-update-counter.lock /usr/bin/psql -p 5433 softwareheritage -c "select swh_update_counter(object_type) from object_counts where single_update = true order by last_update limit 1"
Aug 6 2021, 6:31 PM · System administration, Monitoring
ardumont added a comment to T3417: Cleanup the old counters environment.

For the counters frequency, I have still not found which cron/command regularly is responsible of lauching the refresh

Maybe looking around what calls the following functions could help.

Aug 6 2021, 10:02 AM · System administration, Monitoring
ardumont added a comment to T3417: Cleanup the old counters environment.

For the counters frequency, I have still not found which cron/command regularly is responsible of lauching the refresh

Aug 6 2021, 9:33 AM · System administration, Monitoring

Aug 5 2021

vsellier added a comment to T3417: Cleanup the old counters environment.

Pergamon manual cleanup after D6064 is apply:

  • Remove /var/www/stats.export.softwareheritage.org directory
  • Remove apache vhosts:
    • /etc/apache2/sites-enabled/25-stats.export.softwareheritage.org_non-ssl.conf
    • /etc/apache2/sites-enabled/25-stats.export.softwareheritage.org_ssl.conf
    • /etc/apache2/sites-available/25-stats.export.softwareheritage.org_non-ssl.conf
    • /etc/apache2/sites-available/25-stats.export.softwareheritage.org_ssl.conf
  • check crontab removal: export_archive_counters
  • remove '/usr/local/bin/export_archive_counters.py'
  • remove '/usr/local/share/swh-data' directory
Aug 5 2021, 7:43 PM · System administration, Monitoring
vsellier added a revision to T3417: Cleanup the old counters environment: D6064: Clean counter statistic scripts, data and vhosts on pergamon.
Aug 5 2021, 7:33 PM · System administration, Monitoring
vsellier changed the status of T3417: Cleanup the old counters environment, a subtask of T2912: Next generation archive counters, from Open to Work in Progress.
Aug 5 2021, 5:37 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier changed the status of T3417: Cleanup the old counters environment from Open to Work in Progress.
Aug 5 2021, 5:37 PM · System administration, Monitoring

Aug 4 2021

ardumont updated the task description for T3452: Replication lag between the dbs should raise icinga alerts.
Aug 4 2021, 10:05 AM · Monitoring, System administration
ardumont updated the task description for T3452: Replication lag between the dbs should raise icinga alerts.
Aug 4 2021, 10:04 AM · Monitoring, System administration

Aug 3 2021

vsellier closed T3452: Replication lag between the dbs should raise icinga alerts as Resolved.

The probe is deployed on the monitoring: https://icinga.softwareheritage.org/dashboard#!/monitoring/service/show?host=belvedere.internal.softwareheritage.org&service=Postgresql%20replication%20lag%20%28belvedere%20-%3E%20somerset%29

Aug 3 2021, 11:32 AM · Monitoring, System administration
vsellier added a revision to T3452: Replication lag between the dbs should raise icinga alerts: D6050: monitor postgresql replication lag through prometheus data.
Aug 3 2021, 9:03 AM · Monitoring, System administration

Aug 2 2021

ardumont changed the status of T3452: Replication lag between the dbs should raise icinga alerts from Open to Work in Progress.
Aug 2 2021, 6:27 PM · Monitoring, System administration
ardumont moved T3452: Replication lag between the dbs should raise icinga alerts from Backlog to Weekly backlog on the System administration board.
Aug 2 2021, 6:27 PM · Monitoring, System administration

Jul 30 2021

vlorentz added a project to T3452: Replication lag between the dbs should raise icinga alerts: Monitoring.
Jul 30 2021, 1:32 PM · Monitoring, System administration

Jul 29 2021

ardumont moved T3159: Deploy swh-counters:v0.1.0 in staging from in-progress to done on the System administration board.
Jul 29 2021, 1:23 PM · Staging environment, System administration, Monitoring
ardumont moved T2774: Fix vault end-to-end check from deployed/landed to done on the System administration board.
Jul 29 2021, 1:22 PM · Vault, System administration, Monitoring

Jun 29 2021

moranegg added a comment to T3376: Visualize metadata of a deposit in the admin (moderation) view.

An example of a metadata icon for the deposit-admin view:
https://thenounproject.com/term/metadata/77940/

Jun 29 2021, 9:22 PM · Monitoring, SWORD deposit, Web app
ardumont renamed T3173: Create profiles in keycloak for the deposit-client to view dedicated moderation page from Create profiles in keycloack for the deposit-client to view dedicated moderation page to Create profiles in keycloak for the deposit-client to view dedicated moderation page.
Jun 29 2021, 11:48 AM · Monitoring, SWORD deposit, Web app
vsellier updated the task description for T3417: Cleanup the old counters environment.
Jun 29 2021, 11:25 AM · System administration, Monitoring
vsellier triaged T3417: Cleanup the old counters environment as Normal priority.
Jun 29 2021, 11:20 AM · System administration, Monitoring
moranegg added a subtask for T3128: Improve deposit integration, management and display: T2858: Use keycloak authentication for the deposit.
Jun 29 2021, 11:12 AM · meta-task, Roadmap 2021, Monitoring, SWORD deposit, Web app
moranegg renamed T3174: Filter deposit-admin view by deposit client on admin (moderation) page from Filter deposit-admin view by deposit client on moderation page to Filter deposit-admin view by deposit client on admin (moderation) page.
Jun 29 2021, 10:57 AM · Monitoring, SWORD deposit, Web app
moranegg moved T3128: Improve deposit integration, management and display from Backlog to Work in progress on the Roadmap 2021 board.
Jun 29 2021, 10:53 AM · meta-task, Roadmap 2021, Monitoring, SWORD deposit, Web app
moranegg removed a project from T3174: Filter deposit-admin view by deposit client on admin (moderation) page: Roadmap 2021.
Jun 29 2021, 10:52 AM · Monitoring, SWORD deposit, Web app
vlorentz removed a project from T3377: Add icon/button in moderation view to go to deposit in new tab: Roadmap 2021.
Jun 29 2021, 10:51 AM · Monitoring, SWORD deposit, Web app
moranegg removed projects from T3376: Visualize metadata of a deposit in the admin (moderation) view: meta-task, Roadmap 2021.
Jun 29 2021, 10:50 AM · Monitoring, SWORD deposit, Web app
moranegg renamed T3376: Visualize metadata of a deposit in the admin (moderation) view from Visualize metadata of a deposit in the moderation view to Visualize metadata of a deposit in the admin (moderation) view.
Jun 29 2021, 10:40 AM · Monitoring, SWORD deposit, Web app
moranegg placed T3376: Visualize metadata of a deposit in the admin (moderation) view up for grabs.
Jun 29 2021, 10:39 AM · Monitoring, SWORD deposit, Web app

Jun 28 2021

ardumont added a project to T3174: Filter deposit-admin view by deposit client on admin (moderation) page: Roadmap 2021.
Jun 28 2021, 10:56 AM · Monitoring, SWORD deposit, Web app
ardumont added a subtask for T3174: Filter deposit-admin view by deposit client on admin (moderation) page: T2996: Add possibility to fetch a list of deposits on the deposit cli.
Jun 28 2021, 10:56 AM · Monitoring, SWORD deposit, Web app

Jun 24 2021

moranegg removed a project from T3377: Add icon/button in moderation view to go to deposit in new tab: meta-task.
Jun 24 2021, 11:00 AM · Monitoring, SWORD deposit, Web app
moranegg placed T3377: Add icon/button in moderation view to go to deposit in new tab up for grabs.
Jun 24 2021, 11:00 AM · Monitoring, SWORD deposit, Web app