
Jan 28 2022

vsellier requested review of D6926: First iteration of prometheus export of the e2e metrics.
Jan 28 2022, 5:00 PM
vsellier accepted D7048: base_check: Add typing annotation to constructor.

Thanks for making mypy happy ;)

Jan 28 2022, 4:46 PM

Jan 27 2022

vsellier accepted D7046: vagrant: Fix dns search domains per deployment.

LGTM. It's not really ideal, but it avoids breaking the subnet splitting we currently use.

Jan 27 2022, 4:09 PM
vsellier added inline comments to D7045: Migrate sentry node to admin vlan.
Jan 27 2022, 11:38 AM
vsellier changed the status of T3889: Admin database backup from Open to Work in Progress.
Jan 27 2022, 10:50 AM · System administration
vsellier added a comment to T3889: Admin database backup.

Unfortunately, I didn't find any tool that natively supports backing up PostgreSQL through ZFS snapshots.
We have a couple of alternatives:

  • Use borg to implement a clean backup of the postgresql data directory based on hooks [1]
  • Use a tool like pgbackrest[2]; the backup type and storage target need to be chosen (S3, storage server, ...)
  • Implement a backup based on zfs snapshots, which should not be too complicated, but we would need to manage all the plumbing for full/incremental backups and the cleanup of the backup history
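
As an illustration of the history-cleaning part of the third option, here is a minimal sketch of a retention rule. It is not the script from P1269: the function name, the snapshot names and the 7-day window are all assumptions made for the example. It only decides which snapshots are old enough to be destroyed and always keeps the newest one, so that an incremental backup still has a base to start from.

```python
from datetime import datetime, timedelta


def snapshots_to_prune(snapshots, now, keep_days=7):
    """Return the names of snapshots older than the retention window.

    `snapshots` is a list of (name, creation_datetime) tuples.
    The newest snapshot is never returned, whatever its age.
    """
    cutoff = now - timedelta(days=keep_days)
    ordered = sorted(snapshots, key=lambda item: item[1])
    candidates = ordered[:-1]  # always keep the most recent snapshot
    return [name for name, created in candidates if created < cutoff]


# Hypothetical snapshot names and dates, for illustration only.
snaps = [
    ("pg@daily-2022-01-15", datetime(2022, 1, 15)),
    ("pg@daily-2022-01-22", datetime(2022, 1, 22)),
    ("pg@daily-2022-01-26", datetime(2022, 1, 26)),
]
print(snapshots_to_prune(snaps, now=datetime(2022, 1, 27, 10, 0)))
```

The actual deletion of the snapshots, and the full/incremental transfer itself, are left out on purpose; they are the "plumbing" mentioned above.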
Jan 27 2022, 10:50 AM · System administration
vsellier added a comment to P1269 quick and dirty postgresql backup script based on zfs snapshot.

related to T3889

Jan 27 2022, 10:26 AM
vsellier created P1269 quick and dirty postgresql backup script based on zfs snapshot.
Jan 27 2022, 10:25 AM

Jan 25 2022

vsellier updated the task description for T3889: Admin database backup.
Jan 25 2022, 5:28 PM · System administration
vsellier created T3889: Admin database backup.
Jan 25 2022, 5:27 PM · System administration
vsellier closed D6979: Document the mirror credentials management.
Jan 25 2022, 5:05 PM
vsellier committed rDDOC4ac8648fb0a7: Document the mirror credentials management (authored by vsellier).
Document the mirror credentials management
Jan 25 2022, 5:05 PM
vsellier updated the diff for D6979: Document the mirror credentials management.
  • rebase
  • update according to the feedback
Jan 25 2022, 4:57 PM
vsellier added inline comments to D6979: Document the mirror credentials management.
Jan 25 2022, 4:36 PM
vsellier accepted D7034: common/origin_save: Fix extraction of elasticsearch document fields.

LGTM

Jan 25 2022, 4:34 PM
vsellier accepted D7035: Automate weekly-planning script.

Thanks, just a small inline remark regarding the httpie package installation

Jan 25 2022, 4:28 PM

Jan 24 2022

vsellier closed D7018: netbox: migrate the vm to the admin vlan.
Jan 24 2022, 3:05 PM
vsellier committed rSPSITEd6d9ca9a373c: netbox: migrate the vm to the admin vlan (authored by vsellier).
netbox: migrate the vm to the admin vlan
Jan 24 2022, 3:05 PM
vsellier updated the diff for D7018: netbox: migrate the vm to the admin vlan.

rebase

Jan 24 2022, 3:04 PM
vsellier accepted D7023: Add missing maven lister task subscription to lister config.
Jan 24 2022, 12:00 PM

Jan 21 2022

vsellier closed D7020: Configure the kafka clusters environment for the consumer lag exporter.
Jan 21 2022, 7:07 PM
vsellier committed rSPSITEe2fd42a3f28c: Configure the kafka clusters environment for the consumer lag exporter (authored by vsellier).
Configure the kafka clusters environment for the consumer lag exporter
Jan 21 2022, 7:07 PM
vsellier requested review of D7020: Configure the kafka clusters environment for the consumer lag exporter.
Jan 21 2022, 7:00 PM
vsellier updated the diff for D7018: netbox: migrate the vm to the admin vlan.

rebase

Jan 21 2022, 5:45 PM
vsellier closed D6986: kafka: add a script to create the kafka credentials.
Jan 21 2022, 5:43 PM
vsellier committed rSPSITE87f392051fb0: vagrant: declare the getty ip (authored by vsellier).
vagrant: declare the getty ip
Jan 21 2022, 5:43 PM
vsellier committed rSPSITE847db4bbe80f: kafka: add a script to create the kafka credentials (authored by vsellier).
kafka: add a script to create the kafka credentials
Jan 21 2022, 5:43 PM
vsellier updated the task description for T3873: Migrate inventory to the admin vlan.
Jan 21 2022, 5:35 PM · System administration
vsellier requested review of D7018: netbox: migrate the vm to the admin vlan.
Jan 21 2022, 5:32 PM
vsellier added a revision to T3873: Migrate inventory to the admin vlan: D7018: netbox: migrate the vm to the admin vlan.
Jan 21 2022, 5:32 PM · System administration
vsellier committed rSENV7c0fa9643d40: Prepare the bojiman migration to the admin vlan (authored by vsellier).
Prepare the bojiman migration to the admin vlan
Jan 21 2022, 5:18 PM
vsellier updated the task description for T3873: Migrate inventory to the admin vlan.
Jan 21 2022, 4:51 PM · System administration
vsellier committed rSENVa00824d0f4f5: vagrant: update the facts at each run (authored by vsellier).
vagrant: update the facts at each run
Jan 21 2022, 4:18 PM
vsellier updated the task description for T3873: Migrate inventory to the admin vlan.
Jan 21 2022, 3:46 PM · System administration
vsellier updated the task description for T3873: Migrate inventory to the admin vlan.
Jan 21 2022, 3:26 PM · System administration
vsellier updated the task description for T3873: Migrate inventory to the admin vlan.
Jan 21 2022, 3:14 PM · System administration
vsellier changed the status of T3873: Migrate inventory to the admin vlan from Open to Work in Progress.
Jan 21 2022, 3:13 PM · System administration
vsellier updated the diff for D6986: kafka: add a script to create the kafka credentials.

fix the typo on the getty hostname

Jan 21 2022, 12:26 PM
vsellier updated the diff for D6979: Document the mirror credentials management.
  • Rebase
  • Update the kafka credentials part to use the new user management script
  • Add a section to describe the username pattern
  • Add a section to describe how to create the read-only storage credentials
Jan 21 2022, 11:52 AM
vsellier closed D6993: fix the admin layout.
Jan 21 2022, 10:16 AM
vsellier committed rDDOC50f23abeb1e2: fix the admin layout (authored by vsellier).
fix the admin layout
Jan 21 2022, 10:16 AM
vsellier updated the diff for D6993: fix the admin layout.

rebase

Jan 21 2022, 10:16 AM
vsellier requested review of D6993: fix the admin layout.
Jan 21 2022, 10:13 AM
vsellier updated the diff for D6986: kafka: add a script to create the kafka credentials.

add the puppet header on the script

Jan 21 2022, 9:41 AM

Jan 20 2022

vsellier updated the test plan for D6986: kafka: add a script to create the kafka credentials.
Jan 20 2022, 7:32 PM
vsellier updated the diff for D6986: kafka: add a script to create the kafka credentials.
  • Install the scripts for all the environments in getty, the journal orchestrator;
  • as the cluster configurations are now global, this impacts the consumer group exporter. It makes sense to move it from storage1 to getty to centralize this part as well (FW rules will need to be adapted accordingly);
Jan 20 2022, 7:32 PM
vsellier committed rSENV52b1bc7a1ac7: vagrant: Declare the getty node (authored by vsellier).
vagrant: Declare the getty node
Jan 20 2022, 7:04 PM
vsellier committed rSENVc0a1faa68152: vagrant: Declare the default vm type to debian (authored by vsellier).
vagrant: Declare the default vm type to debian
Jan 20 2022, 6:07 PM
vsellier committed rSPSITE58fe00948b49: staging: remove duplicate properties (authored by vsellier).
staging: remove duplicate properties
Jan 20 2022, 11:36 AM
vsellier committed rSPSITEa5ae23ec1c37: staging: Use the right elasticsearch cluster to display the scn task logs (authored by vsellier).
staging: Use the right elasticsearch cluster to display the scn task logs
Jan 20 2022, 11:36 AM
vsellier committed rSENV2530f6f3b1a5: remove the fact of the decommissioned webapp0 production server (authored by vsellier).
remove the fact of the decommissioned webapp0 production server
Jan 20 2022, 11:36 AM
vsellier added a comment to D6986: kafka: add a script to create the kafka credentials.

Yes, it could. It will be cleaner, so I will update it this way.
I took a shortcut, assuming we would know which brokers are up, to avoid having to build the connection string :)

Jan 20 2022, 11:13 AM
vsellier updated the diff for D6986: kafka: add a script to create the kafka credentials.

rebase

Jan 20 2022, 10:59 AM
vsellier requested review of D6986: kafka: add a script to create the kafka credentials.
Jan 20 2022, 10:54 AM

Jan 19 2022

vsellier closed T3857: unexpected long time between a task is received and the loading start for few tasks as Resolved.

The ack late property is deployed in staging and production.
The number of scn in the 'scheduled' status should decrease.
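
For reference, a sketch of what late acknowledgement looks like in a Celery configuration module. `task_acks_late` and `worker_prefetch_multiplier` are standard Celery settings; the values below follow the "reserve one task at a time" recommendation from the Celery documentation and are an assumption, not a copy of the configuration deployed by D6974.

```python
# celeryconfig-style sketch (assumed values, not the deployed configuration).

# Acknowledge a message only after the task has finished, so a busy
# worker process does not hold an already acknowledged task in reserve.
task_acks_late = True

# Each worker process prefetches at most one message at a time.
worker_prefetch_multiplier = 1
```

With early acknowledgement, a worker process can reserve an extra task while a long-running one is still executing, which matches the delay observed in T3857.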

Jan 19 2022, 4:21 PM · System administration, Scheduling utilities
vsellier requested review of D6979: Document the mirror credentials management.
Jan 19 2022, 3:33 PM
vsellier added a revision to T3829: Document mirror - how to create and deploy a mirror from scratch: D6979: Document the mirror credentials management.
Jan 19 2022, 3:23 PM · Mirror
vsellier closed D6974: Reactivate the ack late configuration for the high priority tasks.
Jan 19 2022, 2:31 PM
vsellier committed rSPSITE768ce43e60f9: Reactivate the ack late configuration for the high priority tasks (authored by vsellier).
Reactivate the ack late configuration for the high priority tasks
Jan 19 2022, 2:31 PM
vsellier updated the diff for D6974: Reactivate the ack late configuration for the high priority tasks.

rebase

Jan 19 2022, 2:31 PM
vsellier closed D6966: Move mirror documentation from devel to sysadm.
Jan 19 2022, 2:23 PM
vsellier committed rDDOC09ef5e2c6aa7: Move mirror documentation from devel to sysadm (authored by vsellier).
Move mirror documentation from devel to sysadm
Jan 19 2022, 2:23 PM
vsellier updated the diff for D6966: Move mirror documentation from devel to sysadm.

fix according to the review

Jan 19 2022, 2:10 PM
vsellier updated the diff for D6966: Move mirror documentation from devel to sysadm.
  • rebase
Jan 19 2022, 12:29 PM
vsellier requested review of D6966: Move mirror documentation from devel to sysadm.
Jan 19 2022, 12:27 PM
vsellier requested review of D6974: Reactivate the ack late configuration for the high priority tasks.
Jan 19 2022, 11:56 AM
vsellier added a revision to T3857: unexpected long time between a task is received and the loading start for few tasks: D6974: Reactivate the ack late configuration for the high priority tasks.
Jan 19 2022, 11:56 AM · System administration, Scheduling utilities

Jan 18 2022

vsellier closed D6964: Remove deprecated documentation.
Jan 18 2022, 5:03 PM
vsellier committed rCDFPbfcc0229921f: Remove deprecated documentation (authored by vsellier).
Remove deprecated documentation
Jan 18 2022, 5:03 PM
vsellier added a revision to T3829: Document mirror - how to create and deploy a mirror from scratch: D6966: Move mirror documentation from devel to sysadm.
Jan 18 2022, 5:00 PM · Mirror
vsellier committed rDDOC578f414acaed: Fix link ref (authored by vsellier).
Fix link ref
Jan 18 2022, 4:26 PM
vsellier updated the diff for D6964: Remove deprecated documentation.

Add the related task

Jan 18 2022, 3:53 PM
vsellier requested review of D6964: Remove deprecated documentation.
Jan 18 2022, 3:44 PM
vsellier added a comment to T3857: unexpected long time between a task is received and the loading start for few tasks.

It looks like this is the normal behavior of celery with the current configuration:

Jan 18 2022, 12:25 PM · System administration, Scheduling utilities
vsellier changed the status of T3857: unexpected long time between a task is received and the loading start for few tasks from Open to Work in Progress.
Jan 18 2022, 11:53 AM · System administration, Scheduling utilities
vsellier added a comment to T3857: unexpected long time between a task is received and the loading start for few tasks.

After analyzing the logs further, it seems the celery process accepts the task eb038b42-713e-4c23-a655-17d04f23870a while a previous loading is still running.
Logs with more context:

  • Previous task is received and started:
Jan 17 16:19:31 worker14 python3[2180500]: [2022-01-17 16:19:31,868: INFO/MainProcess] Received task: swh.loader.git.tasks.UpdateGitRepository[70c13ea8-19f4-4ac7-ae4f-ea71f80fc25a]  
Jan 17 16:19:32 worker14 python3[2721731]: [2022-01-17 16:19:32,467: INFO/ForkPoolWorker-133] Load origin 'https://git.libreoffice.org/translations' with type 'git'
  • The new task is accepted whereas the previous one is not yet finished:
Jan 17 16:26:10 worker14 python3[2180500]: [2022-01-17 16:26:10,683: INFO/MainProcess] Received task: swh.loader.git.tasks.UpdateGitRepository[eb038b42-713e-4c23-a655-17d04f23870a]
  • the first task ends:
Jan 17 19:48:20 worker14 python3[2721731]: [51B blob data]
Jan 17 19:48:20 worker14 python3[2721731]: [3.6K blob data]
Jan 17 19:49:57 worker14 python3[2721731]: [2022-01-17 19:49:57,085: ERROR/ForkPoolWorker-133] Loading failure, updating to `failed` status
                                           Traceback (most recent call last):
                                             File "/usr/lib/python3/dist-packages/swh/loader/core/loader.py", line 338, in load
                                               more_data_to_fetch = self.fetch_data()
                                             File "/usr/lib/python3/dist-packages/swh/loader/git/loader.py", line 277, in fetch_data
                                               self.origin.url, base_repo, do_progress
                                             File "/usr/lib/python3/dist-packages/swh/loader/git/loader.py", line 213, in fetch_pack_from_origin
                                               progress=do_activity,
                                             File "/usr/lib/python3/dist-packages/dulwich/client.py", line 2001, in fetch_pack
                                               progress,
                                             File "/usr/lib/python3/dist-packages/dulwich/client.py", line 845, in _handle_upload_pack_tail
                                               SIDE_BAND_CHANNEL_PROGRESS: progress,
                                             File "/usr/lib/python3/dist-packages/dulwich/client.py", line 604, in _read_side_band64k_data
                                               cb(pkt)
                                             File "/usr/lib/python3/dist-packages/swh/loader/git/loader.py", line 201, in do_pack
                                               f"Pack file too big for repository {origin_url}, "
                                           OSError: Pack file too big for repository https://git.libreoffice.org/translations, limit is 4294967296 bytes, current size is 4294966867, would write 65515
Jan 17 19:49:59 worker14 python3[2721731]: [2022-01-17 19:49:59,468: INFO/ForkPoolWorker-133] Task swh.loader.git.tasks.UpdateGitRepository[70c13ea8-19f4-4ac7-ae4f-ea71f80fc25a] succeeded in 12627.452697345056s: {'status': 'failed'}
  • the second task loading starts after 3 hours:
Jan 17 19:50:02 worker14 python3[2736039]: [2022-01-17 19:50:02,768: INFO/ForkPoolWorker-134] Load origin 'https://github.com/webtorrent/webtorrent-desktop' with type 'git'
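
The delay can be checked directly from the two timestamps above: "Received task" for eb038b42-713e-4c23-a655-17d04f23870a and the matching "Load origin" line. This is only a small sketch; the helper name `parse_ts` is made up for the example.

```python
from datetime import datetime

# Timestamp format used inside the celery log lines above.
LOG_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def parse_ts(text):
    """Parse a celery log timestamp such as '2022-01-17 16:26:10,683'."""
    return datetime.strptime(text, LOG_FORMAT)


received = parse_ts("2022-01-17 16:26:10,683")  # task received by MainProcess
started = parse_ts("2022-01-17 19:50:02,768")   # loading actually starts
delay = started - received
print(delay)  # 3:23:52.085000
```

So the second task waited a little under three and a half hours, roughly the time the first loading was still running after the second task was received.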
Jan 18 2022, 11:53 AM · System administration, Scheduling utilities
vsellier triaged T3857: unexpected long time between a task is received and the loading start for few tasks as Normal priority.
Jan 18 2022, 11:46 AM · System administration, Scheduling utilities

Jan 14 2022

vsellier closed D6934: journalbeat: create a missing directory not created in the last packages.

closed by 0256827f9ef17fba02126492cd26448908803b8c

Jan 14 2022, 5:36 PM
vsellier closed D6872: create the admin vm grafana0.
Jan 14 2022, 5:35 PM
vsellier committed rSPRE6230309a88c2: create the admin vm grafana0 (authored by vsellier).
create the admin vm grafana0
Jan 14 2022, 5:35 PM
vsellier updated the diff for D6872: create the admin vm grafana0.

Update after the apply

Jan 14 2022, 5:34 PM
vsellier closed T3817: Install grafana on its own server, a subtask of T3808: Migrate pergamon to bullseye, as Resolved.
Jan 14 2022, 5:13 PM · System administration (Component upgrades)
vsellier closed T3817: Install grafana on its own server as Resolved.
Jan 14 2022, 5:13 PM · System administration
vsellier added a comment to T3817: Install grafana on its own server.
  • pergamon cleanup done with D6952.
Jan 14 2022, 5:13 PM · System administration
vsellier closed D6952: grafana: clean pergamon resources.
Jan 14 2022, 4:58 PM
vsellier committed rSPSITEd23774db02bd: grafana: clean pergamon resources (authored by vsellier).
grafana: clean pergamon resources
Jan 14 2022, 4:58 PM
vsellier updated the diff for D6952: grafana: clean pergamon resources.

rebase

Jan 14 2022, 4:57 PM
vsellier updated the diff for D6952: grafana: clean pergamon resources.

remove unnecessary spaces

Jan 14 2022, 4:57 PM
vsellier updated the diff for D6952: grafana: clean pergamon resources.

restore useful removed comment

Jan 14 2022, 4:18 PM
vsellier added a revision to T3817: Install grafana on its own server: D6952: grafana: clean pergamon resources.
Jan 14 2022, 4:12 PM · System administration
vsellier requested review of D6952: grafana: clean pergamon resources.
Jan 14 2022, 4:12 PM
vsellier accepted D6951: sentry: Migrate sentry db to use the new one.
Jan 14 2022, 4:09 PM
vsellier committed rSPSITEcfed293525b9: grafana: disable grafana0 backups (authored by vsellier).
grafana: disable grafana0 backups
Jan 14 2022, 3:27 PM
vsellier added a comment to T3817: Install grafana on its own server.
  • grafana0 server created
  • configuration applied on the reverse proxy
  • network configuration checks:
    • grafana0 -> pergamon:9090 (prometheus): Allowed for all in a floating rule
    • grafana0 -> esnode[1-3]:9200 (elasticsearch queries):
  • public DNS updated to use rp1 (swh-rproxy3.inria.fr) as the public entry point
  • pergamon apache configuration temporarily updated to point to the new grafana0 backend server (and a temporary rule added on the firewall to allow it)
Jan 14 2022, 12:37 PM · System administration
vsellier committed rSPSITEae13e62c814a: grafana: fix the missing database name properties (authored by vsellier).
grafana: fix the missing database name properties
Jan 14 2022, 11:37 AM
vsellier committed rSPSITE8b670491bf64: webapp: Temporary use the main database (authored by vsellier).
webapp: Temporary use the main database
Jan 14 2022, 11:02 AM
vsellier closed D6949: webapp: Temporary use the main database.
Jan 14 2022, 11:02 AM
vsellier requested review of D6949: webapp: Temporary use the main database.
Jan 14 2022, 10:53 AM
vsellier closed D6871: Move grafana on a dedicated server behind the admin RP.
Jan 14 2022, 10:41 AM
vsellier committed rSPSITE8aa49be10300: Move grafana on a dedicated server behind the admin RP (authored by vsellier).
Move grafana on a dedicated server behind the admin RP
Jan 14 2022, 10:41 AM