Page MenuHomeSoftware Heritage
Feed Advanced Search

Mar 25 2021

vsellier closed D5329: counters: fix wrong service port in prometheus job.
Mar 25 2021, 11:34 AM
vsellier committed rSPSITE15c5ae96f603: counters: fix wrong service port in prometheus job (authored by vsellier).
counters: fix wrong service port in prometheus job
Mar 25 2021, 11:34 AM
vsellier requested review of D5329: counters: fix wrong service port in prometheus job.
Mar 25 2021, 11:30 AM
vsellier closed D5324: counters: add a prometheus job to read the new metrics end-point.
Mar 25 2021, 9:27 AM
vsellier committed rSPSITE5db040600da7: counters: add a prometheus job to read the new metrics end-point (authored by vsellier).
counters: add a prometheus job to read the new metrics end-point
Mar 25 2021, 9:27 AM

Mar 24 2021

vsellier added a revision to T3164: Expose counters in prometheus format: D5324: counters: add a prometheus job to read the new metrics end-point.
Mar 24 2021, 6:42 PM · System administration, Monitoring
vsellier requested review of D5324: counters: add a prometheus job to read the new metrics end-point.
Mar 24 2021, 6:42 PM
vsellier committed rSENVf8e4410fec19: Update octocatalog-diff facts (authored by vsellier).
Update octocatalog-diff facts
Mar 24 2021, 6:36 PM
vsellier closed D5321: Allow prometheus to retrieve the counter values.
Mar 24 2021, 6:23 PM
vsellier committed rDCNT7dbe186b010a: Allow prometheus to retrieve the counter values (authored by vsellier).
Allow prometheus to retrieve the counter values
Mar 24 2021, 6:23 PM
vsellier committed rDENV3a7d8fdb9877: docker: Configure prometheus to retrieve swh-counters metrics (authored by vsellier).
docker: Configure prometheus to retrieve swh-counters metrics
Mar 24 2021, 6:21 PM
vsellier closed D5322: docker: Configure prometheus to retrieve swh-counters metrics.
Mar 24 2021, 6:21 PM
vsellier accepted D5323: keycloak/deposit: Drop option direct_grant_flow.

LGTM

Mar 24 2021, 6:20 PM
vsellier added a revision to T3164: Expose counters in prometheus format: D5322: docker: Configure prometheus to retrieve swh-counters metrics.
Mar 24 2021, 5:45 PM · System administration, Monitoring
vsellier requested review of D5322: docker: Configure prometheus to retrieve swh-counters metrics.
Mar 24 2021, 5:45 PM
vsellier requested review of D5321: Allow prometheus to retrieve the counter values.
Mar 24 2021, 5:33 PM
vsellier added a revision to T3164: Expose counters in prometheus format: D5321: Allow prometheus to retrieve the counter values.
Mar 24 2021, 5:32 PM · System administration, Monitoring
vsellier accepted D5317: Deploy memcached on deposit instance.

lgtm

Mar 24 2021, 10:39 AM
vsellier added a comment to T3164: Expose counters in prometheus format.

The current serie were the counters are stored is named sql_swh_archive_object_count, the serie for swh-counters could be swh_archive_object_count

Mar 24 2021, 10:28 AM · System administration, Monitoring
vsellier committed rSENV160a357c4621: vagrant: declare moma and its certificates (authored by vsellier).
vagrant: declare moma and its certificates
Mar 24 2021, 10:25 AM
vsellier changed the status of T3164: Expose counters in prometheus format, a subtask of T2912: Next generation archive counters, from Open to Work in Progress.
Mar 24 2021, 10:08 AM · Roadmap 2021, System administration, Monitoring, Web app
vsellier changed the status of T3164: Expose counters in prometheus format from Open to Work in Progress.
Mar 24 2021, 10:08 AM · System administration, Monitoring
vsellier closed T3086: Prepare disk replacement on granet as Resolved.

The 2 remaining disks were inserted in place of the 2 old sdd and sdf.
They needed to be configured in JBOD mode:

root@granet:~# megacli -PDMakeJBOD  -physdrv[32:3] -a0
Mar 24 2021, 10:06 AM · System administration

Mar 22 2021

vsellier removed a project from T3165: Generate historical data from the new counters series: Web app.
Mar 22 2021, 6:31 PM · System administration, Monitoring
vsellier triaged T3165: Generate historical data from the new counters series as Normal priority.
Mar 22 2021, 6:31 PM · System administration, Monitoring
vsellier triaged T3164: Expose counters in prometheus format as Normal priority.
Mar 22 2021, 5:50 PM · System administration, Monitoring
vsellier closed T3159: Deploy swh-counters:v0.1.0 in staging, a subtask of T2912: Next generation archive counters, as Resolved.
Mar 22 2021, 5:34 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3159: Deploy swh-counters:v0.1.0 in staging as Resolved.

A new vm counters0.internal.staging.swh.network is deployed and hosting redis, swh-counters and its journal-client.
The lag in staging will be recovered in a couple of hours.

Mar 22 2021, 5:34 PM · Staging environment, System administration, Monitoring
vsellier closed D5297: staging: Add counters0 vm.
Mar 22 2021, 5:09 PM
vsellier committed rSPREba5211fafa29: staging: Add counters0 vm (authored by vsellier).
staging: Add counters0 vm
Mar 22 2021, 5:09 PM
vsellier committed rSPSITE221db263a3f1: counters: fix the journal client configuration (authored by vsellier).
counters: fix the journal client configuration
Mar 22 2021, 4:06 PM
vsellier requested review of D5297: staging: Add counters0 vm.
Mar 22 2021, 3:40 PM
vsellier added a revision to T3159: Deploy swh-counters:v0.1.0 in staging: D5297: staging: Add counters0 vm.
Mar 22 2021, 3:40 PM · Staging environment, System administration, Monitoring
vsellier closed D5296: Add swh-counters deployment configuration.
Mar 22 2021, 11:38 AM
vsellier committed rSPSITE1618407da8f1: Add swh-counters deployment configuration (authored by vsellier).
Add swh-counters deployment configuration
Mar 22 2021, 11:38 AM
vsellier added inline comments to D5296: Add swh-counters deployment configuration.
Mar 22 2021, 9:47 AM
vsellier requested review of D5296: Add swh-counters deployment configuration.
Mar 22 2021, 8:32 AM
vsellier added a revision to T3159: Deploy swh-counters:v0.1.0 in staging: D5296: Add swh-counters deployment configuration.
Mar 22 2021, 8:32 AM · Staging environment, System administration, Monitoring
vsellier committed rSPPRIVC4b18183f034c: Add sentry token for swh-counters (authored by vsellier).
Add sentry token for swh-counters
Mar 22 2021, 8:16 AM

Mar 21 2021

vsellier committed rSENV70552433dbea: vagrant: add staging-counters0 (authored by vsellier).
vagrant: add staging-counters0
Mar 21 2021, 5:26 PM

Mar 19 2021

vsellier committed rDCNT8cc6b94ba70a: Remove the swh/__init__.py file from the package (authored by vsellier).
Remove the swh/__init__.py file from the package
Mar 19 2021, 5:38 PM
vsellier committed rDCNTf72c2b2ad3d3: Remove the swh/__init__.py file from the package (authored by vsellier).
Remove the swh/__init__.py file from the package
Mar 19 2021, 5:35 PM
vsellier committed rDENV18290e1da1ef: docker: Install swh-counters in the docker image (authored by vsellier).
docker: Install swh-counters in the docker image
Mar 19 2021, 2:17 PM
vsellier moved T3043: journalbeat:/filebeat Add an environment field on the logs from in-progress to Weekly backlog on the System administration board.
Mar 19 2021, 12:40 PM · System administration
vsellier moved T3159: Deploy swh-counters:v0.1.0 in staging from Backlog to in-progress on the System administration board.
Mar 19 2021, 12:39 PM · Staging environment, System administration, Monitoring
vsellier changed the status of T3159: Deploy swh-counters:v0.1.0 in staging from Open to Work in Progress.
Mar 19 2021, 12:39 PM · Staging environment, System administration, Monitoring
vsellier added a comment to T3086: Prepare disk replacement on granet.

To identify the disks to replace, the front led can be activated via the idrac interface.
The disks are :
scsi-35000c500ae759873 | /dev/sdd: Serial ZA1G3R1S -> Physical Disk 0:1:3
scsi-35000c500ae750b2f | /dev/sdf : Serial ZA1G3H81 -> Physical Disk 0:1:5

Mar 19 2021, 12:28 PM · System administration
vsellier added a comment to T3086: Prepare disk replacement on granet.

Removing the mirror mirror-2 with the disks scsi-35000c500ae750b2f and scsi-35000c500ae759873

root@granet:~# zpool remove hdd mirror-2
root@granet:~# zpool status hdd
  pool: hdd
 state: ONLINE
  scan: scrub repaired 0B in 0 days 17:39:22 with 0 errors on Sun Mar 14 18:03:23 2021
remove: Evacuation of mirror in progress since Fri Mar 19 10:45:22 2021
    1.03G copied out of 6.19T at 118M/s, 0.02% done, 15h18m to go
config:
Mar 19 2021, 11:46 AM · System administration
vsellier added a comment to T3086: Prepare disk replacement on granet.

And added to the hdd zfs pool:

  • before
root@granet:~# zpool list
NAME   SIZE  ALLOC   FREE  CKPOINT  EXPANDSZ   FRAG    CAP  DEDUP    HEALTH  ALTROOT
hdd   29.1T  22.8T  6.29T        -         -    20%    78%  1.00x    ONLINE  -
ssd   10.3T  7.91T  2.44T        -         -    24%    76%  1.00x    ONLINE  -
  • configuration
root@granet:~# ls -l /dev/disk/by-id | grep -e "wwn.*sdk" -e "wwn.*sdl"
lrwxrwxrwx 1 root root  9 Mar 19 10:10 wwn-0x5000c500cb46da4b -> ../../sdl
lrwxrwxrwx 1 root root  9 Mar 19 10:09 wwn-0x5000c500cb46e41b -> ../../sdk
root@granet:~# zpool add hdd mirror wwn-0x5000c500cb46da4b wwn-0x5000c500cb46e41b
root@granet:~# zpool status hdd
  pool: hdd
 state: ONLINE
  scan: scrub repaired 0B in 0 days 17:39:22 with 0 errors on Sun Mar 14 18:03:23 2021
config:
Mar 19 2021, 11:32 AM · System administration
vsellier added a comment to T3086: Prepare disk replacement on granet.

Disks configured in JBOD mode:

root@granet:~# megacli -PDMakeJBOD  -physdrv[32:10] -a0
Mar 19 2021, 11:15 AM · System administration
vsellier closed T3081: ZFS failures detected on belvedere as Resolved.

ZFS status was reset and a scrub restart after the upgrade of the zfs packages.
No more errors are detected.

Mar 19 2021, 11:13 AM · System administration
vsellier closed T3115: Upgrade zfs on all servers as Resolved.

All the servers were updated. We took the opportunity to upgrade and restart them to apply the last updates.

Mar 19 2021, 9:40 AM · System administration
vsellier closed T3115: Upgrade zfs on all servers, a subtask of T3081: ZFS failures detected on belvedere, as Resolved.
Mar 19 2021, 9:40 AM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 19 2021, 9:28 AM · System administration

Mar 18 2021

vsellier claimed T3086: Prepare disk replacement on granet.
Mar 18 2021, 6:43 PM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 18 2021, 3:50 PM · System administration
vsellier changed the status of T3086: Prepare disk replacement on granet from Open to Work in Progress.
Mar 18 2021, 3:19 PM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 18 2021, 3:12 PM · System administration
vsellier committed rSPSITE37f6fd13bfd8: webapp: use replica as main database on production (authored by vsellier).
webapp: use replica as main database on production
Mar 18 2021, 3:06 PM
vsellier closed D5278: webapp: use replica as main database on production.
Mar 18 2021, 3:06 PM
vsellier requested review of D5278: webapp: use replica as main database on production.
Mar 18 2021, 3:05 PM
vsellier added a revision to T3115: Upgrade zfs on all servers: D5278: webapp: use replica as main database on production.
Mar 18 2021, 3:05 PM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 18 2021, 1:18 PM · System administration
vsellier added a comment to T3115: Upgrade zfs on all servers.

plan for hypervisors / nodes upgrades:

  • beaubourg relative
  • workers[13..16]: stop services, upgrade package, shutdown, no needs to move them
  • stop azure workers
  • moma: upgrade packages, stop and restart on pompidou
  • tate: upgrade packages, stop and restart on pompidou
  • upgrade and stop somerset
  • upgrade beaubourg and restart beaubourg
  • restart somerset
  • upgrade moma configuration to use somerset as database
  • move back tate to beaubourg
  • move back moma to beaubourg
Mar 18 2021, 1:11 PM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 18 2021, 12:20 PM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 18 2021, 12:17 PM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 18 2021, 11:39 AM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 18 2021, 10:57 AM · System administration
vsellier added a comment to T3115: Upgrade zfs on all servers.
  • *esnode*
    • delaying node down detection and limit shard allocation to primaries
esnode2 ~ % export ES_NODE=192.168.100.62:9200                                              
esnode2 ~ % curl -XPUT -H "Content-Type: application/json" http://$ES_NODE/_cluster/settings -d '{
  "persistent": {
    "cluster.routing.allocation.enable": "primaries"
  }
}'
Mar 18 2021, 10:21 AM · System administration
vsellier updated subscribers of T3115: Upgrade zfs on all servers.

Plan:

  • first the upgrade will be done on the elasticsearch server
  • in parallel somerset can be updated
  • after the webapp can be configured to use somerset as the principal database
  • Upgrade of belveder
  • upgrade of saam (with the help of @olasd)
  • upgrade of belvedere
  • and finally the kafka servers
Mar 18 2021, 9:45 AM · System administration
vsellier updated the task description for T3115: Upgrade zfs on all servers.
Mar 18 2021, 9:41 AM · System administration

Mar 17 2021

vsellier closed T3147: Package swh-counters module as a debian package as Resolved.

branches unstable and buster configured
The builds passed and the package is available :

root@search0:~# apt search python3-swh.counters
Sorting... Done
Full Text Search... Done
python3-swh.counters/unknown 0.1.0-1+swh2~bpo10+1 all
  Software Heritage counters utilities
Mar 17 2021, 6:18 PM · System administration
vsellier closed T3147: Package swh-counters module as a debian package, a subtask of T2912: Next generation archive counters, as Resolved.
Mar 17 2021, 6:18 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier committed rDCNT01a38f0f57b5: Rebuild to unstuck the debian packaging. (authored by vsellier).
Rebuild to unstuck the debian packaging.
Mar 17 2021, 6:11 PM
vsellier committed rDCNTd99c18878822: Rebuild for buster-swh (authored by vsellier).
Rebuild for buster-swh
Mar 17 2021, 6:01 PM
vsellier committed rDCNTec090999bdbd: pristine-tar data for swh.counters_0.1.0.orig.tar.gz (authored by vsellier).
pristine-tar data for swh.counters_0.1.0.orig.tar.gz
Mar 17 2021, 5:51 PM
vsellier committed rDCNTf6f981721927: Initial packaging for swh.counters (authored by vsellier).
Initial packaging for swh.counters
Mar 17 2021, 5:50 PM
vsellier committed rDCNT5fbd90f1a4f7: New upstream version 0.1.0 (authored by vsellier).
New upstream version 0.1.0
Mar 17 2021, 5:50 PM
vsellier changed the status of T3147: Package swh-counters module as a debian package, a subtask of T2912: Next generation archive counters, from Open to Work in Progress.
Mar 17 2021, 4:24 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier changed the status of T3147: Package swh-counters module as a debian package from Open to Work in Progress.
Mar 17 2021, 4:24 PM · System administration
vsellier closed T3146: Add pytest-redis package on the swh repository, a subtask of T2912: Next generation archive counters, as Resolved.
Mar 17 2021, 4:24 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3146: Add pytest-redis package on the swh repository as Resolved.

Build is working for unstable and buster.

Mar 17 2021, 4:24 PM · System administration
vsellier committed rPPTREe54e46210740: Complete rebuild from unstable (authored by vsellier).
Complete rebuild from unstable
Mar 17 2021, 4:17 PM
vsellier committed rPPTRE188916bab9c7: Rebuild for buster-swh (authored by vsellier).
Rebuild for buster-swh
Mar 17 2021, 4:09 PM
vsellier committed rPPTRE2bb9e3996ab9: Rebuild for buster-swh (authored by vsellier).
Rebuild for buster-swh
Mar 17 2021, 3:59 PM
vsellier committed rPPTREb10c6d123840: Update changelog (authored by vsellier).
Update changelog
Mar 17 2021, 3:36 PM
vsellier committed rPPTRE071ca3fef7e6: Initial packaging for pytest-redis (authored by vsellier).
Initial packaging for pytest-redis
Mar 17 2021, 3:36 PM
vsellier closed D5263: jobs/dependency-packages: Add pytest-redis deps packages.
Mar 17 2021, 2:31 PM
vsellier committed rCJSWH6a4178b73e6b: jobs/dependency-packages: Add pytest-redis deps packages (authored by vsellier).
jobs/dependency-packages: Add pytest-redis deps packages
Mar 17 2021, 2:31 PM
vsellier updated the diff for D5263: jobs/dependency-packages: Add pytest-redis deps packages.

fix pkg name

Mar 17 2021, 2:26 PM
vsellier added a revision to T3146: Add pytest-redis package on the swh repository: D5263: jobs/dependency-packages: Add pytest-redis deps packages.
Mar 17 2021, 12:51 PM · System administration
vsellier requested review of D5263: jobs/dependency-packages: Add pytest-redis deps packages.
Mar 17 2021, 12:51 PM
vsellier committed rPPTREd0acf1a2f0b6: Initial packaging for pytest-redis (authored by vsellier).
Initial packaging for pytest-redis
Mar 17 2021, 12:36 PM
vsellier committed rPPTRE94dc9f68e8a4: New upstream version 2.0.0 (authored by vsellier).
New upstream version 2.0.0
Mar 17 2021, 12:36 PM
vsellier committed rPPTREf4ac919b5d29: pristine-tar data for pytest-redis_2.0.0.orig.tar.gz (authored by vsellier).
pristine-tar data for pytest-redis_2.0.0.orig.tar.gz
Mar 17 2021, 12:36 PM
vsellier committed rPPTREf173dd946edb: New upstream version 2.0.0 (authored by vsellier).
New upstream version 2.0.0
Mar 17 2021, 12:36 PM
vsellier added a comment to T3146: Add pytest-redis package on the swh repository.

New repository created and configured: https://forge.softwareheritage.org/source/pytest-redis/

Mar 17 2021, 12:30 PM · System administration
vsellier triaged T3147: Package swh-counters module as a debian package as Normal priority.
Mar 17 2021, 12:25 PM · System administration
vsellier changed the status of T3146: Add pytest-redis package on the swh repository from Open to Work in Progress.
Mar 17 2021, 12:20 PM · System administration

Mar 16 2021

vsellier closed D5253: Implement remote service.
Mar 16 2021, 6:34 PM