Apr 14 2021

vsellier updated the diff for D5518: Make the project usable in the webapp tests.

rebase

Apr 14 2021, 11:17 AM
vsellier updated the diff for D5518: Make the project usable in the webapp tests.

Take the review feedback into consideration

Apr 14 2021, 11:16 AM
vsellier requested review of D5518: Make the project usable in the webapp tests.
Apr 14 2021, 10:19 AM
vsellier added a revision to T3231: Make the source of the object counts configurable: D5518: Make the project usable in the webapp tests.
Apr 14 2021, 10:18 AM · System administration, Monitoring, Web app

Apr 13 2021

vsellier closed D5502: Add a method get several counter values at once.
Apr 13 2021, 7:09 PM
vsellier committed rDCNT38eab384f275: Add a method get several counter values at once (authored by vsellier).
Add a method get several counter values at once
Apr 13 2021, 7:09 PM
vsellier closed T3232: remove hardcoded historical values from the webapp, a subtask of T2912: Next generation archive counters, as Resolved.
Apr 13 2021, 6:59 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3232: remove hardcoded historical values from the webapp as Resolved.

swh-web v0.0.295 is released and deployed on staging and production.

Apr 13 2021, 6:59 PM · Monitoring, Web app
vsellier added a comment to T3232: remove hardcoded historical values from the webapp.

The puppet script generating the aggregated data has been updated and was run to refresh the data.
The webapp can now be released with this diff.

Apr 13 2021, 6:02 PM · Monitoring, Web app
vsellier removed a task from D5502: Add a method get several counter values at once: T3232: remove hardcoded historical values from the webapp.
Apr 13 2021, 6:01 PM
vsellier removed a revision from T3232: remove hardcoded historical values from the webapp: D5502: Add a method get several counter values at once.
Apr 13 2021, 6:01 PM · Monitoring, Web app
vsellier updated the summary of D5502: Add a method get several counter values at once.
Apr 13 2021, 6:01 PM
vsellier added a revision to T3231: Make the source of the object counts configurable: D5502: Add a method get several counter values at once.
Apr 13 2021, 6:01 PM · System administration, Monitoring, Web app
vsellier updated the diff for D5502: Add a method get several counter values at once.

Reference the right task

Apr 13 2021, 6:01 PM
vsellier closed D5494: counters: Aggregate values for origin and revision graphs.
Apr 13 2021, 5:44 PM
vsellier committed rSPSITE7ec10abc3cb9: counters: Aggregate values for origin and revision graphs (authored by vsellier).
counters: Aggregate values for origin and revision graphs
Apr 13 2021, 5:44 PM
vsellier requested review of D5502: Add a method get several counter values at once.
Apr 13 2021, 5:44 PM
vsellier added a revision to T3232: remove hardcoded historical values from the webapp: D5502: Add a method get several counter values at once.
Apr 13 2021, 5:43 PM · Monitoring, Web app
vsellier added inline comments to D5494: counters: Aggregate values for origin and revision graphs.
Apr 13 2021, 3:24 PM
vsellier triaged T3242: Decommission ClearlyDefined resources as Normal priority.
Apr 13 2021, 3:13 PM · System administration
vsellier closed D5490: counters: Move the hardcoded points to the static historical file.
Apr 13 2021, 3:12 PM
vsellier committed rDWAPPS78a038b13927: counters: Move the hardcoded points to the static historical file (authored by vsellier).
counters: Move the hardcoded points to the static historical file
Apr 13 2021, 3:12 PM
vsellier added a revision to T3232: remove hardcoded historical values from the webapp: D5494: counters: Aggregate values for origin and revision graphs.
Apr 13 2021, 3:10 PM · Monitoring, Web app
vsellier requested review of D5494: counters: Aggregate values for origin and revision graphs.
Apr 13 2021, 3:10 PM
vsellier added a comment to T3232: remove hardcoded historical values from the webapp.

One additional point before releasing this: the puppet script doing the aggregation needs to be improved, as it only merges the data for the content graph:
https://forge.softwareheritage.org/source/puppet-swh-site/browse/production/site-modules/profile/files/stats_exporter/export_archive_counters.py$109

Apr 13 2021, 12:50 PM · Monitoring, Web app
vsellier changed the status of T3231: Make the source of the object counts configurable, a subtask of T2912: Next generation archive counters, from Open to Work in Progress.
Apr 13 2021, 12:14 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier changed the status of T3231: Make the source of the object counts configurable from Open to Work in Progress.
Apr 13 2021, 12:14 PM · System administration, Monitoring, Web app
vsellier requested review of D5490: counters: Move the hardcoded points to the static historical file.
Apr 13 2021, 12:14 PM
vsellier added a revision to T3232: remove hardcoded historical values from the webapp: D5490: counters: Move the hardcoded points to the static historical file.
Apr 13 2021, 12:07 PM · Monitoring, Web app
vsellier added a comment to T3232: remove hardcoded historical values from the webapp.

P1005 converts the data added by the webapp to JSON data that can be added to the /usr/local/share/swh-data/history-counters.munin.json file.
This content can be added to the file before the webapp change is released. It will just add a few duplicate points to render, with no effect on the final rendering.
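
For illustration only, here is a minimal Python sketch of how such points could be merged into an existing history mapping while dropping exact duplicates. It assumes the file holds a JSON object mapping each object type to a list of [timestamp in milliseconds, value] pairs, as in the P1005 output. The function name merge_history and the sample values are hypothetical, and this is not the script actually used.

```python
import json


def merge_history(existing, extra):
    """Merge two {object_type: [[timestamp_ms, value], ...]} mappings.

    Exact duplicate points are dropped and each series is sorted by timestamp.
    Hypothetical helper: the real file layout is assumed, not verified.
    """
    merged = {}
    for object_type in sorted(set(existing) | set(extra)):
        points = existing.get(object_type, []) + extra.get(object_type, [])
        # A set of tuples removes points that appear in both inputs.
        unique = {(int(ts), int(value)) for ts, value in points}
        merged[object_type] = [list(point) for point in sorted(unique)]
    return merged


if __name__ == "__main__":
    # Small made-up inputs sharing one duplicated point.
    current = {"origin": [[1441065600000, 0], [1467331200000, 22777052]]}
    converted = {
        "origin": [[1467331200000, 22777052], [1473811200000, 25258776]],
        "content": [[1441065600000, 0]],
    }
    print(json.dumps(merge_history(current, converted)))
```

De-duplicating is optional here, since duplicate points do not change the rendering. It only keeps the file smaller.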

Apr 13 2021, 11:21 AM · Monitoring, Web app
vsellier added a comment to P1005 Convert webapp historical data to historical json.

This is the result to add to the current historical data:

{"revision": [[1441065600000, 0], [1467331200000, 594305600], [1473811200000, 644628800], [1479945600000, 704845952], [1494374400000, 780882048], [1506384000000, 853277241], [1516752000000, 943061517], [1518480000000, 946216028], [1521936000000, 980390191], [1538611200000, 1126348335], [1548547200000, 1248389319], [1554681600000, 1293870115], [1561593600000, 1326776432], [1563926400000, 1358421267], [1569110400000, 1379380527], [1569715200000, 1385477933], [1577836800000, 1414420369], [1580947200000, 1428955761], [1586217600000, 1590436149], [1589673600000, 1717420203], [1590537600000, 1744034936]], "origin": [[1441065600000, 0], [1467331200000, 22777052], [1473811200000, 25258776], [1479945600000, 53488904], [1494374400000, 58257484], [1506384000000, 65546644], [1516752000000, 71814787], [1518480000000, 81655813], [1521936000000, 83797945], [1538611200000, 85202432], [1548547200000, 88288721], [1554681600000, 88297714], [1561593600000, 89301694], [1563926400000, 89601149], [1569110400000, 90231104], [1569715200000, 90487661], [1577836800000, 91400586], [1580947200000, 91512130], [1586217600000, 107875943], [1589673600000, 121172621], [1590537600000, 123781438]], "content": [[1441065600000, 0]]}
Apr 13 2021, 11:17 AM
vsellier created P1005 Convert webapp historical data to historical json.
Apr 13 2021, 11:17 AM
vsellier changed the status of T3232: remove hardcoded historical values from the webapp, a subtask of T2912: Next generation archive counters, from Open to Work in Progress.
Apr 13 2021, 9:49 AM · Roadmap 2021, System administration, Monitoring, Web app
vsellier changed the status of T3232: remove hardcoded historical values from the webapp from Open to Work in Progress.
Apr 13 2021, 9:49 AM · Monitoring, Web app

Apr 12 2021

vsellier changed the status of T3243: Replace /dev/sdb and /dev/sdc on storage1.staging, a subtask of T3236: staging: Disk error on storage1, from Open to Work in Progress.
Apr 12 2021, 7:31 PM · System administration, Staging environment
vsellier changed the status of T3243: Replace /dev/sdb and /dev/sdc on storage1.staging from Open to Work in Progress.

The disks are removed from the zfs pool. The replacement can be done.

Apr 12 2021, 7:31 PM · System administration, Staging environment
vsellier closed T3236: staging: Disk error on storage1 as Resolved.
Apr 12 2021, 7:30 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

The mirror is removed from the pool:

root@storage1:~# zpool list
NAME   SIZE  ALLOC   FREE  CKPOINT  EXPANDSZ   FRAG    CAP  DEDUP    HEALTH  ALTROOT
data  21.8T  2.50T  19.3T        -         -    20%    11%  1.00x    ONLINE  -
Apr 12 2021, 7:30 PM · System administration, Staging environment
vsellier closed T3228: Free 4 of 5 remaining ips still used on vlan210 as Resolved.

All 3 VMs are reconfigured. The IPs are released.

Apr 12 2021, 6:11 PM · System administration
vsellier added a comment to T3228: Free 4 of 5 remaining ips still used on vlan210.
  • puppet disabled on all 3 nodes (pergamon, moma, tate)
  • old /etc/network/interfaces file backed up
cp /etc/network/interfaces T3228-interfaces
  • configuration changed on proxmox to switch eth0 from vlan210 to vlan1300
  • applying puppet configuration
  • restart
  • Former vlan1300 interface removed on proxmox
Apr 12 2021, 5:55 PM · System administration
vsellier updated the task description for T3228: Free 4 of 5 remaining ips still used on vlan210.
Apr 12 2021, 5:28 PM · System administration
vsellier updated the task description for T3228: Free 4 of 5 remaining ips still used on vlan210.
Apr 12 2021, 5:28 PM · System administration
vsellier closed D5479: network: Remove network interface on deprecated VLAN210 network.
Apr 12 2021, 5:18 PM
vsellier committed rSPSITE657bfb390b33: network: Remove network interface on deprecated VLAN210 network (authored by vsellier).
network: Remove network interface on deprecated VLAN210 network
Apr 12 2021, 5:18 PM
vsellier updated the diff for D5479: network: Remove network interface on deprecated VLAN210 network.

rebase

Apr 12 2021, 5:14 PM
vsellier added a comment to T3087: Implement support for takedown notices (infra, admin tools, workflow).

Are we planning to add a way to notify the mirrors of the takedown notices?
I'm wondering whether it would be worth subscribing the staging environment to it, to ensure the content is also removed from it (and also flagged to avoid any further ingestion).

Apr 12 2021, 4:31 PM · Roadmap 2022, meta-task, Roadmap 2021, Web app
vsellier added a comment to T3221: elk: automatically limit log retention.

👍 thanks

Apr 12 2021, 4:19 PM · System administration
vsellier added a comment to T3243: Replace /dev/sdb and /dev/sdc on storage1.staging.

A ticket was opened on the Seagate site for the replacement of these 2 disks. The information will be transferred to the DSI for the packaging as soon as the disks are removed from the pool.

Apr 12 2021, 4:14 PM · System administration, Staging environment
vsellier added a comment to T2939: Replace out of order disks on db1.staging and storage1.staging.

storage disks will be replaced in T3243

Apr 12 2021, 2:33 PM · System administration
vsellier triaged T3243: Replace /dev/sdb and /dev/sdc on storage1.staging as High priority.
Apr 12 2021, 2:26 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

The mirror-1 removal is in progress:

root@storage1:~# zpool remove data mirror-1
Apr 12 2021, 2:19 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

There are 2 disks with errors that should now be replaced:

  • /dev/sdb (wwn-0x5000c500a23e3868): an old one
  • /dev/sdc (wwn-0x5000c500a22f48c9): the disk just removed from the pool
Apr 12 2021, 12:56 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

The failing disk was removed from the pool:

root@storage1:~# zpool detach data wwn-0x5000c500a22f48c9
Apr 12 2021, 12:49 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

The new failing drive is /dev/sdc

root@storage1:~# ls -al /dev/disk/by-id/ | grep wwn-0x5000c500a22f48c9
lrwxrwxrwx 1 root root    9 Apr 11 03:42 wwn-0x5000c500a22f48c9 -> ../../sdc
lrwxrwxrwx 1 root root   10 Mar 11 17:08 wwn-0x5000c500a22f48c9-part1 -> ../../sdc1
lrwxrwxrwx 1 root root   10 Mar 11 17:08 wwn-0x5000c500a22f48c9-part9 -> ../../sdc9
Apr 12 2021, 12:46 PM · System administration, Staging environment
vsellier changed the status of T3236: staging: Disk error on storage1 from Open to Work in Progress.
Apr 12 2021, 12:09 PM · System administration, Staging environment
vsellier created T3242: Decommission ClearlyDefined resources.
Apr 12 2021, 11:55 AM · System administration
vsellier added a comment to T3221: elk: automatically limit log retention.

A script is regularly executed to close the oldest indexes (30 days): P1004
It should be added to puppet and scheduled in a cron job.
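
As a rough sketch of the selection step only (not the content of P1004), the following Python function picks the indexes older than a retention period. It assumes daily indexes whose names end with a YYYY.MM.DD date. The function name indexes_to_close and the index names in the example are hypothetical. The actual close operation, for example through Elasticsearch's close index API, is left out.

```python
from datetime import date, datetime, timedelta


def indexes_to_close(index_names, today, retention_days=30):
    """Return the daily indexes whose date suffix is older than the retention.

    Assumes names ending in YYYY.MM.DD; anything else is ignored.
    """
    cutoff = today - timedelta(days=retention_days)
    selected = []
    for name in index_names:
        try:
            day = datetime.strptime(name[-10:], "%Y.%m.%d").date()
        except ValueError:
            continue  # not a daily index, leave it alone
        if day < cutoff:
            selected.append(name)
    return sorted(selected)


if __name__ == "__main__":
    # Made-up index names for illustration.
    names = ["logs-2021.03.01", "logs-2021.04.01", ".kibana"]
    print(indexes_to_close(names, today=date(2021, 4, 12)))
```

Keeping the selection as a pure function makes it easy to check before wiring it into a cron job.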

Apr 12 2021, 10:37 AM · System administration
vsellier triaged T3236: staging: Disk error on storage1 as High priority.
Apr 12 2021, 9:36 AM · System administration, Staging environment

Apr 11 2021

vsellier committed rDENV771f979f057f: Parallelize builds (authored by vsellier).
Parallelize builds
Apr 11 2021, 11:04 PM
vsellier committed rDENV02cf8839533c: kubernetes/Readme: Simplify readme instructions (authored by ardumont).
kubernetes/Readme: Simplify readme instructions
Apr 11 2021, 11:04 PM
vsellier committed rDENVffd820384372: reduce kafka startup time (authored by vsellier).
reduce kafka startup time
Apr 11 2021, 11:04 PM
vsellier committed rDENVf4436b121daf: speedup rebuild (authored by vsellier).
speedup rebuild
Apr 11 2021, 11:04 PM
vsellier committed rDENV08d51153ad04: counters: use labels in prometheus queries (authored by vsellier).
counters: use labels in prometheus queries
Apr 11 2021, 11:04 PM
vsellier committed rDENVec0bf540f11d: Add a UI on top of the registry (authored by vsellier).
Add a UI on top of the registry
Apr 11 2021, 11:04 PM

Apr 10 2021

vsellier committed rDENV75135ca327f4: webapp use counters to display the history count graph (authored by vsellier).
webapp use counters to display the history count graph
Apr 10 2021, 8:27 PM
vsellier committed rDENV16ef24f70775: monitoring: use consistent names for exporter job (authored by vsellier).
monitoring: use consistent names for exporter job
Apr 10 2021, 8:27 PM
vsellier committed rDENVb09a2a45ab03: use a local storage for the registry (authored by vsellier).
use a local storage for the registry
Apr 10 2021, 8:27 PM
vsellier committed rDENV0c220f3eecfd: try to perform a warm shutdown of the lister and loaders (authored by vsellier).
try to perform a warm shutdown of the lister and loaders
Apr 10 2021, 8:27 PM
vsellier committed rDENV1ca15f499b05: loaders: use a dynamic hostname (authored by vsellier).
loaders: use a dynamic hostname
Apr 10 2021, 8:27 PM

Apr 9 2021

vsellier triaged T3232: remove hardcoded historical values from the webapp as Normal priority.
Apr 9 2021, 7:33 PM · Monitoring, Web app
vsellier triaged T3231: Make the source of the object counts configurable as Normal priority.
Apr 9 2021, 7:22 PM · System administration, Monitoring, Web app
vsellier closed T3165: Generate historical data from the new counters series, a subtask of T2912: Next generation archive counters, as Resolved.
Apr 9 2021, 7:02 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3165: Generate historical data from the new counters series as Resolved.

Everything is released correctly and deployed on staging

Apr 9 2021, 7:02 PM · System administration, Monitoring
vsellier closed T3215: Deploy the new counters in staging, a subtask of T2912: Next generation archive counters, as Resolved.
Apr 9 2021, 6:56 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3215: Deploy the new counters in staging as Resolved.

I finally found why the graphs look weird: https://forge.softwareheritage.org/source/swh-web/browse/master/swh/web/misc/urls.py$31
With a dirty patch on the server, it's way better:

Apr 9 2021, 6:56 PM · System administration, Monitoring, Web app
vsellier added a project to T3228: Free 4 of 5 remaining ips still used on vlan210: System administration.
Apr 9 2021, 6:31 PM · System administration
vsellier retitled D5479: network: Remove network interface on deprecated VLAN210 network from nerwtork: Remove network interface on deprecated VLAN210 network to network: Remove network interface on deprecated VLAN210 network.
Apr 9 2021, 3:34 PM
vsellier updated the diff for D5479: network: Remove network interface on deprecated VLAN210 network.

Fix a typo in the commit message

Apr 9 2021, 3:34 PM
vsellier requested review of D5479: network: Remove network interface on deprecated VLAN210 network.
Apr 9 2021, 3:14 PM
vsellier added a revision to T3228: Free 4 of 5 remaining ips still used on vlan210: D5479: network: Remove network interface on deprecated VLAN210 network.
Apr 9 2021, 3:14 PM · System administration
vsellier committed rSPSITE011681733315: fix wrong usage of alias/lookup (authored by vsellier).
fix wrong usage of alias/lookup
Apr 9 2021, 2:39 PM
vsellier committed rSPSITE4f9371e81f3d: staging: Fix the counters history url (authored by vsellier).
staging: Fix the counters history url
Apr 9 2021, 2:28 PM
vsellier changed the status of T3228: Free 4 of 5 remaining ips still used on vlan210 from Open to Work in Progress.
Apr 9 2021, 2:16 PM · System administration
vsellier added a comment to T3215: Deploy the new counters in staging.

The pipeline is deployed in staging.
It's working, but it seems the graphs need some initial values in staging to render correctly:

Apr 9 2021, 12:48 PM · System administration, Monitoring, Web app
vsellier closed D5470: staging: configure counters history pipeline.
Apr 9 2021, 12:21 PM
vsellier committed rSPSITE52709f7a0330: staging: configure counters history pipeline (authored by vsellier).
staging: configure counters history pipeline
Apr 9 2021, 12:20 PM
vsellier updated the diff for D5470: staging: configure counters history pipeline.

Add a filter to limit the metrics to the current environment

Apr 9 2021, 12:19 PM
vsellier renamed T3228: Free 4 of 5 remaining ips still used on vlan210 from Free 3 of 4 remaing ip still used on vlan210 to Free 4 of 5 remaing ips still used on vlan210.
Apr 9 2021, 10:50 AM · System administration
vsellier triaged T3228: Free 4 of 5 remaining ips still used on vlan210 as Normal priority.
Apr 9 2021, 10:41 AM · System administration
vsellier added a revision to T3215: Deploy the new counters in staging: D5470: staging: configure counters history pipeline.
Apr 9 2021, 9:47 AM · System administration, Monitoring, Web app
vsellier requested review of D5470: staging: configure counters history pipeline.
Apr 9 2021, 9:47 AM

Apr 8 2021

vsellier committed rDCNT46d1d61bb92c: Fix history endpoint path (authored by vsellier).
Fix history endpoint path
Apr 8 2021, 11:52 PM
vsellier closed D5468: Let flask manage json response by itself.
Apr 8 2021, 7:27 PM
vsellier committed rDCNT9958f4035e49: Let flask manage json response by itself (authored by vsellier).
Let flask manage json response by itself
Apr 8 2021, 7:27 PM
vsellier requested review of D5468: Let flask manage json response by itself.
Apr 8 2021, 7:25 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5468: Let flask manage json response by itself.
Apr 8 2021, 7:24 PM · System administration, Monitoring
vsellier closed T3219: No logs are ingested on elasticsearch since 2021-03-26 as Resolved.
Apr 8 2021, 4:36 PM · System administrators
vsellier triaged T3223: Elasticsearch: Monitor the max opened shards on a cluster as Normal priority.
Apr 8 2021, 4:35 PM · System administrators
vsellier triaged T3222: Monitor daily indexes are present on the log cluster and logs are correctly ingested as Normal priority.
Apr 8 2021, 4:32 PM · System administration
vsellier triaged T3221: elk: automatically limit log retention as Normal priority.
Apr 8 2021, 4:30 PM · System administration