Page MenuHomeSoftware Heritage
Feed All Stories

Aug 27 2021

vlorentz committed rDDOC5104ef5ccc68: Document swh-counters' default port (authored by vlorentz).
Document swh-counters' default port
Aug 27 2021, 11:03 AM
anlambert accepted D6148: Reword 'no more available to download' in more natural English.
Aug 27 2021, 10:45 AM
vlorentz added inline comments to D6148: Reword 'no more available to download' in more natural English.
Aug 27 2021, 10:41 AM
vlorentz added a comment to T3489: Implement iframe view for content and directory elements.

I agree, it looks great!

Aug 27 2021, 10:39 AM · Software Stories, Web app
ardumont added inline comments to D6148: Reword 'no more available to download' in more natural English.
Aug 27 2021, 10:32 AM
ardumont moved T3507: prod: vault: Deploy v1.0.0 from in-progress to deployed/landed/monitoring on the System administration board.
Aug 27 2021, 10:28 AM · System administration, Vault, Web app
ardumont added a comment to T3507: prod: vault: Deploy v1.0.0.

Note that the cache invalidation is not completely done though as the objstorage used
is an azure one.

Currently investigating how to clean that up.

Aug 27 2021, 10:28 AM · System administration, Vault, Web app
vsellier added a comment to T3357: Perform some tests of the cassandra storage on Grid5000.

interesting:

Depending on the data characteristics of the table, compressing its data can result in:

25-33% reduction in data size
25-35% performance improvement on reads
5-10% performance improvement on writes
Aug 27 2021, 10:15 AM · System administration, Storage manager
vlorentz requested review of D6148: Reword 'no more available to download' in more natural English.
Aug 27 2021, 10:13 AM
vsellier added a comment to T3357: Perform some tests of the cassandra storage on Grid5000.

The replaying is currently stopped as the data disks are now almost full.
I will try to activate the compression on some big tables to see if it can help.
I will probably need to start on small tables to recover some space before being able to compress the biggest tables

Aug 27 2021, 10:02 AM · System administration, Storage manager
ardumont added a comment to T3507: prod: vault: Deploy v1.0.0.

Note that the cache invalidation is not completely done though as the objstorage used
is an azure one.

Aug 27 2021, 9:55 AM · System administration, Vault, Web app
ardumont added a comment to T3507: prod: vault: Deploy v1.0.0.
  • status.io: Open maintenance ticket to notify of the partial disruption in service
  • vangogh: Stop puppet
  • vangogh: Stop gunicorn-swh-vault
  • vault db: Schema migration [1]
  • Upgrade workers and webapp nodes with latest swh.vault and restart cooker service
  • Start back gunicorn-swh-vault
  • Try a cooking and check result -> ok
  • Close maintenance ticket as everything is fine
Aug 27 2021, 9:54 AM · System administration, Vault, Web app
ardumont closed T3503: staging: vault: Deploy v1.0.0 as Resolved.
Aug 27 2021, 9:52 AM · Vault, Web app, System administration
ardumont closed T3503: staging: vault: Deploy v1.0.0, a subtask of T3507: prod: vault: Deploy v1.0.0, as Resolved.
Aug 27 2021, 9:52 AM · System administration, Vault, Web app
ardumont changed the status of T3507: prod: vault: Deploy v1.0.0, a subtask of T3506: Get rid of the concept of vault "object_type", from Open to Work in Progress.
Aug 27 2021, 9:24 AM · Vault, Web app
ardumont changed the status of T3507: prod: vault: Deploy v1.0.0 from Open to Work in Progress.
Aug 27 2021, 9:24 AM · System administration, Vault, Web app

Aug 26 2021

moranegg added a comment to T3489: Implement iframe view for content and directory elements.

This looks great!

Aug 26 2021, 10:26 PM · Software Stories, Web app
vsellier added a comment to D6139: cassandra: Add option to select (hopefully) more efficient batch insertion algos.

The patch was test in a loader and in the replayers.
The difference was not really significant on the loader but I'm not really confident in the tests as the cluster had a pretty high load (running replayers + second datacenter synchronization).
I will retry with a more quieter environment to be able to isolate the loader behavior.

Aug 26 2021, 7:26 PM
ardumont updated the summary of D6134: Raise alert when postfix service is down.
Aug 26 2021, 6:17 PM
ardumont updated the summary of D6134: Raise alert when postfix service is down.
Aug 26 2021, 6:13 PM
ardumont updated the diff for D6134: Raise alert when postfix service is down.

Rework commit message

Aug 26 2021, 6:13 PM
ardumont updated the diff for D6134: Raise alert when postfix service is down.

Use the correct postfix service to check. With this, the check actually detects crash of
the processed started by the service.

Aug 26 2021, 6:13 PM
anlambert changed the status of T3489: Implement iframe view for content and directory elements, a subtask of T3480: Provide iframe element for the SWH Stories, from Open to Work in Progress.
Aug 26 2021, 5:58 PM · Acquisition Process (SWHAP), Software Stories, Web app
anlambert changed the status of T3489: Implement iframe view for content and directory elements from Open to Work in Progress.

Some progress report on my work of the last days on the subject.

Aug 26 2021, 5:58 PM · Software Stories, Web app
ardumont accepted D6147: Refine scheduling policy for origins with no known last update.
Aug 26 2021, 5:50 PM
ardumont accepted D6146: Add a swh scheduler origin send-to-celery subcommand.
Aug 26 2021, 5:50 PM
ardumont added a comment to T2345: Improve handling of recurrent loading tasks in scheduler.

What's next, as a summary, subsequent subtasks should be created later:

Aug 26 2021, 5:45 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
ardumont added a comment to D5818: send-to-celery: Add more options to allow scheduling of edge case origins.

This needs rebase on top of D6147

Aug 26 2021, 5:05 PM
ardumont added inline comments to D5818: send-to-celery: Add more options to allow scheduling of edge case origins.
Aug 26 2021, 5:03 PM
olasd requested review of D6147: Refine scheduling policy for origins with no known last update.
Aug 26 2021, 4:56 PM
olasd requested review of D6146: Add a swh scheduler origin send-to-celery subcommand.
Aug 26 2021, 4:53 PM
ardumont accepted D6145: Add table sampling option to grab_next_visits.
Aug 26 2021, 4:51 PM
olasd requested review of D6145: Add table sampling option to grab_next_visits.
Aug 26 2021, 4:51 PM
ardumont committed rDSCHcc76a573e795: journal_client: Only upsert if we have something to upsert (authored by ardumont).
journal_client: Only upsert if we have something to upsert
Aug 26 2021, 3:38 PM
ardumont closed D6140: journal_client: Only upsert if we have something to upsert.
Aug 26 2021, 3:38 PM
anlambert closed D6144: developer-setup: Add elasticsearch install instructions.
Aug 26 2021, 3:16 PM
anlambert committed rDDOCc4e3e1c0b1e1: developer-setup: Add elasticsearch install instructions (authored by anlambert).
developer-setup: Add elasticsearch install instructions
Aug 26 2021, 3:16 PM
ardumont accepted D6144: developer-setup: Add elasticsearch install instructions.
Aug 26 2021, 3:10 PM
vlorentz changed the status of T3505: Make the git-bare cooker available to the staff and beta-testers in the production webapp from Open to Work in Progress.
Aug 26 2021, 3:08 PM · Vault, Web app
vlorentz changed the status of T3505: Make the git-bare cooker available to the staff and beta-testers in the production webapp, a subtask of T3504: Make the git-bare cooker publicly available, from Open to Work in Progress.
Aug 26 2021, 3:08 PM · Vault, Web app
vlorentz added a subtask for T3505: Make the git-bare cooker available to the staff and beta-testers in the production webapp: T843: Vault: Add a "git bare" tarball cooker.
Aug 26 2021, 3:08 PM · Vault, Web app
vlorentz added a parent task for T843: Vault: Add a "git bare" tarball cooker: T3505: Make the git-bare cooker available to the staff and beta-testers in the production webapp.
Aug 26 2021, 3:08 PM · Vault
vlorentz removed a parent task for T843: Vault: Add a "git bare" tarball cooker: T3504: Make the git-bare cooker publicly available.
Aug 26 2021, 3:07 PM · Vault
vlorentz removed a subtask for T3504: Make the git-bare cooker publicly available: T843: Vault: Add a "git bare" tarball cooker.
Aug 26 2021, 3:07 PM · Vault, Web app
vlorentz added a parent task for T843: Vault: Add a "git bare" tarball cooker: T3504: Make the git-bare cooker publicly available.
Aug 26 2021, 3:07 PM · Vault
vlorentz added a subtask for T3504: Make the git-bare cooker publicly available: T843: Vault: Add a "git bare" tarball cooker.
Aug 26 2021, 3:07 PM · Vault, Web app
vlorentz closed T843: Vault: Add a "git bare" tarball cooker as Resolved.

done, and released as swh-vault v1.0.0 :)

Aug 26 2021, 3:06 PM · Vault
vlorentz closed T843: Vault: Add a "git bare" tarball cooker, a subtask of T3096: Efficient and reliable download via the Vault, as Resolved.
Aug 26 2021, 3:06 PM · meta-task, Roadmap 2021, Vault
ardumont moved T3507: prod: vault: Deploy v1.0.0 from Backlog to Weekly backlog on the System administration board.
Aug 26 2021, 3:06 PM · System administration, Vault, Web app
vlorentz added projects to T3503: staging: vault: Deploy v1.0.0: Web app, Vault.
Aug 26 2021, 3:06 PM · Vault, Web app, System administration
vlorentz added a parent task for T3503: staging: vault: Deploy v1.0.0: T3507: prod: vault: Deploy v1.0.0.
Aug 26 2021, 3:05 PM · Vault, Web app, System administration
vlorentz added a subtask for T3507: prod: vault: Deploy v1.0.0: T3503: staging: vault: Deploy v1.0.0.
Aug 26 2021, 3:05 PM · System administration, Vault, Web app
vlorentz triaged T3507: prod: vault: Deploy v1.0.0 as Normal priority.
Aug 26 2021, 3:05 PM · System administration, Vault, Web app
vlorentz added a task to D6126: vault.spec.js: Add test checking the LocalStorage is used to show the task list: T3506: Get rid of the concept of vault "object_type".
Aug 26 2021, 3:05 PM
vlorentz added a task to D6113: vault API: Rename bundle types and use SWHIDs to identify objects: T3506: Get rid of the concept of vault "object_type".
Aug 26 2021, 3:05 PM
vlorentz added a task to D6112: Rename bundle types and use SWHIDs everywhere instead of raw sha1_git: T3506: Get rid of the concept of vault "object_type".
Aug 26 2021, 3:05 PM
vlorentz added a task to D6141: Bump minimum swh-vault version: T3506: Get rid of the concept of vault "object_type".
Aug 26 2021, 3:05 PM
vlorentz added revisions to T3506: Get rid of the concept of vault "object_type": D6141: Bump minimum swh-vault version, D6126: vault.spec.js: Add test checking the LocalStorage is used to show the task list, D6113: vault API: Rename bundle types and use SWHIDs to identify objects, D6112: Rename bundle types and use SWHIDs everywhere instead of raw sha1_git.
Aug 26 2021, 3:05 PM · Vault, Web app
vlorentz changed the status of T3506: Get rid of the concept of vault "object_type" from Open to Work in Progress.
Aug 26 2021, 3:04 PM · Vault, Web app
vlorentz changed the status of T3506: Get rid of the concept of vault "object_type", a subtask of T3505: Make the git-bare cooker available to the staff and beta-testers in the production webapp, from Open to Work in Progress.
Aug 26 2021, 3:04 PM · Vault, Web app
vlorentz triaged T3506: Get rid of the concept of vault "object_type" as Normal priority.
Aug 26 2021, 3:04 PM · Vault, Web app
vlorentz triaged T3505: Make the git-bare cooker available to the staff and beta-testers in the production webapp as Normal priority.
Aug 26 2021, 3:02 PM · Vault, Web app
vlorentz triaged T3504: Make the git-bare cooker publicly available as Normal priority.
Aug 26 2021, 3:01 PM · Vault, Web app
ardumont moved T3503: staging: vault: Deploy v1.0.0 from code-review/await-feedback/pause to deployed/landed/monitoring on the System administration board.
Aug 26 2021, 3:01 PM · Vault, Web app, System administration
ardumont moved T3503: staging: vault: Deploy v1.0.0 from in-progress to code-review/await-feedback/pause on the System administration board.
Aug 26 2021, 3:00 PM · Vault, Web app, System administration
ardumont added a comment to T3503: staging: vault: Deploy v1.0.0.
  • Stop puppet
  • Stop gunicorn-swh-vault
  • Schema migration [1]
  • Clean up the objstorage [2]
  • Start back gunicorn-swh-vault
  • Upgrade staging workers and webapp nodes with latest swh.vault
  • Try a cooking and check result
Aug 26 2021, 2:59 PM · Vault, Web app, System administration
vlorentz committed rDVAUd5a7c7f2bd95: tests: Fix compatibility with dulwich 0.19.11. (authored by vlorentz).
tests: Fix compatibility with dulwich 0.19.11.
Aug 26 2021, 2:23 PM
ardumont changed the status of T3503: staging: vault: Deploy v1.0.0 from Open to Work in Progress.
Aug 26 2021, 2:21 PM · Vault, Web app, System administration
ardumont updated the task description for T3503: staging: vault: Deploy v1.0.0.
Aug 26 2021, 2:21 PM · Vault, Web app, System administration
stsp requested review of D5992: add support for the CVS loader to 'Save Code Now'.
Aug 26 2021, 2:18 PM
ardumont updated the task description for T3503: staging: vault: Deploy v1.0.0.
Aug 26 2021, 2:17 PM · Vault, Web app, System administration
vlorentz committed rDVAU5a1c3d01610e: Re-add pytest.mark.graph (authored by vlorentz).
Re-add pytest.mark.graph
Aug 26 2021, 2:13 PM
anlambert added a comment to D6140: journal_client: Only upsert if we have something to upsert.

Looks good to me. Could you add a test for this or is it too complicated ? The code is covered but not the introduced behavior,

I'm not entirely sure on how to proceed for the test indeed.

But for sure, it's currently working as expected on the production patched swh-scheduler-journal-client (saatchi).

Aug 26 2021, 2:05 PM
ardumont triaged T3503: staging: vault: Deploy v1.0.0 as Normal priority.
Aug 26 2021, 2:02 PM · Vault, Web app, System administration
ardumont added a comment to D6140: journal_client: Only upsert if we have something to upsert.

Looks good to me. Could you add a test for this or is it too complicated ? The code is covered but not the introduced behavior,

Aug 26 2021, 2:00 PM
anlambert requested review of D6144: developer-setup: Add elasticsearch install instructions.
Aug 26 2021, 12:56 PM
vlorentz requested review of D6143: Add support for 'development' installs.
Aug 26 2021, 12:50 PM
vsellier committed rDSNIP4a4eaea90026: grid5000/cassandra Adapt the script to support a multidc deployment (authored by vsellier).
grid5000/cassandra Adapt the script to support a multidc deployment
Aug 26 2021, 12:46 PM
vsellier added a comment to T3465: Test multidatacenter replication.

These are the steps done to initialized the new cluster [1]:

  • add a file datacenter-rackdc.properties on the server with the according DC
gros-50:~$ cat /etc/cassandra/cassandra-rackdc.properties 
dc=datacenter2
rack=rack1
  • change the value of the properties endpoint_snitch from SimpleSnitch to GossipingPropertyFileSnitch [2].

The recommanded value for production is GossipingPropertyFileSnitch so it should have been this since the beginning

  • configure the disk_optimization_strategy to ssd on the new datacenter
  • update the seed_provider to have one node on each datacenter
  • restart the datacenter1 nodes to apply the new configuration
  • start the datacenter2 nodes one by one, wait until the status of the node is UN (Up and Normal) before starting another one (They can be stay in the UJ (joining) state for a couple of minutes)
  • when done, update the swh keyspace to declare the replication strategy of the second DC
ALTER KEYSPACE swh WITH REPLICATION = {'class' : 'NetworkTopologyStrategy', 'datacenter1' : 3, 'datacenter2': 3};

The replication of the new changes starts here but the full table contents need to be copied

  • rebuild the cluster content:
vsellier@fnancy:~/cassandra$ seq 0 9 | parallel -t ssh gros-5{} nodetool rebuild -ks swh -- datacenter1

The progression can be monitored with nodetool command:

gros-50:~$ nodetool netstats                                                                 
Mode: NORMAL                                                                                           
Rebuild e5e64920-0644-11ec-92a6-31a241f39914                                                            
    /172.16.97.4                                                                                                                                      
        Receiving 199 files, 147926499702 bytes total. Already received 125 files (62.81%), 57339885570 bytes total (38.76%)
            swh/release-4 1082347/1082347 bytes (100%) received from idx:0/172.16.97.4                                                                           
            swh/content_by_blake2s256-2 3729362955/3729362955 bytes (100%) received from idx:0/172.16.97.4
            swh/release-3 224510803/224510803 bytes (100%) received from idx:0/172.16.97.4                
            swh/content_by_blake2s256-1 240283216/240283216 bytes (100%) received from idx:0/172.16.97.4
            swh/content_by_blake2s256-4 29491504/29491504 bytes (100%) received from idx:0/172.16.97.4
            swh/release-2 6409474/6409474 bytes (100%) received from idx:0/172.16.97.4                
...
Read Repair Statistics:                                                                                     
Attempted: 0                                                                                          
Mismatch (Blocking): 0                                                                                
Mismatch (Background): 0                                                                            
Pool Name                    Active   Pending      Completed   Dropped                                
Large messages                  n/a         0             23         0                                
Small messages                  n/a         3      132753939         0                          
Gossip messages                 n/a         0          43915         0

or to filter only running transfers:

gros-50:~$ nodetool netstats  | grep -v 100%
Mode: NORMAL
Rebuild e5e64920-0644-11ec-92a6-31a241f39914
    /172.16.97.4
        Receiving 199 files, 147926499702 bytes total. Already received 125 files (62.81%), 57557961160 bytes total (38.91%)
            swh/directory_entry-7 4819168032/4925484261 bytes (97%) received from idx:0/172.16.97.4
    /172.16.97.2
        Receiving 202 files, 111435975646 bytes total. Already received 139 files (68.81%), 60583670773 bytes total (54.37%)
            swh/directory_entry-12 1631210003/2906113367 bytes (56%) received from idx:0/172.16.97.2
    /172.16.97.6
        Receiving 236 files, 186694443984 bytes total. Already received 142 files (60.17%), 58869656747 bytes total (31.53%)
            swh/snapshot_branch-10 4449235102/7845572885 bytes (56%) received from idx:0/172.16.97.6
    /172.16.97.5
        Receiving 221 files, 143384473640 bytes total. Already received 132 files (59.73%), 58300913015 bytes total (40.66%)
            swh/directory_entry-4 982247023/3492851311 bytes (28%) received from idx:0/172.16.97.5
Read Repair Statistics:
Attempted: 0
Mismatch (Blocking): 0
Mismatch (Background): 0
Pool Name                    Active   Pending      Completed   Dropped
Large messages                  n/a         0             23         0
Small messages                  n/a         2      135087921         0
Gossip messages                 n/a         0          44176         0
Aug 26 2021, 12:41 PM · System administration, Storage manager
swh-public-ci added a comment to D6083: hypothesis: Run with more examples by default.

Build is green

Aug 26 2021, 12:38 PM
vlorentz closed D6131: Remove .wasm from all distributions, and .so from source distributions.
Aug 26 2021, 12:37 PM
vlorentz committed rDSEA3940daa889ce: Remove .wasm from all distributions, and .so from source distributions (authored by vlorentz).
Remove .wasm from all distributions, and .so from source distributions
Aug 26 2021, 12:37 PM
swh-public-ci added a comment to D6090: Refactor `ProvenanceStoragePostgreSql`.

Build is green

Aug 26 2021, 12:31 PM
swh-public-ci added a comment to D6131: Remove .wasm from all distributions, and .so from source distributions.

Build is green

Aug 26 2021, 12:31 PM
ardumont created P1134 current load-git tasks status.
Aug 26 2021, 12:31 PM
olasd merged T1278: swh-journal: the monitoring tool question! into T2128: Monitor journal consumer lag.
Aug 26 2021, 12:30 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer)
olasd merged task T1278: swh-journal: the monitoring tool question! into T2128: Monitor journal consumer lag.
Aug 26 2021, 12:30 PM · Journal
aeviso updated the diff for D6090: Refactor `ProvenanceStoragePostgreSql`.

Re-enable provenance storage tests that were disabled by mistake

Aug 26 2021, 12:28 PM
vlorentz requested review of D6139: cassandra: Add option to select (hopefully) more efficient batch insertion algos.
Aug 26 2021, 12:28 PM
vlorentz updated the diff for D6131: Remove .wasm from all distributions, and .so from source distributions.

rebase

Aug 26 2021, 12:27 PM
olasd added a comment to T2128: Monitor journal consumer lag.

This would have caught T3502 earlier too.

Aug 26 2021, 12:27 PM · Metrics/monitoring, Sprint 2019/12 (Monitor and Conquer)
olasd requested review of D6142: kafka: bump consumer offset retention to 30 days.
Aug 26 2021, 12:27 PM
swh-public-ci added a comment to D6131: Remove .wasm from all distributions, and .so from source distributions.

Build is green

Aug 26 2021, 12:26 PM
vlorentz closed D6141: Bump minimum swh-vault version.
Aug 26 2021, 12:25 PM
vlorentz committed rDWAPPS66865c37ce33: Bump minimum swh-vault version (authored by vlorentz).
Bump minimum swh-vault version
Aug 26 2021, 12:25 PM
vlorentz requested review of D6141: Bump minimum swh-vault version.
Aug 26 2021, 12:25 PM
vlorentz updated the diff for D6131: Remove .wasm from all distributions, and .so from source distributions.

fix support for old yarn versions (without -s)

Aug 26 2021, 12:22 PM
vlorentz updated the diff for D6083: hypothesis: Run with more examples by default.

rebase

Aug 26 2021, 12:14 PM
anlambert accepted D6140: journal_client: Only upsert if we have something to upsert.

Looks good to me. Could you add a test for this or is it too complicated ? The code is covered but not the introduced behavior,

Aug 26 2021, 12:09 PM
vsellier added a comment to T3465: Test multidatacenter replication.

The second cassandra cluster is finally up and synchronizing with the first one. The rebuild should be done by the end of the day or tomorrow.

Aug 26 2021, 12:05 PM · System administration, Storage manager