Page MenuHomeSoftware Heritage
Feed Advanced Search

Jun 28 2022

vsellier updated the diff for D8040: Limit the number of entries in the cache.

make mypy happy

Jun 28 2022, 9:38 AM

Jun 27 2022

vsellier updated the diff for D8040: Limit the number of entries in the cache.

add missing parenthesis

Jun 27 2022, 10:59 PM
vsellier added inline comments to D8040: Limit the number of entries in the cache.
Jun 27 2022, 10:52 PM
vsellier updated the diff for D8040: Limit the number of entries in the cache.

update according the reviews

  • simplify the cache management
  • fix the doc strings
Jun 27 2022, 10:52 PM
vsellier accepted D8038: Provision scrubber1 for checker services.
Jun 27 2022, 4:48 PM
vsellier requested review of D8040: Limit the number of entries in the cache.
Jun 27 2022, 4:10 PM
vsellier added a revision to T4313: [provenance] some process are oom killed: D8040: Limit the number of entries in the cache.
Jun 27 2022, 3:58 PM · System administration, Provenance database
vsellier changed the status of T4313: [provenance] some process are oom killed from Open to Work in Progress.
Jun 27 2022, 3:56 PM · System administration, Provenance database
vsellier accepted D8039: Install scrubber services on scrubber nodes.
Jun 27 2022, 3:24 PM
vsellier requested changes to D8038: Provision scrubber1 for checker services.
Jun 27 2022, 3:24 PM

Jun 24 2022

vsellier accepted D8034: Deploy production swh-scrubber db connection.

looks good (for what it's worth)

Jun 24 2022, 6:47 PM
vsellier added a comment to T4340: swh-graph timeouts.

It's confirmed that the issue seems to be on the python part of the current implementation so I'm eager to see D7890 landed ;)

Jun 24 2022, 10:13 AM · Compressed graph service

Jun 22 2022

vsellier added a comment to T4340: swh-graph timeouts.

I reversed engineered the py4j communication protocol, so next time it will hang, we should be able to tell if the issue is on the gateway server side or on the python side:

  • Create a name pipe
mkfifo /tmp/test
chmod a+w /tmp/test
tail -F /tmp/test
  • query the graph
ss -ltp | grep java
<get the port number>
telnet localhost <port number>
c
o0
get_handler
s/tmp/test
e
Jun 22 2022, 2:07 PM · Compressed graph service
vsellier added a comment to T4347: gitlab migration reset state routine is flaky.

Looks like something is wrong in the operator state management.
For what I found on internet, it could be related to the cert-manager version but it should be already fixed. For example: https://gitlab.com/gitlab-org/cloud-native/gitlab-operator/-/issues/315
(The current cert-manager version in the cluster is 1.8.0)

Jun 22 2022, 10:40 AM · System administration, GitLab migration, Roadmap 2020

Jun 20 2022

vsellier updated the task description for T4340: swh-graph timeouts.
Jun 20 2022, 10:17 AM · Compressed graph service
vsellier updated the task description for T4340: swh-graph timeouts.
Jun 20 2022, 10:16 AM · Compressed graph service
vsellier triaged T4340: swh-graph timeouts as High priority.
Jun 20 2022, 10:16 AM · Compressed graph service

Jun 17 2022

vsellier closed T4315: [provenance] Fallback to swh-storage if a revision or its parent is not found in swh-graph as Resolved.

A dozen of clients running in the provenance-client01 are using the multiplexer configuration.
It seems to work correctly

Jun 17 2022, 5:03 PM · Provenance database
vsellier committed rDPROV80434e3b2191: Reduce multiplexer logs output (authored by vsellier).
Reduce multiplexer logs output
Jun 17 2022, 9:43 AM

Jun 16 2022

vsellier accepted D8000: docs/journal-clients: Reference a new anchor title.
Jun 16 2022, 6:02 PM
vsellier closed D7962: kafka: add more options to the user management script.
Jun 16 2022, 5:49 PM
vsellier committed rSPSITE395c4a2a79de: kafka: add more options to the user management script (authored by vsellier).
kafka: add more options to the user management script
Jun 16 2022, 5:49 PM
vsellier updated the diff for D7962: kafka: add more options to the user management script.

rebase

Jun 16 2022, 5:48 PM
vsellier committed R259:6742c91df028: swh-provenance-client: update requirements-frozen.txt (authored by vsellier).
swh-provenance-client: update requirements-frozen.txt
Jun 16 2022, 10:34 AM
vsellier committed rDPROV2453c3ef3b53: Add logs relative to the cache flush performances (authored by vsellier).
Add logs relative to the cache flush performances
Jun 16 2022, 10:31 AM
vsellier committed rDPROV12c45c828f2a: Don't stop the ingestion if an error occurs in one of the archive backend (authored by vsellier).
Don't stop the ingestion if an error occurs in one of the archive backend
Jun 16 2022, 10:31 AM
vsellier committed rDPROVf5ed9de87b39: Improve origin layer logs (authored by vsellier).
Improve origin layer logs
Jun 16 2022, 10:31 AM
vsellier committed rDPROVb69c0f7689f0: Add a new multiplexed archive type (authored by vsellier).
Add a new multiplexed archive type
Jun 16 2022, 10:31 AM
vsellier closed D7985: [provenance] Implement a naive archive multiplexer.
Jun 16 2022, 10:31 AM · Provenance database
vsellier committed rDPROVd45f066a8c51: Declare the missing swh-graph dependency (authored by vsellier).
Declare the missing swh-graph dependency
Jun 16 2022, 10:31 AM
vsellier committed R259:3cb73d1fe0ca: swh-provenance-client: update requirements-frozen.txt (authored by vsellier).
swh-provenance-client: update requirements-frozen.txt
Jun 16 2022, 9:19 AM
vsellier updated the diff for D7985: [provenance] Implement a naive archive multiplexer.

Update according the reviews

  • Add and fix license headers
  • Ensure the _revisions_count variable was computed before returning its value
Jun 16 2022, 8:34 AM · Provenance database

Jun 15 2022

vsellier added inline comments to D7985: [provenance] Implement a naive archive multiplexer.
Jun 15 2022, 8:38 PM · Provenance database
vsellier added a comment to T4064: Test GitLab migration scripts.

\o/ well done

Jun 15 2022, 6:41 PM · System administration, GitLab migration, Roadmap 2020
vsellier retitled D7985: [provenance] Implement a naive archive multiplexer from Declare the missing swh-graph dependency to [provenance] Implement a naive archive multiplexer.
Jun 15 2022, 10:57 AM · Provenance database
vsellier added a comment to D7985: [provenance] Implement a naive archive multiplexer.

I've deliberately created the diff with the 3 commits inside, I just forgot to update the title ;)

Jun 15 2022, 10:55 AM · Provenance database
vsellier requested review of D7985: [provenance] Implement a naive archive multiplexer.
Jun 15 2022, 9:38 AM · Provenance database

Jun 14 2022

vsellier added a revision to T4315: [provenance] Fallback to swh-storage if a revision or its parent is not found in swh-graph: D7985: [provenance] Implement a naive archive multiplexer.
Jun 14 2022, 12:23 PM · Provenance database

Jun 8 2022

vsellier closed T4314: [provenance] HistoryGraph building is failing when reaching the first revision as Resolved.

fixed with the workaround in D7972

Jun 8 2022, 11:51 PM · Provenance database
vsellier closed D7972: Support empty response from swh-graph.
Jun 8 2022, 11:50 PM
vsellier committed rDPROVd6e88dabb3bc: Support empty response from swh-graph (authored by vsellier).
Support empty response from swh-graph
Jun 8 2022, 11:50 PM
vsellier updated subscribers of D7972: Support empty response from swh-graph.

Because it looks simpler and faster to have this workaround here.

Jun 8 2022, 2:49 PM
vsellier triaged T4316: Push of swh-graph to pypi is broken as High priority.
Jun 8 2022, 2:28 PM · System administration, Compressed graph service
vsellier renamed T4315: [provenance] Fallback to swh-storage if a revision or its parent is not found in swh-graph from [provenance] Fallback to swh-storage if a revision is not found in swh-graph to [provenance] Fallback to swh-storage if a revision or its parent is not found in swh-graph.
Jun 8 2022, 2:17 PM · Provenance database
vsellier triaged T4315: [provenance] Fallback to swh-storage if a revision or its parent is not found in swh-graph as Normal priority.
Jun 8 2022, 2:16 PM · Provenance database
vsellier requested review of D7972: Support empty response from swh-graph.
Jun 8 2022, 2:07 PM
vsellier added a revision to T4314: [provenance] HistoryGraph building is failing when reaching the first revision: D7972: Support empty response from swh-graph.
Jun 8 2022, 1:59 PM · Provenance database
vsellier triaged T4314: [provenance] HistoryGraph building is failing when reaching the first revision as Normal priority.
Jun 8 2022, 12:43 AM · Provenance database

Jun 7 2022

vsellier triaged T4313: [provenance] some process are oom killed as Normal priority.
Jun 7 2022, 8:25 PM · System administration, Provenance database
vsellier added a comment to D7965: postgres db: Create guest user at db initialization time.

is it not possible to use a sql script directly, in the idea of [1] ?
I suppose it will make the script simpler as the postgresql image logic will take care of running the script only during the database initialization.

Jun 7 2022, 2:45 PM
vsellier committed R259:38b9ab71471a: swh-provenance-client: refresh the dependencies (authored by vsellier).
swh-provenance-client: refresh the dependencies
Jun 7 2022, 11:33 AM

Jun 4 2022

vsellier closed T4302: provenance: origin processed several time by different clients as Invalid.

This issue was a false lead, different snapshots are declared for the same origin letting me believe there were some duplicates.
I completely missed it and fall head first into the trap

Jun 4 2022, 8:19 AM · Provenance database

Jun 3 2022

vsellier committed rDPROV5cbbb9a333b4: exclude local .venv from git changes (authored by vsellier).
exclude local .venv from git changes
Jun 3 2022, 7:52 PM
vsellier committed rDPROVbfd7cac5c89e: Initiate a quick and dirty docker environment (authored by vsellier).
Initiate a quick and dirty docker environment
Jun 3 2022, 7:52 PM
vsellier committed rDPROVd45d3628c831: Add the missing zmq dependency needed for the csv servers (authored by vsellier).
Add the missing zmq dependency needed for the csv servers
Jun 3 2022, 7:52 PM
vsellier closed D7956: Provenance: Docker for origin layer + logging improvement.
Jun 3 2022, 7:52 PM
vsellier committed rDPROV13d6089d6cda: Improve origin layer logging (authored by vsellier).
Improve origin layer logging
Jun 3 2022, 7:52 PM
vsellier updated the diff for D7956: Provenance: Docker for origin layer + logging improvement.

only use the postgresql paramater to configure the max connection

Jun 3 2022, 7:52 PM
vsellier updated the diff for D7962: kafka: add more options to the user management script.

update the parameter count limit according the new supported ones

Jun 3 2022, 6:07 PM
vsellier requested review of D7962: kafka: add more options to the user management script.
Jun 3 2022, 6:04 PM
vsellier added a revision to T4265: Create test topics on staging's kafka for the mirror tests: D7962: kafka: add more options to the user management script.
Jun 3 2022, 6:04 PM · System administration
vsellier added inline comments to D7956: Provenance: Docker for origin layer + logging improvement.
Jun 3 2022, 5:57 PM
vsellier added a comment to T4265: Create test topics on staging's kafka for the mirror tests.

I've also added the permissions to the user mirror-test-rw to create and destroy topics. So you should be able to manage the swh.test.objects[_privileged] topics lifecycle

Jun 3 2022, 4:01 PM · System administration
vsellier added a comment to T4265: Create test topics on staging's kafka for the mirror tests.

btw the credentials are pushed in the credential store

Jun 3 2022, 3:21 PM · System administration
vsellier reopened T4265: Create test topics on staging's kafka for the mirror tests as "Work in Progress".

heh, it misses the last part of the task (the commands to manage the topics)

Jun 3 2022, 3:18 PM · System administration
vsellier closed T4265: Create test topics on staging's kafka for the mirror tests as Resolved.

Changing the status to resolved.
@douardda don't hesitate to reopen if it's not working as expected

Jun 3 2022, 3:17 PM · System administration
vsellier added a comment to T4265: Create test topics on staging's kafka for the mirror tests.

Permission of users should be ok:

  • mirror-test-rw
root@getty:~# diff -U3 /usr/local/sbin/create_kafka_users_rocquencourt_staging.sh create_kafka_test_users_rw_rocquencourt_staging.sh 
--- /usr/local/sbin/create_kafka_users_rocquencourt_staging.sh	2022-01-21 16:57:22.076322616 +0000
+++ create_kafka_test_users_rw_rocquencourt_staging.sh	2022-06-03 13:02:03.497371791 +0000
@@ -56,15 +72,15 @@
 	--entity-type users \
 	--entity-name $username
Jun 3 2022, 3:16 PM · System administration
vsellier added a comment to T4265: Create test topics on staging's kafka for the mirror tests.

Topics created:

storage1 /opt/kafka/bin% TOPICS="content directory extid metadata_authority metadata_fetcher origin origin_visit origin_visit_status raw_extrinsic_metadata release revision skipped_content snapshot"
Jun 3 2022, 12:16 PM · System administration
vsellier requested review of D7956: Provenance: Docker for origin layer + logging improvement.
Jun 3 2022, 11:55 AM
vsellier added a comment to T4302: provenance: origin processed several time by different clients.

after some ingestion time, it seems the first analysis is wrong:

(provenance) provenance-client01:~$ sort ~/origin-client.log |grep Processing | uniq -c  | sort -n | tail -n 10
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/360factors/workflow-conditions.git
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/4s/cda-document-viewer.git
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/4tic/goc.git
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/ALEXks/sapfor_2017.git
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/ASIWeb/testcomplete.git
      7 INFO:swh.provenance.origin:Processing origin https://anonhg.netbsd.org/xsrc-draft/
      7 INFO:swh.provenance.origin:Processing origin https://anonhg.netbsd.org/xsrc-public/
      8 INFO:swh.provenance.origin:Processing origin https://anonhg.netbsd.org/xsrc/
     13 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/9front/plan9front
     20 INFO:swh.provenance.origin:Processing origin https://anongit.kde.org/kdenlive.git
Jun 3 2022, 11:46 AM · Provenance database
vsellier added a revision to T4302: provenance: origin processed several time by different clients: D7956: Provenance: Docker for origin layer + logging improvement.
Jun 3 2022, 11:44 AM · Provenance database

Jun 2 2022

vsellier added a comment to T4302: provenance: origin processed several time by different clients.

It's not as bad as expected, it seems only 2 clients are proceeding the same origin at the same time:

Jun 2 2022, 5:34 PM · Provenance database
vsellier triaged T4302: provenance: origin processed several time by different clients as Normal priority.
Jun 2 2022, 3:40 PM · Provenance database
vsellier committed R260:1da3bef0518a: graphql: fix config file permissions (authored by vsellier).
graphql: fix config file permissions
Jun 2 2022, 10:39 AM
vsellier changed the status of T4265: Create test topics on staging's kafka for the mirror tests from Open to Work in Progress.
Jun 2 2022, 10:19 AM · System administration

Jun 1 2022

vsellier committed rDPROV101a9a0aa2e1: Add statistic for uniq origin processing (authored by vsellier).
Add statistic for uniq origin processing
Jun 1 2022, 6:39 PM
vsellier accepted D7928: Deploy new origin intrinsic metadata journal client indexer.

I'm not sure if the service should be stopped before being removed but otherwise it looks ok

Jun 1 2022, 11:40 AM
vsellier closed D7931: Remove the autoreload option.
Jun 1 2022, 11:23 AM
vsellier committed R259:c31308537db0: Remove the autoreload option (authored by vsellier).
Remove the autoreload option
Jun 1 2022, 11:23 AM
vsellier closed D7932: Add chart to deploy graphql.
Jun 1 2022, 11:19 AM
vsellier committed R260:ddbf7c0e6080: Add chart to deploy graphql (authored by vsellier).
Add chart to deploy graphql
Jun 1 2022, 11:19 AM
vsellier added inline comments to D7932: Add chart to deploy graphql.
Jun 1 2022, 11:18 AM
vsellier updated the diff for D7932: Add chart to deploy graphql.

fix typos according to the feedback

Jun 1 2022, 11:18 AM
vsellier requested changes to D7928: Deploy new origin intrinsic metadata journal client indexer.

after rereading this together, it seems the worker activation is missing

Jun 1 2022, 11:02 AM
vsellier accepted D7928: Deploy new origin intrinsic metadata journal client indexer.
Jun 1 2022, 10:57 AM
vsellier added a comment to D7928: Deploy new origin intrinsic metadata journal client indexer.
  • Is the scheduler section in 'swh::deploy::indexer_journal_client::config' still needed ?
Jun 1 2022, 10:30 AM
vsellier updated the diff for D7932: Add chart to deploy graphql.

update storage default url to match a local service deployed on the cluster

Jun 1 2022, 9:40 AM
vsellier added a comment to D7931: Remove the autoreload option.

The target of this repository is staging / production or even test environments.

Jun 1 2022, 9:34 AM
vsellier updated the diff for D7932: Add chart to deploy graphql.

Remove testing values file

Jun 1 2022, 8:58 AM
vsellier requested review of D7932: Add chart to deploy graphql.
Jun 1 2022, 8:46 AM
vsellier added a revision to T4134: Package the graphql service: D7932: Add chart to deploy graphql.
Jun 1 2022, 8:46 AM · System administration, GraphQL API

May 31 2022

vsellier added a revision to T4134: Package the graphql service: D7931: Remove the autoreload option.
May 31 2022, 7:34 PM · System administration, GraphQL API
vsellier requested review of D7931: Remove the autoreload option.
May 31 2022, 7:34 PM
vsellier closed D7925: Recreate the staging-worker cluster.
May 31 2022, 3:40 PM
vsellier committed rSPREd7d5ff6b4b64: Recreate the staging-worker cluster (authored by vsellier).
Recreate the staging-worker cluster
May 31 2022, 3:40 PM
vsellier requested review of D7925: Recreate the staging-worker cluster.
May 31 2022, 3:37 PM
vsellier added a revision to T4278: Elastic worker cluster failures to unstuck: D7925: Recreate the staging-worker cluster.
May 31 2022, 3:37 PM · System administration, Roadmap 2022
vsellier added a comment to T4278: Elastic worker cluster failures to unstuck.

An interesting lead that could possibly explain what happened on the cluster : https://etcd.io/docs/v3.4/faq/#should-i-add-a-member-before-removing-an-unhealthy-member

May 31 2022, 3:18 PM · System administration, Roadmap 2022
vsellier added a comment to T4278: Elastic worker cluster failures to unstuck.

FI: an odd number of nodes is recommended for an etcd cluster

May 31 2022, 3:11 PM · System administration, Roadmap 2022
vsellier closed T4278: Elastic worker cluster failures to unstuck as Resolved.

The cluster is up and running.

May 31 2022, 3:05 PM · System administration, Roadmap 2022