fixed with the workaround in D7972
Jun 15 2022
Jun 14 2022
Jun 8 2022
Because it looks simpler and faster to have this workaround here.
Jun 7 2022
Is it not possible to use a SQL script directly, along the lines of [1]?
I suppose it would make the script simpler, as the postgresql image logic takes care of running the script only during the database initialization.
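For context, the official postgres docker image only runs files dropped under /docker-entrypoint-initdb.d when the data directory is empty, i.e. during the first initialization. A minimal sketch, assuming a hypothetical init.sql holding the statements currently applied by the workaround (image tag and paths are illustrative):

# init.sql is executed once, when the cluster is initialized
docker run -d \
  -e POSTGRES_PASSWORD=postgres \
  -v "$PWD/init.sql:/docker-entrypoint-initdb.d/10-init.sql:ro" \
  postgres:13
# on later restarts the data directory already exists, so the script is skipped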
Jun 4 2022
This issue was a false lead: different snapshots are declared for the same origin, which led me to believe there were duplicates.
I completely missed it and fell head first into the trap
Jun 3 2022
only use the postgresql parameter to configure the max connections
update the parameter count limit according to the newly supported ones
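For the record, the connection limit can be passed straight to the postgres server as a command-line parameter; a sketch only, using the official docker image (image tag and value are illustrative, not the ones actually deployed):

# arguments after the image name are forwarded to the postgres server
docker run -d -e POSTGRES_PASSWORD=postgres postgres:13 -c max_connections=200

# check the effective value
psql -h <host> -U postgres -c 'SHOW max_connections;'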
I've also added permissions for the mirror-test-rw user to create and destroy topics, so you should be able to manage the lifecycle of the swh.test.objects[_privileged] topics
btw the credentials are pushed to the credential store
heh, it's missing the last part of the task (the commands to manage the topics)
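For completeness, a hedged sketch of what those topic-management commands look like with the stock kafka-topics.sh tool; $BROKER, the client.properties authentication file and the partition/replication values are placeholders, not the actual staging settings:

# list the existing test topics
/opt/kafka/bin/kafka-topics.sh --bootstrap-server $BROKER --command-config client.properties \
  --list | grep '^swh.test.objects'

# create a topic, then delete it once the test run is over
/opt/kafka/bin/kafka-topics.sh --bootstrap-server $BROKER --command-config client.properties \
  --create --topic swh.test.objects.content --partitions 64 --replication-factor 1
/opt/kafka/bin/kafka-topics.sh --bootstrap-server $BROKER --command-config client.properties \
  --delete --topic swh.test.objects.content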
Changing the status to resolved.
@douardda don't hesitate to reopen if it's not working as expected
Permissions of the users should be ok:
- mirror-test-rw
root@getty:~# diff -U3 /usr/local/sbin/create_kafka_users_rocquencourt_staging.sh create_kafka_test_users_rw_rocquencourt_staging.sh
--- /usr/local/sbin/create_kafka_users_rocquencourt_staging.sh 2022-01-21 16:57:22.076322616 +0000
+++ create_kafka_test_users_rw_rocquencourt_staging.sh 2022-06-03 13:02:03.497371791 +0000
@@ -56,15 +72,15 @@
         --entity-type users \
         --entity-name $username
Topics created:
storage1 /opt/kafka/bin% TOPICS="content directory extid metadata_authority metadata_fetcher origin origin_visit origin_visit_status raw_extrinsic_metadata release revision skipped_content snapshot"
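For reference, the creation itself boils down to a loop over that variable; this is a sketch only, where $BROKER, the adm.properties auth file, the swh.test.objects prefix and the partition/replication counts are assumptions rather than the exact values used:

for topic in $TOPICS; do
  ./kafka-topics.sh --bootstrap-server $BROKER --command-config adm.properties \
    --create --topic "swh.test.objects.$topic" \
    --partitions 64 --replication-factor 1
done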
after some ingestion time, it seems the first analysis is wrong:
(provenance) provenance-client01:~$ sort ~/origin-client.log |grep Processing | uniq -c | sort -n | tail -n 10
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/360factors/workflow-conditions.git
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/4s/cda-document-viewer.git
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/4tic/goc.git
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/ALEXks/sapfor_2017.git
      6 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/ASIWeb/testcomplete.git
      7 INFO:swh.provenance.origin:Processing origin https://anonhg.netbsd.org/xsrc-draft/
      7 INFO:swh.provenance.origin:Processing origin https://anonhg.netbsd.org/xsrc-public/
      8 INFO:swh.provenance.origin:Processing origin https://anonhg.netbsd.org/xsrc/
     13 INFO:swh.provenance.origin:Processing origin https://bitbucket.org/9front/plan9front
     20 INFO:swh.provenance.origin:Processing origin https://anongit.kde.org/kdenlive.git
Jun 2 2022
It's not as bad as expected; it seems only 2 clients are processing the same origin at the same time:
Jun 1 2022
I'm not sure if the service should be stopped before being removed, but otherwise it looks ok
fix typos according to the feedback
after rereading this together, it seems the worker activation is missing
- Is the scheduler section in 'swh::deploy::indexer_journal_client::config' still needed?
update the default storage url to match a local service deployed on the cluster
This repository targets staging / production environments, or even test environments.
Remove testing values file
May 31 2022
An interesting lead that could possibly explain what happened on the cluster: https://etcd.io/docs/v3.4/faq/#should-i-add-a-member-before-removing-an-unhealthy-member
FYI: an odd number of nodes is recommended for an etcd cluster
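For future reference, membership changes go through etcdctl, and the FAQ linked above recommends removing the unhealthy member before registering its replacement (adding first raises the quorum size). A sketch, with endpoints, member id, name and peer url as placeholders:

# spot the unhealthy member
etcdctl --endpoints=https://<node1>:2379 member list

# remove it first, then add the replacement node
etcdctl --endpoints=https://<node1>:2379 member remove <member-id>
etcdctl --endpoints=https://<node1>:2379 member add <new-member-name> --peer-urls=https://<new-node>:2380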
The cluster is up and running.
Unfortunately, after several tries, we were unable to restart the cluster due to a problem with the etcd leader election / data on the nodes (probably a wrong manipulation on our part).
We finally destroyed the cluster (we had to follow [1] because the cluster was in an unstable state and rancher refused to remove it)
May 30 2022
I tried to fix it too but without success, so I guess we can go with this diff until we find a better solution
May 24 2022
May 23 2022
- Following this page to clean up all the current resources on rancher-node-intership[0-2]: https://rancher.com/docs/rancher/v2.5/en/cluster-admin/cleaning-cluster-nodes/
- Restart the nodes
- Delete the deployment-intership cluster in rancher
- Add the desired kubernetes version in terraform
- Apply
- Launch the docker command to register the nodes on each node (see the sketch below)
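The registration command in the last step is the docker run line displayed by the Rancher UI for the new cluster; it is roughly of this shape (version tag, server URL, token and checksum are placeholders, the real values come from the UI):

docker run -d --privileged --restart=unless-stopped --net=host \
  -v /etc/kubernetes:/etc/kubernetes -v /var/run:/var/run \
  rancher/rancher-agent:v2.6.x \
  --server https://<rancher-server> --token <token> --ca-checksum <checksum> \
  --etcd --controlplane --worker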
thanks, I forgot to mention that
The error when we try to declare a resource in 1.22:
May 19 2022
rebase
rebase
May 18 2022
edit commit message
The command name called by postfix[1] does not seem to match the command name declared in the webapp[2]