Squashed with D6272
Details
- Reviewers
olasd douardda - Group Reviewers
Reviewers - Commits
- rDPROV172e327c2588: Remove remote provenance storage based on `swh.core.api.RPCClient`
Diff Detail
- Repository
- rDPROV Provenance database
- Branch
- master
- Lint
Lint Skipped - Unit
Unit Tests Skipped - Build Status
Buildable 23989 Build 37425: Phabricator diff pipeline on jenkins Jenkins console · Jenkins Build 37424: arc lint + arc unit
Event Timeline
Build is green
Patch application report for D6273 (id=22710)
Could not rebase; Attempt merge onto 3383cae57e...
Updating 3383cae..17c6926 Fast-forward mypy.ini | 3 + swh/provenance/__init__.py | 14 +- swh/provenance/api/client.py | 508 ++++++++++++++- swh/provenance/api/server.py | 818 ++++++++++++++++++++++--- swh/provenance/cli.py | 29 +- swh/provenance/origin.py | 13 + swh/provenance/postgresql/archive.py | 13 + swh/provenance/revision.py | 13 + swh/provenance/storage/archive.py | 13 + swh/provenance/tests/conftest.py | 31 +- swh/provenance/tests/test_archive_interface.py | 16 +- 11 files changed, 1345 insertions(+), 126 deletions(-)
Changes applied before test
commit 17c692664f6da9593919c50460d8aec54819a242 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` commit b11a446102843ed6197505ba18cfd6807b3f75aa Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 13:39:59 2021 +0200 Improve server/client shoutdown logic commit 37e87534960abd9bcb5a5082ec933b4a542b3b21 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Aug 31 13:36:34 2021 +0200 Rework `ProvenanceStorageRabbitMQWorker` to handle connection loss Use `pika.SelectConnection` and make an explicit handle of its life-cycle. Improve connection error handling on both client and server side. Change the RabbitMQ scheme to use 5 exchanges (one per entity + location). Each exchange handles all entity related insertions, dispatching to different queues depending on the requested `ProvenanceStorageInterface` methods (16 queues per methods). For instance, the `content` exchange handles all requests for `content_add` and `relation_add` for both relations `CNT_EARLY_IN_REV` and `CNT_IN_DIR` (ie. relations with content as source). In each case, requests are forwarded to 1 of 16 possible workers, depending on the sha1 id of the content. commit d133b5448d26347ed761d678deb5f3ecd17ae86c Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Aug 20 12:21:27 2021 +0200 Add new RabbitMQ-based client/server API Get methods in the `ProvenanceStorageInterface` are called through a server that guarantees conflict-free writings to the underlying database. Set methods are called directly from the client to avoid RCP overhead for reads. The server spawns multiple processes to handle independent requests concurrently. commit e0509cb75b21e7594de55b66818b3d3d663e0dcc Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes commit d0a11a1499839baebf67407314a78029057b606a Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 10:52:44 2021 +0200 Add StatsD support to the main algorithms of the module commit a8009be206dc1e414a69a708f826266b750b74cd Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Sep 14 15:33:33 2021 +0200 Add StatsD support to `ArchiveInterface` implementations
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/349/ for more details.
Looks like it is supposed to be replaced by the rabbitmq-based RPC? Please explain it in the commit/diff message.
You mean, why are we removing this backend implentation? Because it doesn't serve our purposes, the one using RabbitMQ is the only remote implementation we have that guarantees conflict free writings to the storage.
Build is green
Patch application report for D6273 (id=23005)
Could not rebase; Attempt merge onto 4c087ea0ec...
Updating 4c087ea..3b7827e Fast-forward mypy.ini | 3 + swh/provenance/__init__.py | 14 +- swh/provenance/api/client.py | 528 +++++++++++++++++++- swh/provenance/api/server.py | 844 +++++++++++++++++++++++++++++--- swh/provenance/cli.py | 33 +- swh/provenance/interface.py | 9 + swh/provenance/mongo/backend.py | 3 + swh/provenance/postgresql/provenance.py | 3 + swh/provenance/provenance.py | 3 + swh/provenance/tests/conftest.py | 57 +-- 10 files changed, 1366 insertions(+), 131 deletions(-)
Changes applied before test
commit 3b7827e09876f3d355672455ac532ef7daf430e1 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected. The new `ProvenanceStorageRabbitMQ` remote storage should be used instead. commit 5b2d1183b3af02059731f6020186275c832281cc Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 13:39:59 2021 +0200 Improve server/client shoutdown logic and error handling Add `close` method to both `ProvenanceInterface` and `ProvenanceStorageInterface` to explicitly release resources. commit 728bccedf66d70c6d0f2a64efeec41b416039a57 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Aug 31 13:36:34 2021 +0200 Rework `ProvenanceStorageRabbitMQWorker` to handle connection loss Use `pika.SelectConnection` and make an explicit handle of its life-cycle. Improve connection error handling on both client and server side. Change the RabbitMQ scheme to use 5 exchanges (one per entity + location). Each exchange handles all entity related insertions, dispatching to different queues depending on the requested `ProvenanceStorageInterface` methods (16 queues per methods). For instance, the `content` exchange handles all requests for `content_add` and `relation_add` for both relations `CNT_EARLY_IN_REV` and `CNT_IN_DIR` (ie. relations with content as source). In each case, requests are forwarded to 1 of 16 possible workers, depending on the sha1 id of the content. commit 243061be5bcdd4bc1da9cc40e956b9186d924d84 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Aug 20 12:21:27 2021 +0200 Add new RabbitMQ-based client/server API Get methods in the `ProvenanceStorageInterface` are called through a server that guarantees conflict-free writings to the underlying database. Set methods are called directly from the client to avoid RCP overhead for reads. The server spawns multiple processes to handle independent requests concurrently. commit 6c3071493b5d3f187113493275d402a27866da95 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes Make names consistent with the naming convention used for other components.
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/362/ for more details.
Build has FAILED
Patch application report for D6273 (id=23009)
Could not rebase; Attempt merge onto 4c087ea0ec...
Updating 4c087ea..b32f7ba Fast-forward .gitignore | 3 +- mypy.ini | 3 + pytest.ini | 5 + requirements-test.txt | 1 + swh/provenance/__init__.py | 14 +- swh/provenance/api/client.py | 528 +++++++++++++++++++- swh/provenance/api/server.py | 844 +++++++++++++++++++++++++++++--- swh/provenance/cli.py | 33 +- swh/provenance/interface.py | 9 + swh/provenance/mongo/backend.py | 3 + swh/provenance/postgresql/provenance.py | 3 + swh/provenance/provenance.py | 3 + swh/provenance/tests/conftest.py | 79 +-- tox.ini | 4 +- 14 files changed, 1399 insertions(+), 133 deletions(-)
Changes applied before test
commit b32f7ba5c70d732c7c50992811716f976920f94d Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected. The new `ProvenanceStorageRabbitMQ` remote storage should be used instead. commit 365f297f9362cee1bbc78f0d1a178f0846a32444 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Sep 21 16:13:53 2021 +0200 Add support for remote backend on existing storage tests commit 42a2ee4e90755bf97cbd52fc151933d495916d00 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 13:39:59 2021 +0200 Improve server/client shoutdown logic and error handling Add `close` method to both `ProvenanceInterface` and `ProvenanceStorageInterface` to explicitly release resources. commit 728bccedf66d70c6d0f2a64efeec41b416039a57 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Aug 31 13:36:34 2021 +0200 Rework `ProvenanceStorageRabbitMQWorker` to handle connection loss Use `pika.SelectConnection` and make an explicit handle of its life-cycle. Improve connection error handling on both client and server side. Change the RabbitMQ scheme to use 5 exchanges (one per entity + location). Each exchange handles all entity related insertions, dispatching to different queues depending on the requested `ProvenanceStorageInterface` methods (16 queues per methods). For instance, the `content` exchange handles all requests for `content_add` and `relation_add` for both relations `CNT_EARLY_IN_REV` and `CNT_IN_DIR` (ie. relations with content as source). In each case, requests are forwarded to 1 of 16 possible workers, depending on the sha1 id of the content. commit 243061be5bcdd4bc1da9cc40e956b9186d924d84 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Aug 20 12:21:27 2021 +0200 Add new RabbitMQ-based client/server API Get methods in the `ProvenanceStorageInterface` are called through a server that guarantees conflict-free writings to the underlying database. Set methods are called directly from the client to avoid RCP overhead for reads. The server spawns multiple processes to handle independent requests concurrently. commit 6c3071493b5d3f187113493275d402a27866da95 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes Make names consistent with the naming convention used for other components.
Link to build: https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/364/
See console output for more information: https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/364/console
Build has FAILED
Patch application report for D6273 (id=23009)
Could not rebase; Attempt merge onto 4c087ea0ec...
Updating 4c087ea..b32f7ba Fast-forward .gitignore | 3 +- mypy.ini | 3 + pytest.ini | 5 + requirements-test.txt | 1 + swh/provenance/__init__.py | 14 +- swh/provenance/api/client.py | 528 +++++++++++++++++++- swh/provenance/api/server.py | 844 +++++++++++++++++++++++++++++--- swh/provenance/cli.py | 33 +- swh/provenance/interface.py | 9 + swh/provenance/mongo/backend.py | 3 + swh/provenance/postgresql/provenance.py | 3 + swh/provenance/provenance.py | 3 + swh/provenance/tests/conftest.py | 79 +-- tox.ini | 4 +- 14 files changed, 1399 insertions(+), 133 deletions(-)
Changes applied before test
commit b32f7ba5c70d732c7c50992811716f976920f94d Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected. The new `ProvenanceStorageRabbitMQ` remote storage should be used instead. commit 365f297f9362cee1bbc78f0d1a178f0846a32444 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Sep 21 16:13:53 2021 +0200 Add support for remote backend on existing storage tests commit 42a2ee4e90755bf97cbd52fc151933d495916d00 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 13:39:59 2021 +0200 Improve server/client shoutdown logic and error handling Add `close` method to both `ProvenanceInterface` and `ProvenanceStorageInterface` to explicitly release resources. commit 728bccedf66d70c6d0f2a64efeec41b416039a57 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Aug 31 13:36:34 2021 +0200 Rework `ProvenanceStorageRabbitMQWorker` to handle connection loss Use `pika.SelectConnection` and make an explicit handle of its life-cycle. Improve connection error handling on both client and server side. Change the RabbitMQ scheme to use 5 exchanges (one per entity + location). Each exchange handles all entity related insertions, dispatching to different queues depending on the requested `ProvenanceStorageInterface` methods (16 queues per methods). For instance, the `content` exchange handles all requests for `content_add` and `relation_add` for both relations `CNT_EARLY_IN_REV` and `CNT_IN_DIR` (ie. relations with content as source). In each case, requests are forwarded to 1 of 16 possible workers, depending on the sha1 id of the content. commit 243061be5bcdd4bc1da9cc40e956b9186d924d84 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Aug 20 12:21:27 2021 +0200 Add new RabbitMQ-based client/server API Get methods in the `ProvenanceStorageInterface` are called through a server that guarantees conflict-free writings to the underlying database. Set methods are called directly from the client to avoid RCP overhead for reads. The server spawns multiple processes to handle independent requests concurrently. commit 6c3071493b5d3f187113493275d402a27866da95 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes Make names consistent with the naming convention used for other components.
Link to build: https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/365/
See console output for more information: https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/365/console
Build is green
Patch application report for D6273 (id=23014)
Could not rebase; Attempt merge onto 4c087ea0ec...
Updating 4c087ea..f2ca78f Fast-forward .gitignore | 3 +- mypy.ini | 3 + pytest.ini | 5 + requirements-test.txt | 1 + swh/provenance/__init__.py | 14 +- swh/provenance/api/client.py | 528 +++++++++++++++++++- swh/provenance/api/server.py | 844 +++++++++++++++++++++++++++++--- swh/provenance/cli.py | 33 +- swh/provenance/interface.py | 9 + swh/provenance/mongo/backend.py | 3 + swh/provenance/postgresql/provenance.py | 3 + swh/provenance/provenance.py | 3 + swh/provenance/tests/conftest.py | 79 +-- tox.ini | 3 +- 14 files changed, 1398 insertions(+), 133 deletions(-)
Changes applied before test
commit f2ca78f8f0b7e9240dfd623c2d22f29efbc7e707 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected. The new `ProvenanceStorageRabbitMQ` remote storage should be used instead. commit bf2c97ebd296eb9034279a36fbb9d8aaac4bfe5a Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Sep 21 16:13:53 2021 +0200 Add support for remote backend on existing storage tests commit 42a2ee4e90755bf97cbd52fc151933d495916d00 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 13:39:59 2021 +0200 Improve server/client shoutdown logic and error handling Add `close` method to both `ProvenanceInterface` and `ProvenanceStorageInterface` to explicitly release resources. commit 728bccedf66d70c6d0f2a64efeec41b416039a57 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Aug 31 13:36:34 2021 +0200 Rework `ProvenanceStorageRabbitMQWorker` to handle connection loss Use `pika.SelectConnection` and make an explicit handle of its life-cycle. Improve connection error handling on both client and server side. Change the RabbitMQ scheme to use 5 exchanges (one per entity + location). Each exchange handles all entity related insertions, dispatching to different queues depending on the requested `ProvenanceStorageInterface` methods (16 queues per methods). For instance, the `content` exchange handles all requests for `content_add` and `relation_add` for both relations `CNT_EARLY_IN_REV` and `CNT_IN_DIR` (ie. relations with content as source). In each case, requests are forwarded to 1 of 16 possible workers, depending on the sha1 id of the content. commit 243061be5bcdd4bc1da9cc40e956b9186d924d84 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Aug 20 12:21:27 2021 +0200 Add new RabbitMQ-based client/server API Get methods in the `ProvenanceStorageInterface` are called through a server that guarantees conflict-free writings to the underlying database. Set methods are called directly from the client to avoid RCP overhead for reads. The server spawns multiple processes to handle independent requests concurrently. commit 6c3071493b5d3f187113493275d402a27866da95 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes Make names consistent with the naming convention used for other components.
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/368/ for more details.
Build is green
Patch application report for D6273 (id=23048)
Could not rebase; Attempt merge onto 4c087ea0ec...
Updating 4c087ea..c936324 Fast-forward .gitignore | 4 +- mypy.ini | 3 + pytest.ini | 5 + requirements-test.txt | 1 + swh/provenance/__init__.py | 14 +- swh/provenance/api/client.py | 528 +++++++++++++++++++- swh/provenance/api/server.py | 844 +++++++++++++++++++++++++++++--- swh/provenance/cli.py | 33 +- swh/provenance/interface.py | 9 + swh/provenance/mongo/backend.py | 3 + swh/provenance/postgresql/provenance.py | 3 + swh/provenance/provenance.py | 3 + swh/provenance/tests/conftest.py | 69 +-- tox.ini | 3 +- 14 files changed, 1393 insertions(+), 129 deletions(-)
Changes applied before test
commit c93632431f97c67608a9130a2b01b7bfdbd2e753 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected. The new `ProvenanceStorageRabbitMQ` remote storage should be used instead. commit 50a1441f2068a2abe1293be893e66806cf0977b9 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Sep 21 16:13:53 2021 +0200 Add support for remote backend on existing storage tests commit ba22e90f707688aed716d20253256ee12c414a33 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 13:39:59 2021 +0200 Improve server/client shoutdown logic and error handling commit 4ec6d3359e20920469cde43516bc120d4352f915 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Aug 31 13:36:34 2021 +0200 Rework `ProvenanceStorageRabbitMQWorker` to handle connection loss Use `pika.SelectConnection` and make an explicit handle of its life-cycle. Improve connection error handling on both client and server side. Change the RabbitMQ scheme to use 5 exchanges (one per entity + location). Each exchange handles all entity related insertions, dispatching to different queues depending on the requested `ProvenanceStorageInterface` methods (16 queues per methods). For instance, the `content` exchange handles all requests for `content_add` and `relation_add` for both relations `CNT_EARLY_IN_REV` and `CNT_IN_DIR` (ie. relations with content as source). In each case, requests are forwarded to 1 of 16 possible workers, depending on the sha1 id of the content. commit f6f174fb76b2c42435b6c75eb90c175d9cb0fca5 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Aug 20 12:21:27 2021 +0200 Add new RabbitMQ-based client/server API Get methods in the `ProvenanceStorageInterface` are called through a server that guarantees conflict-free writings to the underlying database. Set methods are called directly from the client to avoid RCP overhead for reads. The server spawns multiple processes to handle independent requests concurrently. commit 437c2b4ad60e15c0b102988a99766b2c503c91d5 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Sep 24 11:08:08 2021 +0200 Add `close` method to both `ProvenanceInterface` and `ProvenanceStorageInterface` The idea is to have a mechanism to explicitly release resources when needed. commit 6c3071493b5d3f187113493275d402a27866da95 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes Make names consistent with the naming convention used for other components.
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/375/ for more details.
I would suggest squashing D6272 and this together to land them at the same time.
I think you can remove types-werkzeug from requirements-test.txt. I'm not sure you can drop the http extra from swh.core dependencies in requirements-swh.txt, as the serialization/deserialization scaffolding is still in use in the rabbitmq backend.
OK, about types-werkzeug. I'll remove it.
However, I would prefer not to squash this together with D6272 since moving those commits together in the git history is really a pain of conflict resolution (I even end up with empty files that later reappear).
There is no harm on keeping ProvenanceStorageRPC until ProvenanceStorageRabbitMQ is landed.
Build is green
Patch application report for D6273 (id=23062)
Could not rebase; Attempt merge onto 4c087ea0ec...
Updating 4c087ea..4995134 Fast-forward .gitignore | 4 +- mypy.ini | 3 + pytest.ini | 5 + requirements-test.txt | 2 +- swh/provenance/__init__.py | 14 +- swh/provenance/api/client.py | 528 +++++++++++++++++++- swh/provenance/api/server.py | 844 +++++++++++++++++++++++++++++--- swh/provenance/cli.py | 33 +- swh/provenance/interface.py | 9 + swh/provenance/mongo/backend.py | 3 + swh/provenance/postgresql/provenance.py | 3 + swh/provenance/provenance.py | 3 + swh/provenance/tests/conftest.py | 69 +-- tox.ini | 3 +- 14 files changed, 1393 insertions(+), 130 deletions(-)
Changes applied before test
commit 4995134725cb5e62bcdf1d0c19ea4e2fabac5465 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected. The new `ProvenanceStorageRabbitMQ` remote storage should be used instead. commit 50a1441f2068a2abe1293be893e66806cf0977b9 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Sep 21 16:13:53 2021 +0200 Add support for remote backend on existing storage tests commit ba22e90f707688aed716d20253256ee12c414a33 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 13:39:59 2021 +0200 Improve server/client shoutdown logic and error handling commit 4ec6d3359e20920469cde43516bc120d4352f915 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Aug 31 13:36:34 2021 +0200 Rework `ProvenanceStorageRabbitMQWorker` to handle connection loss Use `pika.SelectConnection` and make an explicit handle of its life-cycle. Improve connection error handling on both client and server side. Change the RabbitMQ scheme to use 5 exchanges (one per entity + location). Each exchange handles all entity related insertions, dispatching to different queues depending on the requested `ProvenanceStorageInterface` methods (16 queues per methods). For instance, the `content` exchange handles all requests for `content_add` and `relation_add` for both relations `CNT_EARLY_IN_REV` and `CNT_IN_DIR` (ie. relations with content as source). In each case, requests are forwarded to 1 of 16 possible workers, depending on the sha1 id of the content. commit f6f174fb76b2c42435b6c75eb90c175d9cb0fca5 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Aug 20 12:21:27 2021 +0200 Add new RabbitMQ-based client/server API Get methods in the `ProvenanceStorageInterface` are called through a server that guarantees conflict-free writings to the underlying database. Set methods are called directly from the client to avoid RCP overhead for reads. The server spawns multiple processes to handle independent requests concurrently. commit 437c2b4ad60e15c0b102988a99766b2c503c91d5 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Sep 24 11:08:08 2021 +0200 Add `close` method to both `ProvenanceInterface` and `ProvenanceStorageInterface` The idea is to have a mechanism to explicitly release resources when needed. commit 6c3071493b5d3f187113493275d402a27866da95 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes Make names consistent with the naming convention used for other components.
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/376/ for more details.
Build is green
Patch application report for D6273 (id=23173)
Could not rebase; Attempt merge onto 4c087ea0ec...
Updating 4c087ea..8c0d6a7 Fast-forward .gitignore | 4 +- mypy.ini | 3 + pytest.ini | 5 + requirements-test.txt | 2 +- requirements.txt | 1 + swh/provenance/__init__.py | 28 +- swh/provenance/api/client.py | 530 +++++++++++++++++++- swh/provenance/api/server.py | 844 +++++++++++++++++++++++++++++--- swh/provenance/cli.py | 39 +- swh/provenance/interface.py | 20 + swh/provenance/mongo/backend.py | 19 +- swh/provenance/postgresql/provenance.py | 18 +- swh/provenance/provenance.py | 6 + swh/provenance/tests/conftest.py | 84 ++-- tox.ini | 3 +- 15 files changed, 1453 insertions(+), 153 deletions(-)
Changes applied before test
commit 8c0d6a735afcd07e102cd90de6e5c0a6531fb141 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected. The new `ProvenanceStorageRabbitMQ` remote storage should be used instead. commit 15fbf939c19db9f29c99b7c1d045f61967f3886c Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Sep 21 16:13:53 2021 +0200 Add support for remote backend on existing storage tests commit 2ca0c9bdf640c7cbbcf84580aa211d8cb51d55a4 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 13:39:59 2021 +0200 Improve server/client shoutdown logic and error handling commit 0ec7250ea299d42697cb3b480171efcd2926d049 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Tue Aug 31 13:36:34 2021 +0200 Rework `ProvenanceStorageRabbitMQWorker` to handle connection loss Use `pika.SelectConnection` and make an explicit handle of its life-cycle. Improve connection error handling on both client and server side. Change the RabbitMQ scheme to use 5 exchanges (one per entity + location). Each exchange handles all entity related insertions, dispatching to different queues depending on the requested `ProvenanceStorageInterface` methods (16 queues per methods). For instance, the `content` exchange handles all requests for `content_add` and `relation_add` for both relations `CNT_EARLY_IN_REV` and `CNT_IN_DIR` (ie. relations with content as source). In each case, requests are forwarded to 1 of 16 possible workers, depending on the sha1 id of the content. commit f0f3a584ea6965021990ddba926ae13c29b9560a Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Aug 20 12:21:27 2021 +0200 Add new RabbitMQ-based client/server API Get methods in the `ProvenanceStorageInterface` are called through a server that guarantees conflict-free writings to the underlying database. Set methods are called directly from the client to avoid RCP overhead for reads. The server spawns multiple processes to handle independent requests concurrently. commit 846b20e0e9995a13591a1641bf92036ff3764be5 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Sep 24 11:08:08 2021 +0200 Add `open`/`close` methods to both `ProvenanceInterface` and `ProvenanceStorageInterface` The idea is to have a mechanism to explicitly allocate/release resources when needed. commit 6c3071493b5d3f187113493275d402a27866da95 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes Make names consistent with the naming convention used for other components.
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/397/ for more details.
Build is green
Patch application report for D6273 (id=23180)
Could not rebase; Attempt merge onto 4c087ea0ec...
Updating 4c087ea..8d401db Fast-forward requirements-test.txt | 1 - requirements.txt | 1 + swh/provenance/__init__.py | 24 +++------ swh/provenance/api/client.py | 13 ----- swh/provenance/api/server.py | 92 +-------------------------------- swh/provenance/cli.py | 13 +++-- swh/provenance/interface.py | 20 +++++++ swh/provenance/mongo/backend.py | 19 +++++-- swh/provenance/postgresql/provenance.py | 18 ++++--- swh/provenance/provenance.py | 6 +++ swh/provenance/tests/conftest.py | 63 ++++++++++------------ 11 files changed, 97 insertions(+), 173 deletions(-)
Changes applied before test
commit 8d401db34539f5df2ce2bd37080ec8ae1557417b Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 1 11:27:02 2021 +0200 Remove old client/server storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected. commit 846b20e0e9995a13591a1641bf92036ff3764be5 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Fri Sep 24 11:08:08 2021 +0200 Add `open`/`close` methods to both `ProvenanceInterface` and `ProvenanceStorageInterface` The idea is to have a mechanism to explicitly allocate/release resources when needed. commit 6c3071493b5d3f187113493275d402a27866da95 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Rename remote storage backend classes Make names consistent with the naming convention used for other components.
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/403/ for more details.
Build is green
Patch application report for D6273 (id=23265)
Rebasing onto 4c087ea0ec...
Current branch diff-target is up to date.
Changes applied before test
commit 68c54032460a8f9b97b48417404e910e88a84328 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Remove remote provenance storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected.
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/416/ for more details.
Build is green
Patch application report for D6273 (id=23267)
Rebasing onto 4c087ea0ec...
Current branch diff-target is up to date.
Changes applied before test
commit 172e327c25883bee768a9c16b850ce6aab7e2eb2 Author: Andres Ezequiel Viso <aeviso@softwareheritage.org> Date: Wed Sep 15 16:14:10 2021 +0200 Remove remote provenance storage based on `swh.core.api.RPCClient` This implementation was a first attempt for conflict resolution that didn't worked as expected.
See https://jenkins.softwareheritage.org/job/DPROV/job/tests-on-diff/418/ for more details.