Feed All Stories

Apr 7 2020

swh-public-ci added a comment to D2970: Make swh/storage/storage.py flake8 compliant.

Build has FAILED

Apr 7 2020, 4:46 PM
anlambert accepted D2970: Make swh/storage/storage.py flake8 compliant.
Apr 7 2020, 4:45 PM
swh-public-ci added a comment to D2969: Add a setup.cfg file to configure flake8 for black compatibility.

Build has FAILED

Apr 7 2020, 4:45 PM
Harbormaster failed remote builds in B11649: Diff 10561 for D2969: Add a setup.cfg file to configure flake8 for black compatibility!
Apr 7 2020, 4:45 PM
douardda created D2971: Make Storage.origin_visit_update() add an OriginVisit model entity in the journal.
Apr 7 2020, 4:45 PM
anlambert accepted D2969: Add a setup.cfg file to configure flake8 for black compatibility.
Apr 7 2020, 4:45 PM
douardda created D2970: Make swh/storage/storage.py flake8 compliant.
Apr 7 2020, 4:44 PM
anlambert accepted D2968: Fix the inmemory journal writer.

Looks good to me.

Apr 7 2020, 4:44 PM
douardda created D2969: Add a setup.cfg file to configure flake8 for black compatibility.
Apr 7 2020, 4:44 PM
swh-public-ci added a comment to D2968: Fix the inmemory journal writer.

Build is green

Apr 7 2020, 4:43 PM
douardda created D2968: Fix the inmemory journal writer.
Apr 7 2020, 4:38 PM
douardda abandoned D2963: test: fix test_content_add_validation() for swh.model 0.0.64.

It seems not to be necessary (besides not being the proper fix, if any)

Apr 7 2020, 4:30 PM
swh-public-ci added a comment to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

Build is green

Apr 7 2020, 4:30 PM
swh-public-ci added a comment to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

Build is green

Apr 7 2020, 4:23 PM
anlambert created D2967: docker/conftest: Increase wait-for-it timeouts.
Apr 7 2020, 4:23 PM
ardumont updated the diff for D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

Update test-filter docstring

Apr 7 2020, 4:21 PM
ardumont updated the diff for D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.
  • Remove the hasattr check, as all storage implementations now have clear_buffers
  • Add the clear_buffers implementations on filter proxy storage (+ tests)
Apr 7 2020, 4:17 PM
anlambert updated the task description for T2048: Use OpenID Connect to authenticate and authorize users in Django.
Apr 7 2020, 4:13 PM · Web app
anlambert committed rDWAPPS92757050329d: package.json: Upgrade dependencies (authored by anlambert).
package.json: Upgrade dependencies
Apr 7 2020, 4:12 PM
anlambert committed rDWAPPS05e9efac023a: auth: Add OIDC session silent refresh middleware (authored by anlambert).
auth: Add OIDC session silent refresh middleware
Apr 7 2020, 4:12 PM
anlambert closed T2267: Check OIDC user session state in backend, a subtask of T2048: Use OpenID Connect to authenticate and authorize users in Django, as Resolved.
Apr 7 2020, 4:12 PM · Web app
anlambert closed T2267: Check OIDC user session state in backend as Resolved by committing rDWAPPS05e9efac023a: auth: Add OIDC session silent refresh middleware.
Apr 7 2020, 4:12 PM · Web app
anlambert closed D2877: auth: Add OIDC session silent refresh middleware.
Apr 7 2020, 4:12 PM
anlambert committed rSPSITE03fd09fc9128: webapp: Pin django and typing-extensions to buster-backports versions (authored by anlambert).
webapp: Pin django and typing-extensions to buster-backports versions
Apr 7 2020, 4:11 PM
anlambert closed D2965: webapp: Pin django and typing-extensions to buster-backports versions.
Apr 7 2020, 4:11 PM
olasd accepted D2965: webapp: Pin django and typing-extensions to buster-backports versions.
Apr 7 2020, 4:09 PM
ardumont added a comment to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

If you add clear_buffers to the basic signature of Storage, I think you can skip all the hasattrs.

Apr 7 2020, 3:53 PM
ardumont added inline comments to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.
Apr 7 2020, 3:52 PM
olasd added a comment to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

If you add clear_buffers to the basic signature of Storage, I think you can skip all the hasattrs.

Apr 7 2020, 3:50 PM
swh-public-ci added a comment to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

Build is green

Apr 7 2020, 3:45 PM
ardumont added inline comments to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.
Apr 7 2020, 3:45 PM
ardumont updated the diff for D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.
  • Rename method clear to clear_buffers
  • Add clear_buffers on all storage endpoints, passthrough for proxy storage, noop endpoint for main backend implementations
  • Make the warning about losing data when calling the clear_buffers endpoint explicit
  • Complete tests with directories and skipped_contents
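
A minimal sketch of the behaviour listed above, assuming a buffering proxy stacked on a backend. The class names, the `flush` method and the single "content" buffer are hypothetical stand-ins for illustration, not the actual swh-storage code:

```python
from typing import Dict, List


class BackendStorage:
    """Hypothetical stand-in for a main storage backend."""

    def __init__(self) -> None:
        self.contents: List[dict] = []

    def content_add(self, contents: List[dict]) -> None:
        self.contents.extend(contents)

    def clear_buffers(self) -> None:
        # A backend holds no buffers, so the endpoint is a no-op.
        return None


class BufferingProxy:
    """Hypothetical proxy that holds objects until flush() is called."""

    def __init__(self, storage: BackendStorage) -> None:
        self.storage = storage
        self._buffers: Dict[str, List[dict]] = {"content": []}

    def content_add(self, contents: List[dict]) -> None:
        self._buffers["content"].extend(contents)

    def flush(self) -> None:
        self.storage.content_add(self._buffers["content"])
        self._buffers["content"] = []

    def clear_buffers(self) -> None:
        # Warning: anything buffered and not yet flushed is lost.
        for buffered in self._buffers.values():
            buffered.clear()
        # Pass the call through so any proxy further down is cleared too.
        self.storage.clear_buffers()
```

Because every layer exposes `clear_buffers`, callers do not need a `hasattr` check before calling it.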
Apr 7 2020, 3:40 PM
ardumont added a comment to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

This is missing tests for directories and snapshots :)

Apr 7 2020, 3:34 PM
anlambert closed D2964: debian: Prepare new swh-web deployment.
Apr 7 2020, 3:26 PM
anlambert added a comment to D2964: debian: Prepare new swh-web deployment.

Closed by rDWAPPS62041880dd71b8242df179206431803ea8aa8d1a

Apr 7 2020, 3:26 PM
olasd requested changes to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

This is missing tests for directories and snapshots :)

Apr 7 2020, 3:11 PM
ardumont added a comment to T2352: nixguix: Fails to finish as it's stuck in a loop up to memory error.

Right now, heading for 2., as the solution for 3. is still a pending question [2]

Apr 7 2020, 3:05 PM · Package Loader, Storage manager
swh-public-ci added a comment to D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.

Build is green

Apr 7 2020, 3:02 PM
ardumont created D2966: storage.buffer: Add a new clear_buffers operation for the buffer proxy.
Apr 7 2020, 2:57 PM
ardumont updated the task description for T2352: nixguix: Fails to finish as it's stuck in a loop up to memory error.
Apr 7 2020, 2:55 PM · Package Loader, Storage manager
olasd accepted D2964: debian: Prepare new swh-web deployment.

Alright. I've done a build test in unstable-swh and buster-swh and both builds have succeeded (at least until the point of tests, which will need a new tag). Thanks!

Apr 7 2020, 2:49 PM
olasd edited P641 Masterwork From Distant Lands.
Apr 7 2020, 2:39 PM
anlambert created D2965: webapp: Pin django and typing-extensions to buster-backports versions.
Apr 7 2020, 2:32 PM
anlambert retitled D2964: debian: Prepare new swh-web deployment from debian: Prepare new swh-web deployment Some updates to debian packaging of `swh-web` after local package build tests: - Bump swh module dependencies version - Upgrade Django to 2.2 (now in `buster-backports`) - Pin minimal typing-extensions... to debian: Prepare new swh-web deployment.
Apr 7 2020, 2:31 PM
Harbormaster failed remote builds in B11639: Diff 10551 for D2964: debian: Prepare new swh-web deployment!
Apr 7 2020, 2:31 PM
swh-public-ci added a comment to D2964: debian: Prepare new swh-web deployment.

Build has FAILED

Apr 7 2020, 2:31 PM
anlambert created D2964: debian: Prepare new swh-web deployment.
Apr 7 2020, 2:30 PM
anlambert updated the task description for T2303: Upgrade to Django 2.x in production.
Apr 7 2020, 2:16 PM · System administration, Web app
vlorentz requested changes to D2963: test: fix test_content_add_validation() for swh.model 0.0.64.

If the error is caused by an invalid argument, it should raise StorageArgumentException, not ValueError.
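
A rough sketch of that pattern, assuming validation errors surface as `KeyError`, `TypeError` or `ValueError`. The exception class below is a local stand-in, and `_validate` and `content_add` are hypothetical helpers, not the swh-storage implementation:

```python
from typing import Iterable


class StorageArgumentException(Exception):
    """Local stand-in for the storage-level "invalid argument" error."""


def _validate(content: dict) -> dict:
    # Hypothetical validation: a content needs a non-negative integer length.
    length = content["length"]
    if not isinstance(length, int) or length < 0:
        raise ValueError("length must be a non-negative integer")
    return content


def content_add(contents: Iterable[dict]) -> int:
    """Validate each content, reporting bad input as a storage error."""
    validated = []
    for content in contents:
        try:
            validated.append(_validate(content))
        except (KeyError, TypeError, ValueError) as exc:
            # Re-raise under the storage exception so callers catch one type.
            raise StorageArgumentException(*exc.args) from exc
    return len(validated)
```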

Apr 7 2020, 1:43 PM
olasd added a comment to T2310: Make origin visits immutable.

I'll start with a general reasoning about origin visit vs. origin visit state objects in our "conceptual" data model, as it was sprinkled throughout my comment initially.

Apr 7 2020, 12:43 PM · Storage manager, Data Model
anlambert added a comment to D2963: test: fix test_content_add_validation() for swh.model 0.0.64.

Nevertheless, a lot of tests are now failing in storage since the latest model updates.

Apr 7 2020, 12:28 PM
anlambert added a comment to D2963: test: fix test_content_add_validation() for swh.model 0.0.64.

This test did not fail with swh-model 0.0.64 (see https://jenkins.softwareheritage.org/job/DSTO/job/tests/1085/consoleFull), and it did not fail either when I ran it locally on the current master of swh-storage.

Apr 7 2020, 12:26 PM
swh-public-ci added a comment to D2963: test: fix test_content_add_validation() for swh.model 0.0.64.

Build is green

Apr 7 2020, 12:22 PM
douardda created D2963: test: fix test_content_add_validation() for swh.model 0.0.64.
Apr 7 2020, 12:16 PM
ardumont added projects to T2352: nixguix: Fails to finish as it's stuck in a loop up to memory error: Core Loader, Storage manager.
Apr 7 2020, 11:56 AM · Package Loader, Storage manager
ardumont updated the task description for T2352: nixguix: Fails to finish as it's stuck in a loop up to memory error.
Apr 7 2020, 11:55 AM · Package Loader, Storage manager
swh-public-ci added a comment to D2877: auth: Add OIDC session silent refresh middleware.

Build is green

Apr 7 2020, 11:51 AM
ardumont triaged T2352: nixguix: Fails to finish as it's stuck in a loop up to memory error as Normal priority.
Apr 7 2020, 11:50 AM · Package Loader, Storage manager
ardumont created T2352: nixguix: Fails to finish as it's stuck in a loop up to memory error.
Apr 7 2020, 11:49 AM · Package Loader, Storage manager
vlorentz accepted D2877: auth: Add OIDC session silent refresh middleware.
Apr 7 2020, 11:48 AM
anlambert updated the diff for D2877: auth: Add OIDC session silent refresh middleware.

Rebase

Apr 7 2020, 11:43 AM
ardumont added a comment to T2310: Make origin visits immutable.

It looks like you misread what I meant: I was talking about a new OriginVisitUpdate with a snapshot "inconsistent" with the previous snapshot reported by the previous OriginVisitUpdate for the same visit.

Apr 7 2020, 11:38 AM · Storage manager, Data Model
vlorentz added a comment to T2310: Make origin visits immutable.
  • do we allow an OriginVisitUpdate(status='ongoing', snapshot=yyy) with the snapshot yyy not a superset of a previous update?

It doesn't make sense to have this, but I'm not sure we should care.

I think this is a rather simple check to implement, so I don't see why we shouldn't do it. Intrinsic robustness is always a good thing to add (as long as it is not overly complex).
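
One way such a check could look, as a sketch only. It assumes a snapshot can be reduced to a mapping of branch name to target id, and that "superset" means every earlier branch is still present with the same target. Both the `Branches` alias and the `is_superset` helper are hypothetical:

```python
from typing import Dict, Optional

# Hypothetical simplification: a snapshot is a mapping branch name -> target id.
Branches = Dict[bytes, bytes]


def is_superset(new: Optional[Branches], previous: Optional[Branches]) -> bool:
    """Return True if `new` keeps every branch of `previous` unchanged."""
    if previous is None:
        # No snapshot was reported before, so anything is acceptable.
        return True
    if new is None:
        # A snapshot was reported earlier but is now missing.
        return False
    return all(
        name in new and new[name] == target
        for name, target in previous.items()
    )
```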

Apr 7 2020, 11:25 AM · Storage manager, Data Model
vlorentz added a comment to D2960: swh-model: exclude path support for the function from_disk in Directory.

Indeed. Then it would make sense to also give the full path to dir_filter, instead of adding a redundant attribute.

Apr 7 2020, 11:19 AM
douardda added a comment to T2310: Make origin visits immutable.

Thanks for the questions. I'm unsure about some of them, and I replied as best I could.

Do we allow an OriginVisitUpdate(status='ongoing', snapshot=None)? What would be the meaning of this?

Yes. It means "loading started, so no snapshot yet".
That sounds sensible ;)

Apr 7 2020, 10:56 AM · Storage manager, Data Model
douardda added a comment to T2310: Make origin visits immutable.

We currently don't have "created" (so no "start" either), but it would make sense to create it.

Regarding this model, a few questions come to my mind:

  • Do we allow an OriginVisitUpdate(status='ongoing', snapshot=None)? What would be the meaning of this? Or do we enforce one just after the created step to model the start transition?

This could mean these things:

  1. on a first update, to mean the visit was created (but we don't need it if we have a "created" state)
Apr 7 2020, 10:47 AM · Storage manager, Data Model
olasd committed rDJNL17d64ff61bcf: Set a larger max message size for the kafka used in journal tests (authored by olasd).
Set a larger max message size for the kafka used in journal tests
Apr 7 2020, 10:28 AM
olasd closed D2962: Set a larger max message size for the kafka used in journal tests.
Apr 7 2020, 10:28 AM
olasd committed rDJNL4aca80d0e313: Split kafka writer tests into a consumer loop and an object check (authored by olasd).
Split kafka writer tests into a consumer loop and an object check
Apr 7 2020, 10:28 AM
olasd closed D2961: Split kafka writer tests into a consumer loop and an object check.
Apr 7 2020, 10:28 AM
ardumont created P640 latest nixguix killed.
Apr 7 2020, 9:58 AM
ardumont added a comment to D2799: Added LaunchpadLister.

Yes that's how I do it

Apr 7 2020, 9:49 AM
ardumont accepted D2962: Set a larger max message size for the kafka used in journal tests.
Apr 7 2020, 9:39 AM
ardumont accepted D2961: Split kafka writer tests into a consumer loop and an object check.
Apr 7 2020, 9:38 AM

Apr 6 2020

olasd added a subtask for T2351: Consider backfilling mistakenly rejected large objects from PostgreSQL: T2350: Support large messages in swh.journal / kafka.
Apr 6 2020, 10:35 PM · Journal
olasd added a parent task for T2350: Support large messages in swh.journal / kafka: T2351: Consider backfilling mistakenly rejected large objects from PostgreSQL.
Apr 6 2020, 10:35 PM · Journal
olasd triaged T2351: Consider backfilling mistakenly rejected large objects from PostgreSQL as Normal priority.
Apr 6 2020, 10:35 PM · Journal
olasd triaged T2350: Support large messages in swh.journal / kafka as High priority.
Apr 6 2020, 10:31 PM · Journal
olasd triaged T2349: Make the journal writer reliable as High priority.
Apr 6 2020, 10:25 PM · Journal
olasd triaged T2348: swh.journal silently loses large objects instead of rejecting them as High priority.
Apr 6 2020, 10:22 PM · Mirror, Journal
DanSeraf added a comment to D2960: swh-model: exclude path support for the function from_disk in Directory.

I can define a dir_filter function, but how could I get the values I need, like the full path generated inside from_disk, without touching the implementation?

I'm not sure I understand the question.
from_disk has a save_path parameter. If set to True, the paths are stored alongside the output result.
Isn't that enough?

Apr 6 2020, 8:27 PM
swh-public-ci added a comment to D2962: Set a larger max message size for the kafka used in journal tests.

Build is green

Apr 6 2020, 7:32 PM
olasd created D2962: Set a larger max message size for the kafka used in journal tests.
Apr 6 2020, 7:27 PM
swh-public-ci added a comment to D2961: Split kafka writer tests into a consumer loop and an object check.

Build is green

Apr 6 2020, 7:14 PM
olasd created D2961: Split kafka writer tests into a consumer loop and an object check.
Apr 6 2020, 7:10 PM
olasd updated the task description for T2345: Improve handling of recurrent loading tasks in scheduler.
Apr 6 2020, 7:06 PM · Sprint 2021 01, Archive coverage, Scheduling utilities
legau added inline comments to D2799: Added LaunchpadLister.
Apr 6 2020, 7:00 PM
legau added a comment to D2799: Added LaunchpadLister.

Yes that's how I do it

Apr 6 2020, 6:50 PM
ardumont added a comment to D2960: swh-model: exclude path support for the function from_disk in Directory.

I can define a dir_filter function, but how could I get the values I need, like the full path generated inside from_disk, without touching the implementation?

Apr 6 2020, 6:18 PM
DanSeraf added a comment to D2960: swh-model: exclude path support for the function from_disk in Directory.

I think in the end, this should be implemented with the dir_filter parameter instead.

Please check that you can do this. If you can, there is no need to change the current implementation.
Instead, either:

  • do not touch swh.model at all: define your ignore_path function in swh-scanner and use it when calling from_disk.
  • if you think this can be a shared behavior (I guess it can), define a function callable here in swh-model (like accept_all_directories and the other ones next to it).

In both cases, this won't touch Directory.from_disk though.
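
A possible shape for such a filter, as a sketch. It assumes the filter is called with the full directory path, the directory name and its entries, which is the extension suggested in this thread rather than a confirmed swh-model signature. The `ignore_paths` name and the glob-pattern matching are illustrative choices:

```python
import fnmatch
import os
from typing import Any, Callable, Iterable, Sequence, Union

PathLike = Union[str, bytes]


def ignore_paths(patterns: Sequence[str]) -> Callable[..., bool]:
    """Build a filter that rejects directories matching any glob pattern."""

    def dir_filter(dirpath: PathLike, dirname: PathLike, entries: Iterable[Any]) -> bool:
        # Assumption: dirpath is the full path of the directory being visited.
        full_path = os.fsdecode(dirpath)
        # Returning False tells the caller to skip this directory.
        return not any(fnmatch.fnmatch(full_path, pattern) for pattern in patterns)

    return dir_filter
```

The resulting callable would then be handed over as the dir_filter argument, leaving Directory.from_disk itself unchanged.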

Apr 6 2020, 6:10 PM
ardumont added a comment to D2799: Added LaunchpadLister.

Build is still broken even with rebasing

Apr 6 2020, 6:07 PM
ardumont added inline comments to D2799: Added LaunchpadLister.
Apr 6 2020, 6:01 PM
legau added a comment to D2799: Added LaunchpadLister.

Build is still broken even with rebasing

Apr 6 2020, 5:49 PM
Harbormaster failed remote builds in B11632: Diff 10543 for D2799: Added LaunchpadLister!
Apr 6 2020, 5:49 PM
swh-public-ci added a comment to D2799: Added LaunchpadLister.

Build has FAILED

Apr 6 2020, 5:49 PM
legau updated the diff for D2799: Added LaunchpadLister.

Modified Fixtures

Apr 6 2020, 5:48 PM
ardumont requested changes to D2960: swh-model: exclude path support for the function from_disk in Directory.

I think in the end, this should be implemented with the dir_filter parameter instead.

Apr 6 2020, 5:40 PM
legau added inline comments to D2799: Added LaunchpadLister.
Apr 6 2020, 5:35 PM
vlorentz added inline comments to D2960: swh-model: exclude path support for the function from_disk in Directory.
Apr 6 2020, 5:28 PM
ardumont added inline comments to D2960: swh-model: exclude path support for the function from_disk in Directory.
Apr 6 2020, 4:44 PM