Page MenuHomeSoftware Heritage
Feed Advanced Search

Jul 9 2020

douardda closed D3476: Convert ImmutableDict to dict before passing it to json.dumps..
Jul 9 2020, 11:44 AM
douardda added inline comments to D3476: Convert ImmutableDict to dict before passing it to json.dumps..
Jul 9 2020, 11:42 AM
douardda updated the diff for D3476: Convert ImmutableDict to dict before passing it to json.dumps..

simplify the diff by killing map_optional()

Jul 9 2020, 11:33 AM
douardda commandeered D3476: Convert ImmutableDict to dict before passing it to json.dumps..
Jul 9 2020, 11:09 AM
douardda added a comment to D3476: Convert ImmutableDict to dict before passing it to json.dumps..

Not sure I buy the map_optional here. When I read a line of code with [x if x else f(x)] I understand it right away. With map_optional, I need to go get the definition of that oneliner function.
Plus you have the type annotation headache...

Jul 9 2020, 10:48 AM
douardda added a comment to D3475: setup: Do no expose the pytest-plugin any longer.

Adapt according to remarks:

  • Update docstring
Jul 9 2020, 10:43 AM

Jul 8 2020

douardda accepted D3469: Move all celery-related fixtures to the swh.scheduler pytest plugin.

looks fine, as far as I can tell

Jul 8 2020, 6:15 PM
douardda committed rDLDSVNfe5e4d342f30: Update the svn loader to new Revision API (authored by douardda).
Update the svn loader to new Revision API
Jul 8 2020, 5:11 PM
douardda closed D3464: Update the svn loader to new Revision API.
Jul 8 2020, 5:11 PM
douardda updated the diff for D3464: Update the svn loader to new Revision API.

typo (thx ardumont)

Jul 8 2020, 4:58 PM
douardda committed rDLDG8694ec70cda7: Small code simplification (authored by douardda).
Small code simplification
Jul 8 2020, 4:13 PM
douardda created D3464: Update the svn loader to new Revision API.
Jul 8 2020, 4:07 PM
douardda accepted D3463: loader: Adapt to swh-model >= 0.4.0.

lgtm

Jul 8 2020, 3:43 PM
douardda committed rDLDG0394f0f85edf: Adapt the git loader to swh-model >0.4 (authored by douardda).
Adapt the git loader to swh-model >0.4
Jul 8 2020, 3:13 PM
douardda closed D3454: Adapt the git loader to swh-model >0.4.
Jul 8 2020, 3:13 PM
douardda updated the diff for D3454: Adapt the git loader to swh-model >0.4.

small simplification as suggested by anlambert

Jul 8 2020, 3:11 PM
douardda updated the diff for D3454: Adapt the git loader to swh-model >0.4.

rebase

Jul 8 2020, 2:12 PM
douardda updated the diff for D3454: Adapt the git loader to swh-model >0.4.

fix MANIFEST.in to include the git bundle file

Jul 8 2020, 2:00 PM
douardda updated the diff for D3454: Adapt the git loader to swh-model >0.4.

bump dependencies

Jul 8 2020, 12:25 PM
douardda committed rCDFJd502f6eb5a6d: Merge both "pip install" RUN commands in tox/Dockerfile (authored by douardda).
Merge both "pip install" RUN commands in tox/Dockerfile
Jul 8 2020, 11:56 AM
douardda closed D3457: Merge both "pip install" RUN commands in tox/Dockerfile.
Jul 8 2020, 11:56 AM
douardda created D3457: Merge both "pip install" RUN commands in tox/Dockerfile.
Jul 8 2020, 10:18 AM
douardda accepted D3450: Use attr instead of NamedTuple to generate SWHID..

Thx, looks good. Just a small question about the modification of the swhid() API.

Jul 8 2020, 9:32 AM
douardda added a comment to D3454: Adapt the git loader to swh-model >0.4.
In D3454#84921, @olasd wrote:

Oh, that needs a requirements-swh.txt bump as well, I guess.

Jul 8 2020, 9:19 AM

Jul 7 2020

douardda created D3454: Adapt the git loader to swh-model >0.4.
Jul 7 2020, 7:03 PM
douardda added a comment to P718 (An Untitled Masterwork).
$ git cat-file -p v0.0.1
object 9768d0b576dbaaecd80abedad6dfd0d72f1476da
type commit
tag v0.0.1
tagger David Douard <david.douard@sdfa3.org> 1594138133 +0200
Jul 7 2020, 6:42 PM
douardda added a comment to P718 (An Untitled Masterwork).
git cat-file -p 9768d0b576dbaaecd80abedad6dfd0d72f1476da
tree f0695c2e2fa7ce9d574023c3413761a473e500ca
parent c3c588713233609f5bbbb2d9e7f3fb4a660f3f72
author Stefano Zacchiroli <zack@upsilon.cc> 1443083765 +0200
committer Stefano Zacchiroli <zack@upsilon.cc> 1443083765 +0200
Jul 7 2020, 6:41 PM
douardda created P718 (An Untitled Masterwork).
Jul 7 2020, 6:39 PM
douardda committed rDSTO5ab70237d6c3: Extract revision's extra_header as a top level attribute (authored by douardda).
Extract revision's extra_header as a top level attribute
Jul 7 2020, 5:18 PM
douardda closed D3426: Extract revision's extra_header as a top level attribute.
Jul 7 2020, 5:18 PM
douardda updated the diff for D3426: Extract revision's extra_header as a top level attribute.

fix sql schema and simplify the migration script

Jul 7 2020, 4:50 PM
douardda added inline comments to D3426: Extract revision's extra_header as a top level attribute.
Jul 7 2020, 4:45 PM
douardda updated the diff for D3426: Extract revision's extra_header as a top level attribute.

typos/fixes in 158.sql

Jul 7 2020, 3:35 PM
douardda updated the diff for D3426: Extract revision's extra_header as a top level attribute.

add a test for "true" bw compat, aka old data in pg still works

Jul 7 2020, 3:20 PM
douardda created P717 (An Untitled Masterwork).
Jul 7 2020, 2:21 PM
douardda accepted D3396: Add raw metadata to the model..

As said on IRC, I'd rather see this diff depends on D3417, but otherwise I'm fine with it.

Jul 7 2020, 11:26 AM
douardda requested changes to D3435: Add mercurial.from_disk.HgLoaderFromDisk.

knowing this is still WIP, I have a few comments/questions:

Jul 7 2020, 10:51 AM

Jul 6 2020

douardda updated the diff for D3426: Extract revision's extra_header as a top level attribute.

use frozen<list<list<blob>>> in cassandra's schema for extra_headers

Jul 6 2020, 6:55 PM
douardda added a comment to D3426: Extract revision's extra_header as a top level attribute.
In D3426#84268, @olasd wrote:

Looks nice.

Few comments:

  • I'm not convinced about the Cassandra storage of these. Have you tried a list <frozen list<blob>> or somesuch?
Jul 6 2020, 4:33 PM
douardda updated the diff for D3426: Extract revision's extra_header as a top level attribute.

bump deps for swh-journal and swh-core

Jul 6 2020, 2:35 PM
douardda updated the diff for D3426: Extract revision's extra_header as a top level attribute.

bump dep swh-model>= 0.4.0

Jul 6 2020, 2:21 PM
douardda committed rDMODa7d9aca2b0df: Extract the extra_headers from metadata on the Revision model class (authored by douardda).
Extract the extra_headers from metadata on the Revision model class
Jul 6 2020, 2:12 PM
douardda closed D3389: Extract the extra_headers from metadata on the Revision model class.
Jul 6 2020, 2:12 PM
douardda committed rDJNLbf35ea79d3cc: Refactor pytest_plugins.assert_all_objects_consumed() to use model objects (authored by douardda).
Refactor pytest_plugins.assert_all_objects_consumed() to use model objects
Jul 6 2020, 1:47 PM
douardda closed D3427: Refactor pytest_plugins.assert_all_objects_consumed() to use model objects.
Jul 6 2020, 1:47 PM
douardda updated the diff for D3427: Refactor pytest_plugins.assert_all_objects_consumed() to use model objects.

typos

Jul 6 2020, 1:42 PM
douardda added a comment to D3427: Refactor pytest_plugins.assert_all_objects_consumed() to use model objects.

ok

jsyk, noticed 2 typos in the commit message (and diff description):

  • "wee" -> "we"
  • "whic" -> "which"
Jul 6 2020, 1:40 PM
douardda updated the task description for T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object.
Jul 6 2020, 1:09 PM · Data Model
douardda created D3427: Refactor pytest_plugins.assert_all_objects_consumed() to use model objects.
Jul 6 2020, 1:01 PM
douardda updated the diff for D3389: Extract the extra_headers from metadata on the Revision model class.

split the 2 test functions in parts, and fix the bwcompat hook

Jul 6 2020, 11:59 AM
douardda added a comment to D3389: Extract the extra_headers from metadata on the Revision model class.
In D3389#83915, @olasd wrote:

Accepted, with the following caveat: I'm still not sure what the plan is to deploy this now? In the current state of this diff, as soon as we deploy this on workers, data loss will occur (as the new extra_headers field isn't supported by swh.storage).

Now that I think of it, introducing a new field in swh.model needs a coordinated deployment anyway so maybe it doesn't matter (as long as the swh.storage update to support the new field occurs before the next deployment).

Yes this needs a counterpart in storage (wip), and needs a coordinated deployment.

Jul 6 2020, 10:49 AM
douardda created D3426: Extract revision's extra_header as a top level attribute.
Jul 6 2020, 10:44 AM
douardda added a revision to T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object: D3426: Extract revision's extra_header as a top level attribute.
Jul 6 2020, 10:44 AM · Data Model
douardda added a comment to D3389: Extract the extra_headers from metadata on the Revision model class.
In D3389#83915, @olasd wrote:

Accepted, with the following caveat: I'm still not sure what the plan is to deploy this now? In the current state of this diff, as soon as we deploy this on workers, data loss will occur (as the new extra_headers field isn't supported by swh.storage).

Now that I think of it, introducing a new field in swh.model needs a coordinated deployment anyway so maybe it doesn't matter (as long as the swh.storage update to support the new field occurs before the next deployment).

Jul 6 2020, 10:11 AM
douardda accepted D3421: storage: Send metrics from the origin_add endpoint.
Jul 6 2020, 9:50 AM

Jul 3 2020

douardda added a comment to T2474: drop blake2 hashes.

not sure about the db space as an argument, but the CPU is by itself worth the move IMHO.

Jul 3 2020, 4:30 PM · Data Model, Storage manager
douardda updated the diff for D3389: Extract the extra_headers from metadata on the Revision model class.

rebase

Jul 3 2020, 11:57 AM
douardda requested changes to D3396: Add raw metadata to the model..

Any reason not to use type_validator() in these new schema entities?

Jul 3 2020, 11:33 AM
douardda accepted D3403: loader-svn: tests: Refactor svn-eol-style scenario.

LGTM but I'd prefer the commit message be self-contained, i.e. explicitly explain the unttest->pytest conversion rather then delegating this to a task in the forge.

Jul 3 2020, 10:54 AM
douardda accepted D3394: Improve test coverage and type coverage for copy_to.

Thx a lot, LGTM (can't say I've reviewed the tests very carefully though.)

Jul 3 2020, 10:41 AM

Jul 2 2020

douardda requested changes to D3394: Improve test coverage and type coverage for copy_to.

Looks globally fine to me, but I have a few comments/requests.

Jul 2 2020, 2:17 PM
douardda created P710 (An Untitled Masterwork).
Jul 2 2020, 12:31 PM

Jul 1 2020

douardda updated the diff for D3389: Extract the extra_headers from metadata on the Revision model class.

more mypy vs. attrs-strict fighting

Jul 1 2020, 6:02 PM
douardda updated the diff for D3389: Extract the extra_headers from metadata on the Revision model class.

make mypy happy (hopefully)

Jul 1 2020, 5:39 PM
douardda updated the diff for D3389: Extract the extra_headers from metadata on the Revision model class.

restrict extra_headers to (bytes, bytes) only

Jul 1 2020, 5:23 PM
douardda updated the diff for D3389: Extract the extra_headers from metadata on the Revision model class.

improve bw-compat support, tests and hypothesis strategies

Jul 1 2020, 4:04 PM
douardda added a revision to T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object: D3389: Extract the extra_headers from metadata on the Revision model class.
Jul 1 2020, 3:18 PM · Data Model
douardda created D3389: Extract the extra_headers from metadata on the Revision model class.
Jul 1 2020, 3:18 PM
douardda added a comment to T2471: NPM package angular-ts-manage fails to be properly loaded.

This origin (and others) also fails to be mirrored with errors like:

Jul 1 2020, 10:10 AM · Core Loader
douardda triaged T2471: NPM package angular-ts-manage fails to be properly loaded as High priority.
Jul 1 2020, 10:05 AM · Core Loader

Jun 30 2020

douardda added a comment to T1805: Public API v2.

Rereading this task, I have a few comments/questions.

Jun 30 2020, 2:24 PM · meta-task, Web app
douardda committed rDSTOdc1878b9fb50: Make release_add support adding the same object twice in the same call (authored by douardda).
Make release_add support adding the same object twice in the same call
Jun 30 2020, 10:07 AM
douardda closed D2771: Make release_add support adding the same object twice in the same call.
Jun 30 2020, 10:07 AM

Jun 29 2020

douardda added inline comments to D3355: Generalize origin_metadata to allow support for other object types in the future..
Jun 29 2020, 5:49 PM
douardda updated the diff for D2771: Make release_add support adding the same object twice in the same call.

kill redundant test

Jun 29 2020, 5:29 PM
douardda updated the diff for D2771: Make release_add support adding the same object twice in the same call.

rebase and improve a bit the test as well as the cassandra and in_memory backends

Jun 29 2020, 5:25 PM
douardda commandeered D2771: Make release_add support adding the same object twice in the same call.
Jun 29 2020, 5:16 PM
douardda created P704 (An Untitled Masterwork).
Jun 29 2020, 4:01 PM

Jun 26 2020

douardda accepted D3360: replayer: Drop obsolete fields from origin-visit.
Jun 26 2020, 10:16 AM
douardda accepted D3361: loader*: Drop obsolete origin visit fields.

lgtm

Jun 26 2020, 10:15 AM
douardda added inline comments to D3359: storage*: Open order parameter to origin-visit-get endpoint.
Jun 26 2020, 10:11 AM
douardda committed rDJNLaff31e54e966: Use proper hash ids in tests' journal_data (authored by douardda).
Use proper hash ids in tests' journal_data
Jun 26 2020, 9:23 AM
douardda closed D3353: Use proper hash ids in tests' journal_data.
Jun 26 2020, 9:23 AM
douardda added a comment to D3353: Use proper hash ids in tests' journal_data.

do you know where the old hashes came from?

Jun 26 2020, 9:22 AM

Jun 25 2020

douardda created D3353: Use proper hash ids in tests' journal_data.
Jun 25 2020, 1:03 PM
douardda closed T2422: Add an `object_type` attribute to model classes as Resolved.

closed by D3152

Jun 25 2020, 12:57 PM · Data Model
douardda triaged T2466: Re design the lazy loading feature for Content objects as Normal priority.
Jun 25 2020, 12:57 PM · Core Loader
douardda committed rDMODe632abed41c5: Tag model entities with their "object_type" (authored by douardda).
Tag model entities with their "object_type"
Jun 25 2020, 10:29 AM
douardda closed D3152: Tag model entities with their "type".
Jun 25 2020, 10:29 AM

Jun 24 2020

douardda updated the diff for D3152: Tag model entities with their "type".

update copyright

Jun 24 2020, 5:40 PM
douardda added inline comments to D3152: Tag model entities with their "type".
Jun 24 2020, 5:38 PM
douardda added inline comments to D3152: Tag model entities with their "type".
Jun 24 2020, 5:38 PM
douardda accepted D3346: tests: Relax checks on journal writes regarding origin-visit*/snapshot.

It's a bit sad we cannot generically use sets for this (because some model objects are still unhashable), so fine for me.

Jun 24 2020, 5:07 PM
douardda requested review of D3152: Tag model entities with their "type".

due to changes in from_disk, I'd rather have a new review on this.

Jun 24 2020, 4:37 PM
douardda updated the diff for D3152: Tag model entities with their "type".

improve commit message + use '_' in DiskBakedContent.object_type

Jun 24 2020, 4:33 PM
douardda updated the diff for D3152: Tag model entities with their "type".

Update the diff to make it work (see T2422)

Jun 24 2020, 4:01 PM
douardda added a comment to T2422: Add an `object_type` attribute to model classes.

Previously proposed "short-term" solution does not work. So the only "short-term" solution is to make DiskBakedContent inherit from BaseModel (or BaseContent).

Jun 24 2020, 3:45 PM · Data Model
douardda added a comment to T2422: Add an `object_type` attribute to model classes.

This task is currently blocked by an implementation "detail":

Jun 24 2020, 3:28 PM · Data Model
douardda committed rDWAPPSe8f3588ede5f: Do not use the returned value of storage.origin_add in tests (authored by douardda).
Do not use the returned value of storage.origin_add in tests
Jun 24 2020, 11:51 AM
douardda closed D3326: Do not use the returned value of storage.origin_add in tests.
Jun 24 2020, 11:51 AM