Page MenuHomeSoftware Heritage
Feed Advanced Search

Apr 19 2021

douardda committed rDPROVf2ffd718c468: Rename synthetic result test files (authored by douardda).
Rename synthetic result test files
Apr 19 2021, 4:56 PM
douardda committed rDPROV594490576b00: Improve and rename test_provenance_db() as test_probenance_heuristics() (authored by douardda).
Improve and rename test_provenance_db() as test_probenance_heuristics()
Apr 19 2021, 4:56 PM
douardda committed rDPROV5e89689ef08b: Also test the provenance db with ArchiveStorage (authored by douardda).
Also test the provenance db with ArchiveStorage
Apr 19 2021, 4:56 PM
douardda closed D5387: Refactor the model and simplify a bit origin.py.
Apr 19 2021, 4:56 PM
douardda committed rDPROV7eaeebb6091a: Simplify a bit origin.py (authored by douardda).
Simplify a bit origin.py
Apr 19 2021, 4:56 PM
douardda committed rDPROVa23a33c5a77d: Refactor the model (authored by douardda).
Refactor the model
Apr 19 2021, 4:56 PM
douardda committed rDPROV8853314af981: Add a test for the (noroot, upper) case (authored by douardda).
Add a test for the (noroot, upper) case
Apr 19 2021, 4:55 PM
douardda closed D5337: Add a test to compare the result of revision_add() with known results.
Apr 19 2021, 4:55 PM
douardda committed rDPROVadbc99dd357d: Add a test to compare the result of revision_add() with known results (authored by douardda).
Add a test to compare the result of revision_add() with known results
Apr 19 2021, 4:55 PM
douardda updated the diff for D5389: Improve tests.

rebas

Apr 19 2021, 4:45 PM
douardda updated the diff for D5388: Also test the provenance db with ArchiveStorage.

rebase

Apr 19 2021, 4:44 PM
douardda updated the diff for D5387: Refactor the model and simplify a bit origin.py.

rebased

Apr 19 2021, 4:43 PM
douardda updated the diff for D5337: Add a test to compare the result of revision_add() with known results.

rebased

Apr 19 2021, 4:42 PM
douardda committed rDPROV62617e500649: Enforce black version 19.10b0 in tox to be consistent with pre-commit (authored by douardda).
Enforce black version 19.10b0 in tox to be consistent with pre-commit
Apr 19 2021, 4:41 PM
douardda added a comment to T3269: Investigate scheduling policy for fsfe's gitea.

The loading tasks created during this first listing were oneshot tasks. So they have been modified to recurring tasks with something like:

Apr 19 2021, 3:29 PM · Origin-Gitea/Gogs
douardda added a comment to T2602: Investigate how to upgrade the schema of the Cassandra storage.

Doesn't this deserve a state-of-the-art kind of thing? Are there documentation material on the subject? How does other (big) cassandra users handle this?

Apr 19 2021, 2:14 PM · Storage manager
douardda added a comment to T3269: Investigate scheduling policy for fsfe's gitea.

The listing task has been disabled, I think because of failures in the last executions:

Apr 19 2021, 11:51 AM · Origin-Gitea/Gogs
douardda triaged T3269: Investigate scheduling policy for fsfe's gitea as High priority.
Apr 19 2021, 10:19 AM · Origin-Gitea/Gogs
douardda added a comment to T3084: Fast track save code now requests.

is there a grafana dashboard dedicated to this queue?

Apr 19 2021, 10:14 AM · System administration, Web app
douardda added a comment to T3246: Document takedown request processing workflow.

also: what about exports we provide on git annex?

Apr 19 2021, 10:10 AM · Archive content
douardda added a comment to T3246: Document takedown request processing workflow.

do we also intent to have a takedown topic on kafka?

Apr 19 2021, 10:09 AM · Archive content

Apr 9 2021

douardda created P1003 (An Untitled Masterwork).
Apr 9 2021, 4:04 PM

Apr 8 2021

douardda added a comment to T3198: Mirror: unexpected closed connection to the pg server.

Just got this one below. Note that this occurred just when the replayer actually started to insert object in the storage (before that, since the start of the replayer process, only kafka scaffolding took place for quite some time, around 30mn!)

Apr 8 2021, 12:02 PM · Mirror
douardda triaged T3218: The graph replayer generates REQTMOUT Timeout errors as High priority.
Apr 8 2021, 11:44 AM · Mirror

Apr 7 2021

douardda added a comment to T3214: Restrict accepted timestamps to values that can be processed all along.

looks like there is no revision with date or committer_date > 9999-12-31 in the main storage...

Apr 7 2021, 3:04 PM · Data Model
douardda triaged T3214: Restrict accepted timestamps to values that can be processed all along as High priority.
Apr 7 2021, 2:30 PM · Data Model

Apr 6 2021

douardda closed T3201: Mirror: unsupported Unicode escape sequence as Resolved by committing rDSTO39507b24d0f4: Make the replayer drop the Revision.metadata.
Apr 6 2021, 4:42 PM · Mirror
douardda closed D5414: Make the replayer drop the Revision.metadata.
Apr 6 2021, 4:42 PM
douardda closed T3201: Mirror: unsupported Unicode escape sequence, a subtask of T3197: Mirror: fix common issues of a replayer session, as Resolved.
Apr 6 2021, 4:42 PM · Mirror
douardda committed rDSTO39507b24d0f4: Make the replayer drop the Revision.metadata (authored by douardda).
Make the replayer drop the Revision.metadata
Apr 6 2021, 4:42 PM
douardda committed rDSTO84dcbe3d0e56: Merge test_replay's _check_replayed and check_replayed in a single function (authored by douardda).
Merge test_replay's _check_replayed and check_replayed in a single function
Apr 6 2021, 4:42 PM
douardda closed D5413: Make pg Storage.extid_add() write extid objects to the journal.
Apr 6 2021, 4:42 PM
douardda committed rDSTO36a7fd34f3ba: Fix pg Storage.extid_add(): write ExtID objects to the journal (authored by douardda).
Fix pg Storage.extid_add(): write ExtID objects to the journal
Apr 6 2021, 4:42 PM
douardda retitled D5413: Make pg Storage.extid_add() write extid objects to the journal from Make pg Strorage.extid_add() write extid objects to the journal to Make pg Storage.extid_add() write extid objects to the journal.
Apr 6 2021, 4:33 PM
douardda updated the diff for D5414: Make the replayer drop the Revision.metadata.

fix commit message

Apr 6 2021, 4:32 PM
douardda added a comment to D5413: Make pg Storage.extid_add() write extid objects to the journal.

Could you add a test for the storage? All other *_add have a journal test IIRC

let me check that

Apr 6 2021, 4:09 PM
douardda updated the diff for D5414: Make the replayer drop the Revision.metadata.

rebased

Apr 6 2021, 4:08 PM
douardda updated the diff for D5413: Make pg Storage.extid_add() write extid objects to the journal.

Add explicit checks for extid being written in the journal and split the revision in 2

Apr 6 2021, 4:07 PM
douardda added a comment to D5413: Make pg Storage.extid_add() write extid objects to the journal.

Could you add a test for the storage? All other *_add have a journal test IIRC

Apr 6 2021, 3:39 PM
douardda added a comment to D5413: Make pg Storage.extid_add() write extid objects to the journal.

lgtm

(I would have made that 2 commits with each its own perimeter, 1 for the actual perimeter, 1 to refactor the test, but whatever)

Apr 6 2021, 3:38 PM
douardda added inline comments to D5414: Make the replayer drop the Revision.metadata.
Apr 6 2021, 3:33 PM
douardda triaged T3209: Fix swh-scanner for python > 3.7 as High priority.
Apr 6 2021, 11:55 AM · Code scanner
douardda committed rDSNIP78408668a12f: Add the weekly-planning.sh script (authored by douardda).
Add the weekly-planning.sh script
Apr 6 2021, 10:19 AM

Apr 2 2021

douardda requested review of D5414: Make the replayer drop the Revision.metadata.
Apr 2 2021, 4:26 PM
douardda requested review of D5413: Make pg Storage.extid_add() write extid objects to the journal.
Apr 2 2021, 4:20 PM
douardda added a comment to T3197: Mirror: fix common issues of a replayer session.

Currently, the mirror test session is running with:

Apr 2 2021, 10:15 AM · Mirror
douardda added a comment to T3201: Mirror: unsupported Unicode escape sequence.

easy fix: modify the replayer to ignore this 'metadata' column while inserting revisions

Apr 2 2021, 10:05 AM · Mirror
douardda added a comment to T3201: Mirror: unsupported Unicode escape sequence.
09:45 <+vlorentz> douardda: yes and the only way around it (short of dropping data) is T3089
09:46 -swhbot:#swh-devel- T3089 (submitter: vlorentz, owner: vlorentz, status: Open): Remove the 'metadata' column of the 'revision' table <https://forge.softwareheritage.org/T3089>
09:46 <+vlorentz> or switching to cassandra
09:46 <+vlorentz> the good news is, they couldn't be inserted in the storage either, so you can safely drop them for now
Apr 2 2021, 9:59 AM · Mirror
douardda triaged T3201: Mirror: unsupported Unicode escape sequence as High priority.
Apr 2 2021, 9:54 AM · Mirror
douardda triaged T3200: Mirror: year is out of range as High priority.
Apr 2 2021, 9:51 AM · Mirror
douardda triaged T3199: Mirror: key value violates unique constraint "person_fullname_idx" as High priority.
Apr 2 2021, 9:48 AM · Mirror
douardda triaged T3198: Mirror: unexpected closed connection to the pg server as High priority.
Apr 2 2021, 9:47 AM · Mirror
douardda triaged T3197: Mirror: fix common issues of a replayer session as High priority.
Apr 2 2021, 9:41 AM · Mirror
douardda created P998 (An Untitled Masterwork).
Apr 2 2021, 9:34 AM

Apr 1 2021

douardda created P996 (An Untitled Masterwork).
Apr 1 2021, 12:48 PM
douardda created P995 (An Untitled Masterwork).
Apr 1 2021, 11:11 AM

Mar 31 2021

douardda added inline comments to D5387: Refactor the model and simplify a bit origin.py.
Mar 31 2021, 3:24 PM
douardda added a comment to D5387: Refactor the model and simplify a bit origin.py.

test coverage of the code touched by this diff isn't great

Mar 31 2021, 3:22 PM
douardda added inline comments to D5388: Also test the provenance db with ArchiveStorage.
Mar 31 2021, 3:20 PM

Mar 30 2021

douardda added a comment to D5389: Improve tests.

Why .hex() everywhere? Does swh-provenance use hex strings internally?

Mar 30 2021, 6:51 PM
douardda added reviewers for D5389: Improve tests: zack, grouss.
Mar 30 2021, 5:36 PM
douardda requested review of D5389: Improve tests.
Mar 30 2021, 5:34 PM
douardda requested review of D5388: Also test the provenance db with ArchiveStorage.
Mar 30 2021, 5:32 PM
douardda requested review of D5387: Refactor the model and simplify a bit origin.py.
Mar 30 2021, 5:31 PM
douardda added inline comments to D5337: Add a test to compare the result of revision_add() with known results.
Mar 30 2021, 11:02 AM

Mar 29 2021

douardda added reviewers for D5337: Add a test to compare the result of revision_add() with known results: zack, grouss.
Mar 29 2021, 12:20 PM
douardda accepted D5363: extid: remove unicity on (extid_type, extid) and (target_type, target).

looks indeed reasonable (both the 1. point and the code) thanks

Mar 29 2021, 11:33 AM

Mar 26 2021

douardda updated the diff for D5337: Add a test to compare the result of revision_add() with known results.

rebase

Mar 26 2021, 4:21 PM
douardda committed rDPROV4a5a99ea7d20: Add missing mypy.ini entry for iso8601 (authored by douardda).
Add missing mypy.ini entry for iso8601
Mar 26 2021, 4:20 PM
douardda updated the diff for D5337: Add a test to compare the result of revision_add() with known results.

rebase

Mar 26 2021, 2:58 PM
douardda committed rDPROV877a8a02b5ed: Add missing dependency on iso8601 (authored by douardda).
Add missing dependency on iso8601
Mar 26 2021, 2:57 PM
douardda updated the diff for D5337: Add a test to compare the result of revision_add() with known results.

rebased

Mar 26 2021, 2:49 PM
douardda committed rDPROV41bc4cef338c: Fix invalid extra dependency on swh-core (authored by douardda).
Fix invalid extra dependency on swh-core
Mar 26 2021, 2:48 PM
douardda updated the diff for D5337: Add a test to compare the result of revision_add() with known results.

apply vlorentz comments

Mar 26 2021, 12:09 PM
douardda added inline comments to D5337: Add a test to compare the result of revision_add() with known results.
Mar 26 2021, 11:03 AM
douardda created P991 (An Untitled Masterwork).
Mar 26 2021, 9:52 AM

Mar 25 2021

douardda updated the diff for D5337: Add a test to compare the result of revision_add() with known results.

refactor a bit the test

Mar 25 2021, 3:11 PM
douardda requested review of D5337: Add a test to compare the result of revision_add() with known results.
Mar 25 2021, 3:04 PM
douardda committed rDPROVeb524713c3d6: Refactor ProvenanceWithPathDB.insert_location() (authored by douardda).
Refactor ProvenanceWithPathDB.insert_location()
Mar 25 2021, 10:16 AM
douardda committed rDPROV15e390fb66c9: Add tests for revision_add() and content_find_first() (authored by douardda).
Add tests for revision_add() and content_find_first()
Mar 25 2021, 10:16 AM
douardda committed rDPROV2ec4a0ab9da6: Make ArchivePostgreSQL.directory_ls_internal close the db cursor (authored by douardda).
Make ArchivePostgreSQL.directory_ls_internal close the db cursor
Mar 25 2021, 10:16 AM
douardda committed rDPROVe66d94925459: Fix the example config in the README file (authored by douardda).
Fix the example config in the README file
Mar 25 2021, 10:16 AM
douardda closed D5234: Refactor ProvenanceWithPathDB.insert_location() and add a test for revision_add().
Mar 25 2021, 10:16 AM
douardda committed rDPROVc5985c3085c0: Enforce tz-aware datetime value for RevisionEntry.date (authored by douardda).
Enforce tz-aware datetime value for RevisionEntry.date
Mar 25 2021, 10:16 AM

Mar 24 2021

douardda updated the diff for D5234: Refactor ProvenanceWithPathDB.insert_location() and add a test for revision_add().

Add forgotten test data file CMDBTS.msgpack

Mar 24 2021, 3:38 PM
douardda created P986 (An Untitled Masterwork).
Mar 24 2021, 2:46 PM
douardda updated the diff for D5234: Refactor ProvenanceWithPathDB.insert_location() and add a test for revision_add().

apply aeviso comments and reorder (and fix) revisions

Mar 24 2021, 11:24 AM

Mar 23 2021

douardda added inline comments to D5234: Refactor ProvenanceWithPathDB.insert_location() and add a test for revision_add().
Mar 23 2021, 5:41 PM
douardda updated the diff for D5234: Refactor ProvenanceWithPathDB.insert_location() and add a test for revision_add().

Rebase and use test dataset from the CMDBTS git repo

Mar 23 2021, 2:44 PM

Mar 18 2021

douardda created P978 (An Untitled Masterwork).
Mar 18 2021, 6:43 PM
douardda updated the diff for D5234: Refactor ProvenanceWithPathDB.insert_location() and add a test for revision_add().

rebase and remove the dependency on pytz

Mar 18 2021, 6:40 PM
douardda added inline comments to D5234: Refactor ProvenanceWithPathDB.insert_location() and add a test for revision_add().
Mar 18 2021, 6:35 PM
douardda added inline comments to D5234: Refactor ProvenanceWithPathDB.insert_location() and add a test for revision_add().
Mar 18 2021, 6:31 PM
douardda accepted D5259: deposit:config: Update deposit server config with swh_authority_url.
Mar 18 2021, 3:54 PM
douardda accepted D4616: common/archive: Add branch names filtering support in lookup_snapshot.

LGTM, but I would have loved to see a test dedicated to branches (and filters) with non-ascii chars (ach time I see a <str>.encode() I expect the worst... )

Mar 18 2021, 12:14 PM

Mar 17 2021

douardda requested changes to D5259: deposit:config: Update deposit server config with swh_authority_url.

Can you add a line in the commit message explaining what this swh_authority_url config entry is for? Because il looks weird to add use a real URL as value in a docker test environment...

Mar 17 2021, 11:37 AM

Mar 16 2021

douardda closed D5244: Update exporters.edged to swh.model 1.0.
Mar 16 2021, 4:18 PM
douardda committed rDDATASET8a164beb6a84: Update exporters.edged to swh.model 1.0 (authored by douardda).
Update exporters.edged to swh.model 1.0
Mar 16 2021, 4:18 PM

Mar 15 2021

douardda accepted D5239: Add deposit info to objects added to swh-storage from metadata-only deposits.
Mar 15 2021, 3:13 PM
douardda requested changes to D5239: Add deposit info to objects added to swh-storage from metadata-only deposits.

Otherwise LGTM. I'd really like a better commit message, and probably some documentation somewhere (in docs/ maybe?) explaining these 2 levels of metadata, especially documenting the second layer, since it's crafted by the deposit.

Mar 15 2021, 1:50 PM