Page MenuHomeSoftware Heritage
Feed Advanced Search

Oct 4 2022

ardumont added a comment to P1469 content loader checksum mismatched.

Either the manifest referenced the wrong integrity for that urls (or someone updated in
place the urls...):

Oct 4 2022, 10:34 AM
ardumont created P1469 content loader checksum mismatched.
Oct 4 2022, 10:25 AM
ardumont added a comment to D8601: {Content|Directory}Loader: Register tasks.

fwiw, this is working in docker ^.

Oct 4 2022, 10:10 AM

Oct 3 2022

ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 3 2022, 6:41 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Run through docker for directory:

Oct 3 2022, 6:41 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 3 2022, 6:32 PM · Data Model, Nixguix loader
ardumont updated the diff for D8341: Add nixguix lister.

Fix documentation references and tests according to latest change in visit_type

Oct 3 2022, 6:27 PM
ardumont updated the diff for D8341: Add nixguix lister.

Use correct visit_type {content, directory} in the listed origins outputted from the
lister.

Oct 3 2022, 6:22 PM
ardumont updated the diff for D8341: Add nixguix lister.

zut, forgot to update the mock setup ¯\_(ツ)_/¯

Oct 3 2022, 6:16 PM
ardumont updated the diff for D8341: Add nixguix lister.

Use proper range of commits to update the diff

Oct 3 2022, 6:06 PM
ardumont accepted D8599: k8s-archive-production: Add an internal ingress to expose reaper webui.
Oct 3 2022, 6:04 PM
ardumont updated the diff for D8341: Add nixguix lister.

Fix missing integrity field (this makes for a redundant check in tests, adding
missing integrity to actual test correctly this entry)

Oct 3 2022, 6:02 PM
ardumont updated the diff for D8601: {Content|Directory}Loader: Register tasks.

Fix constructor mess

Oct 3 2022, 5:57 PM
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 3 2022, 5:46 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 3 2022, 5:46 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

docker run on the lister:

17:36:23 swh-scheduler@localhost:5433=# select now(), visit_type, lister_id, count(*) from listed_origins where lister_id = ( select id from listers where name='nixguix' and instance_name='nix-community.github.io') group by visit_type, lister_id;
+-------------------------------+------------+--------------------------------------+-------+
|              now              | visit_type |              lister_id               | count |
+-------------------------------+------------+--------------------------------------+-------+
| 2022-10-03 15:44:20.179895+00 | git        | 3f5c040a-6247-4ef3-a812-36f4b9ceafeb |     1 |
| 2022-10-03 15:44:20.179895+00 | file       | 3f5c040a-6247-4ef3-a812-36f4b9ceafeb |    87 |
| 2022-10-03 15:44:20.179895+00 | tar        | 3f5c040a-6247-4ef3-a812-36f4b9ceafeb | 31130 |
+-------------------------------+------------+--------------------------------------+-------+
(3 rows)
Oct 3 2022, 5:45 PM · Data Model, Nixguix loader
ardumont added inline comments to D8341: Add nixguix lister.
Oct 3 2022, 5:40 PM
ardumont updated the summary of D8341: Add nixguix lister.
Oct 3 2022, 5:39 PM
ardumont updated the diff for D8341: Add nixguix lister.
  • Adapt last review comment (reuse swh.core's mimetype dictionary)
  • Format warning log
  • Decrease log verbosity for one debug instruction (it was in info for test purposes)
  • Skip artifacts with missing integrity field too (nixpkgs dataset has some)
Oct 3 2022, 5:35 PM
ardumont closed D8603: Make mimetype to archive format dictionary public.
Oct 3 2022, 5:03 PM
ardumont committed rDCORE9a8292c12c5f: Make mimetype to archive format dictionary public (authored by ardumont).
Make mimetype to archive format dictionary public
Oct 3 2022, 5:03 PM
ardumont requested review of D8603: Make mimetype to archive format dictionary public.
Oct 3 2022, 5:01 PM
ardumont updated the summary of D8341: Add nixguix lister.
Oct 3 2022, 5:01 PM
ardumont added inline comments to D8341: Add nixguix lister.
Oct 3 2022, 5:00 PM
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8603: Make mimetype to archive format dictionary public.
Oct 3 2022, 4:59 PM · Data Model, Nixguix loader
ardumont added a comment to D8601: {Content|Directory}Loader: Register tasks.

ah! yeah, it's the first time we are scheduling ListedOrigins for non package loaders...
Thanks for raising this up!

Oct 3 2022, 4:52 PM
ardumont updated the diff for D8341: Add nixguix lister.

Adapt according to review (except for reusing the mimetype from swh.core.tarballs, i'll
do that in another iteration if you don't mind)

Oct 3 2022, 4:51 PM
ardumont added inline comments to D8341: Add nixguix lister.
Oct 3 2022, 4:49 PM
ardumont added inline comments to D8341: Add nixguix lister.
Oct 3 2022, 4:37 PM
ardumont accepted D8602: pre-commit: Fix tests data exclusion from codespell check.
Oct 3 2022, 4:10 PM
ardumont updated the diff for D8341: Add nixguix lister.

Add missing coverage

Oct 3 2022, 4:07 PM
ardumont closed D8587: {Content|Directory}Loader: Adapt support for checksums.
Oct 3 2022, 2:51 PM
ardumont committed rDLDBASE39c33a66c27c: {Content|Directory}Loader: Adapt support for checksums (authored by ardumont).
{Content|Directory}Loader: Adapt support for checksums
Oct 3 2022, 2:51 PM
ardumont requested review of D8601: {Content|Directory}Loader: Register tasks.
Oct 3 2022, 2:50 PM
ardumont accepted D8596: package/utils: Fix download function documentation.

oh yeah, correct, thx.
It's possible it used to so until it got extracted in the package loader instead.

Oct 3 2022, 2:49 PM
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8601: {Content|Directory}Loader: Register tasks.
Oct 3 2022, 2:46 PM · Data Model, Nixguix loader
ardumont updated the diff for D8587: {Content|Directory}Loader: Adapt support for checksums.

Adapt according to latest review

Oct 3 2022, 2:46 PM
ardumont closed D8584: Add Directory Loader to allow tarball ingestion as Directory.
Oct 3 2022, 2:37 PM
ardumont committed rDLDBASEdbf7f3dca0c8: Add Directory Loader to allow tarball ingestion as Directory (authored by ardumont).
Add Directory Loader to allow tarball ingestion as Directory
Oct 3 2022, 2:37 PM
ardumont added inline comments to D8587: {Content|Directory}Loader: Adapt support for checksums.
Oct 3 2022, 2:34 PM
ardumont updated the diff for D8584: Add Directory Loader to allow tarball ingestion as Directory.

Rebase

Oct 3 2022, 2:33 PM
ardumont updated the summary of D8341: Add nixguix lister.
Oct 3 2022, 2:28 PM
ardumont updated the diff for D8341: Add nixguix lister.

Rebase and adapt according to:

  • reviews
  • docker runs [1]
Oct 3 2022, 2:27 PM
ardumont created P1467 swh.lister.nixguix run with tarball detection....
Oct 3 2022, 2:26 PM
ardumont added a comment to D8598: Fix compatibility issue with latest dependency version.

Tag and released v1.2.3, jenkins up to you now, tia!

Oct 3 2022, 2:02 PM
ardumont closed D8598: Fix compatibility issue with latest dependency version.
Oct 3 2022, 2:01 PM
ardumont committed rDSCH17c6d488168b: Fix compatibility issue with latest dependency version (authored by ardumont).
Fix compatibility issue with latest dependency version
Oct 3 2022, 2:01 PM
ardumont retitled D8598: Fix compatibility issue with latest dependency version from Restrict importlib_metadata to a non-breaking version to Fix compatibility issue with latest dependency version.
Oct 3 2022, 2:00 PM
ardumont updated the diff for D8598: Fix compatibility issue with latest dependency version.

Adapt according to reviews

Oct 3 2022, 1:59 PM
ardumont added a comment to D8598: Fix compatibility issue with latest dependency version.

ack on what you said both ;)

Oct 3 2022, 1:57 PM
ardumont added inline comments to D8587: {Content|Directory}Loader: Adapt support for checksums.
Oct 3 2022, 1:56 PM
ardumont requested review of D8598: Fix compatibility issue with latest dependency version.
Oct 3 2022, 1:54 PM
ardumont planned changes to D8341: Add nixguix lister.

Docker run made apparent that determining the urls' nature is not so simple...
I'll amend accordingly and take into account review suggestions.

Oct 3 2022, 10:21 AM
ardumont added inline comments to D8341: Add nixguix lister.
Oct 3 2022, 10:20 AM
ardumont updated the summary of D8341: Add nixguix lister.
Oct 3 2022, 10:07 AM

Oct 2 2022

ardumont updated the summary of D8341: Add nixguix lister.
Oct 2 2022, 10:04 PM
ardumont updated the diff for D8341: Add nixguix lister.

Fix build

Oct 2 2022, 8:06 PM
ardumont updated the diff for D8341: Add nixguix lister.

Rebase (this includes a work around to fix master build)

Oct 2 2022, 7:45 PM
ardumont added a comment to D8587: {Content|Directory}Loader: Adapt support for checksums.

Maybe a new upstream version bump on importlib-metadata?

[1]

...
importlib-metadata==5.0.0
Oct 2 2022, 7:37 PM
ardumont updated the summary of D8587: {Content|Directory}Loader: Adapt support for checksums.
Oct 2 2022, 11:51 AM
ardumont added a comment to D8587: {Content|Directory}Loader: Adapt support for checksums.

Build has FAILED

Oct 2 2022, 11:40 AM
ardumont added inline comments to D8584: Add Directory Loader to allow tarball ingestion as Directory.
Oct 2 2022, 11:38 AM
ardumont retitled D8587: {Content|Directory}Loader: Adapt support for checksums from ContentLoader: Improve integrity check support to {Content|Directory}Loader: Adapt support for checksums.
Oct 2 2022, 11:36 AM
ardumont updated the diff for D8587: {Content|Directory}Loader: Adapt support for checksums.

Adapt both Content and Directory loaders to deal with checksums dict instead of
integrity.

Oct 2 2022, 11:36 AM

Oct 1 2022

ardumont added inline comments to D8584: Add Directory Loader to allow tarball ingestion as Directory.
Oct 1 2022, 4:53 PM
ardumont updated the diff for D8341: Add nixguix lister.
  • nixguix: Transform integrity field into a checksums dict
  • nixguix: Register task
  • nixguix: Add a small lister introduction
Oct 1 2022, 4:43 PM

Sep 30 2022

ardumont added inline comments to D8584: Add Directory Loader to allow tarball ingestion as Directory.
Sep 30 2022, 5:50 PM
ardumont added inline comments to D8341: Add nixguix lister.
Sep 30 2022, 5:44 PM
ardumont added inline comments to D8584: Add Directory Loader to allow tarball ingestion as Directory.
Sep 30 2022, 5:34 PM
ardumont added inline comments to D8584: Add Directory Loader to allow tarball ingestion as Directory.
Sep 30 2022, 5:26 PM
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 30 2022, 5:16 PM · Data Model, Nixguix loader
ardumont added inline comments to D8584: Add Directory Loader to allow tarball ingestion as Directory.
Sep 30 2022, 5:14 PM
ardumont updated the diff for D8587: {Content|Directory}Loader: Adapt support for checksums.
  • Build content out of MultiHash
  • Add support for sha512 hash
  • Fix input test checksum
Sep 30 2022, 5:05 PM
ardumont retitled D8587: {Content|Directory}Loader: Adapt support for checksums from ContentLoader: Fix integrity check to ContentLoader: Improve integrity check support.
Sep 30 2022, 5:04 PM
ardumont added inline comments to D8587: {Content|Directory}Loader: Adapt support for checksums.
Sep 30 2022, 5:01 PM
ardumont added inline comments to D8587: {Content|Directory}Loader: Adapt support for checksums.
Sep 30 2022, 4:00 PM
ardumont retitled D8341: Add nixguix lister from nixguix: Boostrap lister to Add nixguix lister.
Sep 30 2022, 3:32 PM
ardumont requested review of D8587: {Content|Directory}Loader: Adapt support for checksums.
Sep 30 2022, 3:20 PM
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8587: {Content|Directory}Loader: Adapt support for checksums.
Sep 30 2022, 3:16 PM · Data Model, Nixguix loader
ardumont accepted D8586: hashutil: Add sha512 support in MultiHash class.

That's also something needed for the ongoing directory/content loader [1]

Sep 30 2022, 3:15 PM
ardumont updated the diff for D8584: Add Directory Loader to allow tarball ingestion as Directory.

Refactoring step

Sep 30 2022, 2:47 PM
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 30 2022, 2:33 PM · Data Model, Nixguix loader
ardumont closed D8581: Add Content Loader to ingest raw content file.
Sep 30 2022, 2:32 PM
ardumont committed rDLDBASEf774aba59e65: Add Content Loader to ingest raw content file (authored by ardumont).
Add Content Loader to ingest raw content file
Sep 30 2022, 2:32 PM
ardumont requested review of D8584: Add Directory Loader to allow tarball ingestion as Directory.
Sep 30 2022, 2:31 PM
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 30 2022, 2:29 PM · Data Model, Nixguix loader
ardumont updated the diff for D8581: Add Content Loader to ingest raw content file.

Rebase

Sep 30 2022, 2:27 PM
ardumont updated the summary of D8341: Add nixguix lister.
Sep 30 2022, 11:57 AM
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 30 2022, 11:56 AM · Data Model, Nixguix loader
ardumont added inline comments to D8581: Add Content Loader to ingest raw content file.
Sep 30 2022, 11:56 AM
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8584: Add Directory Loader to allow tarball ingestion as Directory.
Sep 30 2022, 11:56 AM · Data Model, Nixguix loader
ardumont abandoned D8582: Allow more checksum computations in Content model.

Thanks! I'll close this.

Sep 30 2022, 10:52 AM
ardumont added a comment to D8582: Allow more checksum computations in Content model.

Model objects should map 1:1 with what is stored in the archive.

Sep 30 2022, 10:45 AM
ardumont added a comment to D8582: Allow more checksum computations in Content model.

Why are the changes to the model object needed, instead of just hashing the file directly?

Sep 30 2022, 10:45 AM
ardumont added inline comments to D8582: Allow more checksum computations in Content model.
Sep 30 2022, 10:43 AM
ardumont updated the diff for D8581: Add Content Loader to ingest raw content file.

Refactoring steps

Sep 30 2022, 10:22 AM
ardumont updated the diff for D8581: Add Content Loader to ingest raw content file.

Compute expected checksum to check integrity outside the loop

Sep 30 2022, 10:13 AM
ardumont updated the task description for T4385: Federate prometheus instances through thanos.
Sep 30 2022, 9:52 AM · meta-task, System administration, Roadmap 2022
ardumont updated the task description for T4385: Federate prometheus instances through thanos.
Sep 30 2022, 9:52 AM · meta-task, System administration, Roadmap 2022
ardumont accepted D8571: from_disks: fix some of the pattern checking logic.

I went for a minimal changes for that untested code as I am mostly planning a full
overhaul of that area. So I did not spent to effort to add new test (the code isn't
currently directly tested either as far as I understand).

Sep 30 2022, 9:49 AM