Oct 6 2022

ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 6 2022, 2:31 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 6 2022, 2:08 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8636: ContentLoader: Allow nar computation checks.
Oct 6 2022, 1:03 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 6 2022, 10:53 AM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Finally, a more concentrated frequency dict:

Oct 6 2022, 9:17 AM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

It should be more interesting to read with frequencies [1]:

Oct 6 2022, 9:10 AM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Out of the paste [1] (csv extract from swh-scheduler dev db after 3 lister runs on
docker), here is the state of detected files [2] so far (computed with [3]):

Oct 6 2022, 9:04 AM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 6 2022, 8:50 AM · Data Model, Nixguix loader

Oct 5 2022

ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 4:57 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8632: nixguix: Refactor by renaming success or failure the different datasets.
Oct 5 2022, 4:51 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8631: nixguix: Deal with manifest entries without an integrity field.
Oct 5 2022, 4:01 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 3:06 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 3:00 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 2:53 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8630: {Cnt|Dir}Loader: Fix standard/nar hash mismatch behavior to fail loading.
Oct 5 2022, 2:48 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 12:04 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8627: base-buster/Dockerfile: Install nix binaries in buster image.
Oct 5 2022, 11:57 AM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8626: nixguix: Improve is_tarball detection pattern.
Oct 5 2022, 11:53 AM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8625: docker: Install nix binaries in swh/stack image.
Oct 5 2022, 11:30 AM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 11:25 AM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8624: nixguix: Improve further tarball detection.
Oct 5 2022, 11:12 AM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8623: core.tarball: Add missing mimetype to the list.
Oct 5 2022, 11:09 AM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 10:34 AM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8621: wip: Add nixguix lister and loader.
Oct 5 2022, 10:34 AM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 10:11 AM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 5 2022, 10:06 AM · Data Model, Nixguix loader

Oct 4 2022

ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8620: nixguix: Improve git origins detection.
Oct 4 2022, 11:32 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8619: nixguix: Improve tarball detection.
Oct 4 2022, 8:44 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8618: DirectoryLoader: Check nar hashes when provided.
Oct 4 2022, 7:33 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 4 2022, 4:53 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8614: nixguix: Add support for listing origins with "recursive" integrity.
Oct 4 2022, 4:50 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Another data point about ^. It's not important for the guix manifest [1]. We can keep a
compatible behavior for it and slightly improve the listing behavior for nixpkgs as it's
important for those [2].

Oct 4 2022, 4:33 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 4 2022, 4:17 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

With the gazillion new diffs on top of the origin lister code, we can now also list
the nixpkgs-unstable-full.json manifests [1]

Oct 4 2022, 4:17 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 4 2022, 4:05 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8612: nixguix: Add support for pseudo url with missing schema.
Oct 4 2022, 4:01 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8611: nixguix: Deal with connection error with server.
Oct 4 2022, 2:57 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8610: nixguix: Refactor by cleaning up unneeded code.
Oct 4 2022, 2:40 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 4 2022, 2:37 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 4 2022, 2:19 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8607: nixguix: Deal with impossible communication with server.
Oct 4 2022, 2:08 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8606: nixguix: Deal with mistyped origins.
Oct 4 2022, 1:02 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Another one bites the dust [1]

Oct 4 2022, 11:51 AM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 4 2022, 11:02 AM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8605: nixguix: Randomize order of listed origins.
Oct 4 2022, 10:59 AM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

For the content loader, I mostly get checksum mismatches [1].
Either the integrity from the manifest is wrong, or some in-place update occurred on the respective servers [2].
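
To illustrate the kind of check that fails here, a minimal sketch, assuming the manifest's integrity field is an SRI-style string of the form "<algo>-<base64 digest>". The function names below are hypothetical and are not the loader's actual API:

```python
import base64
import hashlib


def make_integrity(data: bytes, algo: str = "sha256") -> str:
    """Build an SRI-style "<algo>-<base64 digest>" string (hypothetical helper)."""
    digest = hashlib.new(algo, data).digest()
    return f"{algo}-{base64.b64encode(digest).decode('ascii')}"


def check_integrity(data: bytes, integrity: str) -> bool:
    """Return True when `data` hashes to the digest carried by `integrity`.

    Assumption: `integrity` looks like "sha256-<base64>", as in SRI strings.
    """
    algo, _, b64_digest = integrity.partition("-")
    expected = base64.b64decode(b64_digest)
    actual = hashlib.new(algo, data).digest()
    return actual == expected


if __name__ == "__main__":
    content = b"release tarball bytes"
    integrity = make_integrity(content)
    # Unchanged content matches the recorded integrity.
    print(check_integrity(content, integrity))
    # An in-place update on the server changes the bytes, so the check fails.
    print(check_integrity(content + b" (rebuilt)", integrity))
```

Under that assumption, a mismatch only says that the downloaded bytes differ from what the manifest recorded; it cannot tell a wrong manifest entry from an upstream in-place update.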

Oct 4 2022, 10:36 AM · Data Model, Nixguix loader

Oct 3 2022

ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 3 2022, 6:41 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Run through docker for directory:

Oct 3 2022, 6:41 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 3 2022, 6:32 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 3 2022, 5:46 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Oct 3 2022, 5:46 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

docker run on the lister:

17:36:23 swh-scheduler@localhost:5433=# select now(), visit_type, lister_id, count(*) from listed_origins where lister_id = ( select id from listers where name='nixguix' and instance_name='nix-community.github.io') group by visit_type, lister_id;
+-------------------------------+------------+--------------------------------------+-------+
|              now              | visit_type |              lister_id               | count |
+-------------------------------+------------+--------------------------------------+-------+
| 2022-10-03 15:44:20.179895+00 | git        | 3f5c040a-6247-4ef3-a812-36f4b9ceafeb |     1 |
| 2022-10-03 15:44:20.179895+00 | file       | 3f5c040a-6247-4ef3-a812-36f4b9ceafeb |    87 |
| 2022-10-03 15:44:20.179895+00 | tar        | 3f5c040a-6247-4ef3-a812-36f4b9ceafeb | 31130 |
+-------------------------------+------------+--------------------------------------+-------+
(3 rows)
Oct 3 2022, 5:45 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8603: Make mimetype to archive format dictionary public.
Oct 3 2022, 4:59 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8601: {Content|Directory}Loader: Register tasks.
Oct 3 2022, 2:46 PM · Data Model, Nixguix loader

Sep 30 2022

ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 30 2022, 5:16 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8587: {Content|Directory}Loader: Adapt support for checksums.
Sep 30 2022, 3:16 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 30 2022, 2:33 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 30 2022, 2:29 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 30 2022, 11:56 AM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8584: Add Directory Loader to allow tarball ingestion as Directory.
Sep 30 2022, 11:56 AM · Data Model, Nixguix loader

Sep 29 2022

ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8582: Allow more checksum computations in Content model.
Sep 29 2022, 5:57 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 29 2022, 4:54 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 29 2022, 4:53 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 29 2022, 4:53 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 29 2022, 4:28 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8581: Add Content Loader to ingest raw content file.
Sep 29 2022, 4:21 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 29 2022, 4:19 PM · Data Model, Nixguix loader

Sep 23 2022

ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Hmm, I have to check the 7 false ones. The 88 packages with no-origin are more
annoying. Some are metapackages such as gcc-toolchain, so they can be skipped. Is it
OK with you to keep this 'no-origin' type? For some of the others, I have to check
whether they are covered elsewhere.

Sep 23 2022, 3:48 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

For ^, something like this would do [1]

Sep 23 2022, 3:38 PM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Thanks for all that ^! And great pointers!

Sep 23 2022, 3:09 PM · Data Model, Nixguix loader
zimoun added a comment to T3781: Replace the Nixguix loader with a lister.
  • artifact URLs, which are mostly tarballs [1] and sometimes files [2]
  • DVCS repositories, delegated to their dedicated loaders for ingestion: svn [3], hg [4], git [5] (out of the guix manifest)
  • Other stuff can be ignored as we don't have anything relevant to ingest [6]
Sep 23 2022, 12:46 PM · Data Model, Nixguix loader
ardumont added a subtask for T3781: Replace the Nixguix loader with a lister: T3294: nixguix: Add support for pseudo-URLs with a missing schema.
Sep 23 2022, 11:23 AM · Data Model, Nixguix loader

Sep 7 2022

ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 7 2022, 12:02 PM · Data Model, Nixguix loader

Sep 6 2022

ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 6 2022, 7:07 PM · Data Model, Nixguix loader
ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8406: [WIP] archive.loader: Allow archive loader to deal with nixguix archives.
Sep 6 2022, 6:21 PM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 6 2022, 10:39 AM · Data Model, Nixguix loader
ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Sep 6 2022, 10:32 AM · Data Model, Nixguix loader
ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Some more information regarding extensions supported in nixpkgs and guix manifests:

In [33]: sources = "https://nix-community.github.io/nixpkgs-swh/sources-unstable.json"
Sep 6 2022, 9:42 AM · Data Model, Nixguix loader

Aug 30 2022

ardumont added a revision to T3781: Replace the Nixguix loader with a lister: D8341: Add nixguix lister.
Aug 30 2022, 11:19 AM · Data Model, Nixguix loader

Jul 1 2022

douardda triaged T4370: Refactor the origin visit data model (aka get rid of the OriginVisit model object) as High priority.
Jul 1 2022, 4:35 PM · Storage manager, Data Model

Jun 30 2022

ardumont updated the task description for T3781: Replace the Nixguix loader with a lister.
Jun 30 2022, 3:25 PM · Data Model, Nixguix loader

Jun 29 2022

ardumont renamed T3781: Replace the Nixguix loader with a lister from Replace the Nixguix loader with a lister? to Replace the Nixguix loader with a lister.
Jun 29 2022, 11:05 AM · Data Model, Nixguix loader
ardumont added a parent task for T3781: Replace the Nixguix loader with a lister: T2879: Finalize nixguix loader implementation.
Jun 29 2022, 10:28 AM · Data Model, Nixguix loader

Jun 28 2022

ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Taking a closer look at this possible new lister, we'd end up with the following
possible outputs:

  • artifact URLs, which are mostly tarballs [1] and sometimes files [2]
  • DVCS repositories, delegated to their dedicated loaders for ingestion: svn [3], hg [4], git [5] (out of the guix manifest)
  • Other stuff can be ignored as we don't have anything relevant to ingest [6]
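
The three kinds of output listed above could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name, the `entry_type` argument and the suffix list are hypothetical, and the real lister's detection is more involved.

```python
from typing import Optional
from urllib.parse import urlparse

# Assumption: a small, non-exhaustive set of suffixes treated as tarballs.
TARBALL_SUFFIXES = (".tar", ".tar.gz", ".tgz", ".tar.bz2", ".tar.xz", ".zip")


def guess_visit_type(entry_type: str, url: Optional[str]) -> Optional[str]:
    """Map a manifest entry to a visit type, or None when there is nothing to ingest.

    `entry_type` is assumed to be the kind declared in the manifest entry
    (for instance "url", "git", "hg", "svn" or "no-origin").
    """
    # DVCS repositories are delegated to their dedicated loaders.
    if entry_type in ("git", "hg", "svn"):
        return entry_type
    # Anything else that is not a plain URL (e.g. "no-origin") is ignored.
    if entry_type != "url" or not url:
        return None
    # Artifact URLs: mostly tarballs, sometimes plain files.
    path = urlparse(url).path.lower()
    if path.endswith(TARBALL_SUFFIXES):
        return "tar"
    return "file"


if __name__ == "__main__":
    print(guess_visit_type("url", "https://example.org/foo-1.0.tar.gz"))
    print(guess_visit_type("url", "https://example.org/fix.patch"))
    print(guess_visit_type("git", "https://example.org/repo.git"))
    print(guess_visit_type("no-origin", None))
```

Suffix matching alone misses tarballs served from URLs without a telling extension, which is one reason a real implementation would need further detection steps.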
Jun 28 2022, 6:03 PM · Data Model, Nixguix loader

May 25 2022

ardumont added a comment to T3781: Replace the Nixguix loader with a lister.

Another argument: currently, there are always at least some failures when loading real
Nix and Guix repositories, so visits always have status partial, which prevents them
from being listed in
https://archive.softwareheritage.org/browse/search/?q=&with_visit=true&with_content=true&visit_type=nixguix
(but we get results when unchecking "only show origins visited at least once")

May 25 2022, 10:14 AM · Data Model, Nixguix loader
vlorentz added a comment to T3781: Replace the Nixguix loader with a lister.

Another argument: currently, there are always at least some failures when loading real Nix and Guix repositories, so visits always have status partial, which prevents them from being listed in https://archive.softwareheritage.org/browse/search/?q=&with_visit=true&with_content=true&visit_type=nixguix (but we get results when unchecking "only show origins visited at least once")

May 25 2022, 10:02 AM · Data Model, Nixguix loader

Mar 25 2022

bchauvet lowered the priority of T3134: SWHID v2 from Normal to Low.
Mar 25 2022, 5:26 PM · Roadmap 2022, Roadmap 2020, Data Model, Web app, meta-task, Roadmap 2021

Mar 23 2022

bchauvet added a project to T3134: SWHID v2: Roadmap 2022.
Mar 23 2022, 4:42 PM · Roadmap 2022, Roadmap 2020, Data Model, Web app, meta-task, Roadmap 2021

Mar 16 2022

vlorentz closed T3752: Store/represent time offsets as strings, a subtask of T3594: Faithfully store weird git objects, as Resolved.
Mar 16 2022, 10:36 AM · meta-task, Data Model, Storage manager
vlorentz closed T3752: Store/represent time offsets as strings as Resolved.

swh-model 5.0.0 released, which finalizes these changes

Mar 16 2022, 10:36 AM · Data Model, Storage manager
vlorentz added revisions to T3752: Store/represent time offsets as strings: D7011: Revert "Restore 'offset' and 'negative_utc' arguments and make them optional", D7012: Remove deprecated property 'TimestampWithTimezone.offset'.
Mar 16 2022, 10:36 AM · Data Model, Storage manager

Mar 15 2022

vlorentz updated the task description for T3752: Store/represent time offsets as strings.
Mar 15 2022, 10:33 AM · Data Model, Storage manager

Mar 7 2022

vlorentz updated the task description for T3924: Write mailmaps after cooking git-bare archives with display names?.
Mar 7 2022, 12:54 PM · Data Model, Vault

Mar 3 2022

vlorentz closed T3890: schema of timezones in the journal as Resolved.
Mar 3 2022, 2:46 PM · Journal, Data Model

Feb 14 2022

vlorentz added a parent task for T3924: Write mailmaps after cooking git-bare archives with display names?: T3504: Make the git-bare cooker publicly available.
Feb 14 2022, 2:34 PM · Data Model, Vault

Feb 10 2022

vlorentz updated the task description for T3924: Write mailmaps after cooking git-bare archives with display names?.
Feb 10 2022, 11:05 AM · Data Model, Vault
vlorentz triaged T3924: Write mailmaps after cooking git-bare archives with display names? as Normal priority.
Feb 10 2022, 10:57 AM · Data Model, Vault

Feb 8 2022

vlorentz closed T3594: Faithfully store weird git objects as Resolved.
Feb 8 2022, 11:53 AM · meta-task, Data Model, Storage manager
vlorentz closed T3753: Store original git manifests as Resolved.
Feb 8 2022, 11:53 AM · Data Model, Storage manager
vlorentz closed T3753: Store original git manifests, a subtask of T3594: Faithfully store weird git objects, as Resolved.
Feb 8 2022, 11:53 AM · meta-task, Data Model, Storage manager