Prior to this, some urls were detected as file because their version name were wrongly
detected as extension, hence not matching tarball extensions.
Related to T3781
(Note: Connectivity issues on my side)
Differential D8773
nixguix: Deal with edge case url with version instead of extension ardumont on Oct 25 2022, 5:29 PM. Authored by
Details
Prior to this, some urls were detected as file because their version name were wrongly Related to T3781 (Note: Connectivity issues on my side)
Diff Detail
Event TimelineComment Actions Build is green Patch application report for D8773 (id=31631)Rebasing onto 8355fee25f... First, rewinding head to replay your work on top of it... Applying: nixguix: Deal with edge case url with version instead of extension Changes applied before testcommit e165d09ecd16014e00a4d05d3d2a0e99ac725a9b Author: Antoine R. Dumont (@ardumont) <ardumont@softwareheritage.org> Date: Tue Oct 25 17:04:29 2022 +0200 nixguix: Deal with edge case url with version instead of extension Prior to this, some urls were detected as file because their version name were wrongly detected as extension, hence not matching tarball extensions. Related to T3781 See https://jenkins.softwareheritage.org/job/DLS/job/tests-on-diff/830/ for more details. Comment Actions I think detecting versions is a lost cause. Here is what I had to do for PyPI: https://forge.softwareheritage.org/source/swh-storage/browse/master/swh/storage/migrate_extrinsic_metadata.py$234-281 with lots of annoying examples here: https://forge.softwareheritage.org/source/swh-storage/browse/master/swh/storage/tests/migrate_extrinsic_metadata/test_pypi.py$62-146 Comment Actions plus, Nix and Guix already have the package name and version in their metadata, can't we ask them to provide this data to us? Comment Actions Build is green Patch application report for D8773 (id=31635)Rebasing onto 8355fee25f... First, rewinding head to replay your work on top of it... Applying: nixguix: Deal with edge case url with version instead of extension Changes applied before testcommit 4cc46f34fb4a929afc67d560fd10765c94ec1bbd Author: Antoine R. Dumont (@ardumont) <ardumont@softwareheritage.org> Date: Tue Oct 25 17:04:29 2022 +0200 nixguix: Deal with edge case url with version instead of extension Prior to this, some urls were detected as file because their version name were wrongly detected as extension, hence not matching tarball extensions. Related to T3781 See https://jenkins.softwareheritage.org/job/DLS/job/tests-on-diff/832/ for more details. Comment Actions
With this and the next diff, the listing will be complete (in guix at least). I've some
Yes, it's complicated and annoying! I'll look in more details tomorrow to your links.
I'm pretty sure zimoun would not mind adding those indeed. But that will be enough for For the nixpkgs part, I'm not sure they will react, as far as the nixpkgs issues I've [1] T4608 (upstream issue linked from it) [2] T4609 (upstream issue linked from it) Comment Actions I am not a big fan either of adding more tarball detection heuristics but after quickly hacking on the code, it seems this is the only way to handle these edge case URLS (plus there is some cases when even analyzing HTTP headers cannot help to detect if the URL targets a tarball P1512). So let's land this and move on in deploying and testing that lister on staging. Comment Actions Rebase (commit id mismatch locally and in the diff, don't recall what changed) Comment Actions Build is green Patch application report for D8773 (id=31638)Rebasing onto 8355fee25f... Current branch diff-target is up to date. Changes applied before testcommit 026fea21da49b30fda9c5adbb24c6d7a8c24c8df Author: Antoine R. Dumont (@ardumont) <ardumont@softwareheritage.org> Date: Tue Oct 25 17:04:29 2022 +0200 nixguix: Deal with edge case url with version instead of extension Prior to this, some urls were detected as file because their version name were wrongly detected as extension, hence not matching tarball extensions. Related to T3781 See https://jenkins.softwareheritage.org/job/DLS/job/tests-on-diff/835/ for more details. |