cpan: Add extid manifest to CpanPackageInfo
Closed, Public

Authored by anlambert on Nov 2 2022, 4:36 PM.

Details

Summary

This avoids downloading and processing a release archive for
a CPAN module if it has already been archived by Software Heritage.

Related to T2833
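
A minimal sketch of the skip logic described in this summary, not the actual loader code. Here `known_extids` stands in for the archive's extid lookup and `download_and_process` for the expensive download step; both names, and the `load_release` helper itself, are hypothetical.

```python
from typing import Callable, Dict, Tuple

# Assumption: an extid is modelled as a (type, value) pair.
ExtID = Tuple[str, bytes]


def load_release(
    extid: ExtID,
    known_extids: Dict[ExtID, str],
    download_and_process: Callable[[], str],
) -> Tuple[str, bool]:
    """Return (release id, downloaded?) and download only on a lookup miss."""
    archived = known_extids.get(extid)
    if archived is not None:
        # The release archive was already archived: skip the download entirely.
        return archived, False
    # Unknown extid: fetch and process the archive, then remember the mapping.
    release_id = download_and_process()
    known_extids[extid] = release_id
    return release_id, True
```

Calling `load_release` twice with the same extid triggers the download callback only once.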

Event Timeline

Build is green

Patch application report for D8801 (id=31717)

Rebasing onto e6847f3616...

Current branch diff-target is up to date.
Changes applied before test
commit 458d67e37e6d0b77c83f4c504bad19816741e377
Author: Antoine Lambert <anlambert@softwareheritage.org>
Date:   Wed Nov 2 16:29:04 2022 +0100

    cpan: Implement CpanPackageInfo.extid
    
    As each release artifact info for a CPAN module provided as loader argument
    is guaranteed to contain the sha256 checksum of the associated archive to
    download (as the CPAN lister discards artifacts with no checksum info),
    we can use that checksum as extid.
    
    It enables to avoid downloading and processing a release archive if it has
    already been archived by Software Heritage.
    
    Related to T2833

See https://jenkins.softwareheritage.org/job/DLDBASE/job/tests-on-diff/1012/ for more details.

Update: Use an extid manifest instead of the bare sha256 checksum, as it seems more robust.
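
To illustrate the difference, here is a hedged sketch of a manifest-based extid: several fields of the release are rendered into a text manifest, and the extid is the sha256 of that manifest rather than of the archive alone. The `ReleaseInfo` class, the manifest layout, and the `"cpan-manifest-sha256"` type string are assumptions made for this example and are not taken from the diff.

```python
import hashlib
from dataclasses import dataclass
from string import Template
from typing import Tuple

# Hypothetical manifest layout; the real field set may differ.
MANIFEST_FORMAT = Template(
    "name $name\nversion $version\nsha256 $sha256\nurl $url"
)


@dataclass(frozen=True)
class ReleaseInfo:
    name: str
    version: str
    sha256: str  # checksum of the release archive, as provided by the lister
    url: str

    def manifest(self) -> bytes:
        """Render the identifying fields into a deterministic byte string."""
        return MANIFEST_FORMAT.substitute(
            name=self.name, version=self.version, sha256=self.sha256, url=self.url
        ).encode()

    def extid(self) -> Tuple[str, bytes]:
        """Return (extid type, sha256 digest of the manifest)."""
        return "cpan-manifest-sha256", hashlib.sha256(self.manifest()).digest()
```

With this layout, two releases that share an archive checksum but differ in name, version, or URL get distinct extids, which is one way a manifest can be more robust than the checksum alone.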

anlambert retitled this revision from "cpan: Implement CpanPackageInfo.extid" to "cpan: Add extid manifest to CpanPackageInfo" on Nov 2 2022, 5:20 PM.
anlambert edited the summary of this revision.

Build is green

Patch application report for D8801 (id=31720)

Rebasing onto e6847f3616...

Current branch diff-target is up to date.
Changes applied before test
commit 981a8d58cc06d3005ecf29715280616f86d61a41
Author: Antoine Lambert <anlambert@softwareheritage.org>
Date:   Wed Nov 2 16:29:04 2022 +0100

    cpan: Add extid manifest to CpanPackageInfo
    
    It enables to avoid downloading and processing a release archive for
    a CPAN module if it has already been archived by Software Heritage.
    
    Related to T2833

See https://jenkins.softwareheritage.org/job/DLDBASE/job/tests-on-diff/1013/ for more details.

This revision is now accepted and ready to land. Nov 2 2022, 5:38 PM

Build is green

Patch application report for D8801 (id=31728)

Rebasing onto 8e34a6d779...

Current branch diff-target is up to date.
Changes applied before test
commit bf2cb039d5d7ad2877d01a32f2c354b375e9aead
Author: Antoine Lambert <anlambert@softwareheritage.org>
Date:   Wed Nov 2 16:29:04 2022 +0100

    cpan: Add extid manifest to CpanPackageInfo
    
    It enables to avoid downloading and processing a release archive for
    a CPAN module if it has already been archived by Software Heritage.
    
    Related to T2833

See https://jenkins.softwareheritage.org/job/DLDBASE/job/tests-on-diff/1016/ for more details.