Page MenuHomeSoftware Heritage

Deduplicate origins in the GitHub lister
ClosedPublic

Authored by olasd on Dec 1 2021, 4:39 PM.

Details

Summary

In some circumstances, GitHub will return two separate repos with the
same html_url in the same page. This makes the lister fail with a
cardinality error.

Sentry report: https://sentry.softwareheritage.org/share/issue/642ff21b110740249c4b4bd1c0611be4/

Test Plan

a patched GitHub lister can continue on

Diff Detail

Repository
rDLS Listers
Lint
Automatic diff as part of commit; lint not applicable.
Unit
Automatic diff as part of commit; unit tests not applicable.

Event Timeline

Build is green

Patch application report for D6716 (id=24401)

Rebasing onto 8991c625ea...

Current branch diff-target is up to date.
Changes applied before test
commit 5f567b3c3425f7840f0639a766dabbdcb7688fb3
Author: Nicolas Dandrimont <nicolas@dandrimont.eu>
Date:   Wed Dec 1 16:00:14 2021 +0100

    Deduplicate origins in the GitHub lister
    
    In some circumstances, GitHub will return two separate repos with the
    same html_url in the same page. This makes the lister fail with a
    cardinality error.

See https://jenkins.softwareheritage.org/job/DLS/job/tests-on-diff/424/ for more details.

olasd requested review of this revision.Dec 1 2021, 4:42 PM
This revision is now accepted and ready to land.Dec 1 2021, 4:46 PM
This revision was automatically updated to reflect the committed changes.