While working on the maven lister, it was noticed that some listed urls are not the
canonical urls of their origins. This can result in duplicated origins for no good
reason.
So let's reuse the existing url canonicalization code (for gh origins) in listers
wherever possible. That code should already exist in swh-web; it is to be refactored
out into swh.core and then reused both in swh-web and in the listers (starting with the
maven one; the nixguix and packagist listers can possibly be done later as well).
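To illustrate the idea, here is a minimal sketch of what such a canonicalization could look like. It is not the actual swh-web or swh.core code: the names `canonical_github_url` and `fetch_repo_metadata` are hypothetical, and the sketch assumes the canonical url can be read from the `html_url` field returned by the GitHub REST API for a repository. It makes unauthenticated requests and does no rate limit handling.

```python
import json
import re
from typing import Callable, Optional
from urllib.request import Request, urlopen

# Matches https://github.com/<owner>/<repo>, with an optional ".git" suffix
# or trailing slash. Anything else (other forges, sub-pages) does not match.
_GITHUB_URL = re.compile(
    r"^https?://github\.com/(?P<owner>[^/]+)/(?P<repo>[^/]+?)(?:\.git)?/?$",
    re.IGNORECASE,
)


def fetch_repo_metadata(owner: str, repo: str) -> Optional[dict]:
    """Hypothetical helper: ask the GitHub REST API about a repository.

    Unauthenticated and without any rate limit handling; returns None on
    any network, HTTP or decoding error.
    """
    request = Request(
        f"https://api.github.com/repos/{owner}/{repo}",
        headers={"Accept": "application/vnd.github+json"},
    )
    try:
        with urlopen(request, timeout=10) as response:
            return json.load(response)
    except (OSError, ValueError):
        return None


def canonical_github_url(
    url: str,
    fetch: Callable[[str, str], Optional[dict]] = fetch_repo_metadata,
) -> str:
    """Return the canonical url of a gh origin, or the input url unchanged.

    The url is left untouched when it is not a gh repository url or when
    the metadata lookup fails, so callers can always use the result.
    """
    match = _GITHUB_URL.match(url.strip())
    if match is None:
        return url
    metadata = fetch(match["owner"], match["repo"])
    if not metadata or "html_url" not in metadata:
        return url
    return metadata["html_url"]
```

Passing the lookup as a `fetch` callable keeps the sketch testable without network access, and would leave room to plug in a rate-limit-aware session later on.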
Plan:
- [x] D7836: Compute canonical gh urls in an exposed library function in swh.core
- [x] D7840: Refactor GitHubSession request management out of swh.lister in swh.core
- [x] Tag, and eventually unstick the debian build
- [ ] D7840: Use GitHubSession to make the canonical url computation deal with rate limits
- [ ] Refactor swh.lister to reuse the code moved in swh.core
- [ ] Refactor swh.web to canonicalize the gh origin server side
- [ ] (Goal) Adapt maven lister to list canonical gh urls if any
Optional plan:
- Adapt nixguix lister
- Adapt packagist lister
- Adapt remaining listers if any
Note: ``gh`` refers to ``GitHub``