github/utils: Deal with exotic urls to canonicalize
The sample of those exotic urls got extracted out the staging scheduler [1].
And the actual run of that code makes the github api (without mocks) returns the correct
canonical urls [2]
Related to T3874
[1] P1371
[2]
$ ipython ... In [1]: from swh.core.github.utils import get_canonical_github_origin_url In [2]: get_canonical_github_origin_url('git@github.com/huaweicloud/huaweicloud-sdk-java-v3.git') No tokens set in configuration, using anonymous mode Out[2]: 'https://github.com/huaweicloud/huaweicloud-sdk-java-v3' In [3]: get_canonical_github_origin_url('git//github.com/powertac/powertac-server.git') No tokens set in configuration, using anonymous mode Out[3]: 'https://github.com/powertac/powertac-server' In [4]: get_canonical_github_origin_url('https://${env.GITHUB_USER}:${env.GITHUB_TOKEN}@github.com/molgenis/vibe.git') No tokens set in configuration, using anonymous mode Out[4]: 'https://github.com/molgenis/vibe' In [5]: get_canonical_github_origin_url('ssh://git@github.com/softwaremagico/ThinkMachine.git') No tokens set in configuration, using anonymous mode Out[5]: 'https://github.com/softwaremagico/ThinkMachine' In [6]: get_canonical_github_origin_url('ssh://github.com:alibaba/SmartEngine.git') No tokens set in configuration, using anonymous mode Out[6]: 'https://github.com/alibaba/SmartEngine' In [7]: get_canonical_github_origin_url('//github.com:networknt/light-tram-kafka.git') No tokens set in configuration, using anonymous mode Out[7]: 'https://github.com/networknt/light-tram-kafka' In [8]: get_canonical_github_origin_url('[fetch=]git@github.com:turnonline/ecosystem-admin-widgets.git') No tokens set in configuration, using anonymous mode Out[8]: 'https://github.com/turnonline/ecosystem-admin-widgets' In [9]: get_canonical_github_origin_url('git@github.com:ttulka/spring-boot-configuration-properties-store.git') No tokens set in configuration, using anonymous mode Out[9]: 'https://github.com/ttulka/spring-boot-configuration-properties-store'