The first cgit implementation is done. And the largest cgit instances have been listed (T1835).
But, some cgit instances [1] expose inconsistently the repositories they host (in term of git clone uri).
Reflect on the proper way to adapt the cgit lister for it.
It has been entertained so far to (excerpt from irc discussion):
- 1 extra request per repo page identified in pages. Parse those pages' result to extract the git clone uri (first implementation from @nahimilega in D1610, asked by us to be modified to avoid extra load on the server we list)
- Try and determine git clone uri pattern per group (some information is sometimes at disposal in the main page we parse). That seems to match our case specifically for those instances [1]
- Whatever the choice, not being exclusive on the implemented solution. Allow to define a listing policy per cgit instance. Because the current default policy is good enough for most, it'd be great to keep it. Triggering another policy for the one really divergent from it [1].
Related T1835
Related T376 (eclipse's cgit)
[1] Cgit instances: