
Archive git.fsfe.org (Gitea)
Closed, Migrated

Description

At the Free Software Foundation Europe (FSFE), we use git.fsfe.org (based on Gitea) to host all our code and websites. This includes a full archive of the commits to fsfe.org, which definitely has some historical value ;)

How could this be achieved? I learnt that there is no connector from Gitea to Software Heritage, but perhaps there is some automatic scanner for the public repos we host.

Unfortunately we do not have the resources (mostly regarding Go skills) to come up with a patch for Gitea's code, but I would be happy to assist, test, and give feedback!

PS: @zack asked me to file a feature request for my inquiry here. If I did not sort/label my request correctly, please feel free to edit.

Event Timeline

mxmehl triaged this task as Wishlist priority. Mar 13 2020, 12:55 PM
mxmehl created this task.
mxmehl created this object in space S1 Public.

Hi!

Thanks for submitting this request. There's a good chance that this can be the same lister as gogs: T1721.

There's a public Repository API: https://git.fsfe.org/api/v1/repos/search, so I guess the only thing missing is a lister on our side.

This looks like a cross between GitHub's and GitLab's APIs, so it shouldn't be too hard for someone to quickly implement it in our codebase.

Unfortunately, try.gogs.io's API is hidden behind auth so I can't confirm that the responses actually have the same shape between gogs and gitea.
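
For illustration, here is a rough sketch of how a lister could walk that search endpoint page by page. It is not the swh.lister code; the function names and the injectable `fetch` argument are made up for this example, and it assumes the response is a JSON object whose `data` field holds the repositories, each with a `clone_url`.

```python
import json
from typing import Callable, Iterator
from urllib.parse import urlencode
from urllib.request import urlopen


def http_get_json(url: str) -> dict:
    """Fetch a URL and decode the JSON body (hypothetical helper)."""
    with urlopen(url) as resp:
        return json.load(resp)


def iter_clone_urls(
    api_base: str,
    limit: int = 50,
    fetch: Callable[[str], dict] = http_get_json,
) -> Iterator[str]:
    """Yield clone URLs of public repositories, one page at a time.

    Assumes `<api_base>/repos/search` accepts `page` and `limit` query
    parameters and answers with {"ok": ..., "data": [...]}.
    """
    page = 1
    while True:
        query = urlencode({"page": page, "limit": limit})
        payload = fetch(f"{api_base}/repos/search?{query}")
        repos = payload.get("data", [])
        if not repos:
            # An empty page is taken to mean we are past the last one.
            return
        for repo in repos:
            yield repo["clone_url"]
        page += 1
```

Something like `for url in iter_clone_urls("https://git.fsfe.org/api/v1"): print(url)` would then print every repository the instance exposes publicly. Passing `fetch` in as an argument keeps the pagination logic testable without network access.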

zack added a subscriber: zack.

I've had multiple looks at the proposed gitea lister.
It looks fine to me and I've accepted it, but my review was not exhaustive.
If some other team member could do a second pass, that'd be neat.

Cheers,

Now that we have the gitea lister, we should (upgrade swh.lister on prod and) add a few listing tasks, such as this FSFE instance, as well as other instances like https://codeberg.org.

discussions on having lists of public gitea instances:

and

OK, I was expecting something a bit smarter in explore.sapk.fr, but not really:

https://gitea.com/sapk/explore/src/branch/master/main.go#L28

Also beware that the default pagination value (results per page) in the gitea lister is 3 (https://forge.softwareheritage.org/source/swh-lister/browse/master/swh/lister/gitea/lister.py$23), so listing is very slow.

So when a gitea listing task is about to be added, make sure to give this limit argument a decent value (e.g. 100).
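
To give an idea of the difference, here is the back-of-the-envelope arithmetic. The repository count is a made-up figure, not a measurement of any real instance, and the helper name is just for this example.

```python
def requests_needed(repo_count: int, per_page: int) -> int:
    """Number of API requests needed to list `repo_count` repositories
    when each request returns at most `per_page` results."""
    # Ceiling division without floating point.
    return -(-repo_count // per_page)


# Hypothetical instance hosting 10,000 public repositories.
print(requests_needed(10_000, 3))    # 3334 requests with the default of 3
print(requests_needed(10_000, 100))  # 100 requests with a limit of 100
```

Note that the server may cap the page size on its side, so a large limit is not guaranteed to be honoured as-is.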

douardda raised the priority of this task from Wishlist to High. Aug 27 2020, 4:28 PM

Can this be closed now? What's missing? Adding a listing task?

The lister is deployed, but this forge is not listed yet (codeberg.org is).

I've sent an email to the FSFE.

douardda claimed this task.

Listed (oneshot full + recurring incremental) and loaded (as far as I can tell).