Page MenuHomeSoftware Heritage

GitLab lister: allow ignoring origins contained in given namespace prefixes
Open, NormalPublic

Description

When looking at GitLab forges we'd be able to ingest, I came across the Drupal forge.

https://git.drupalcode.org/explore

It looks like they have two namespaces:

  • one namespace for the main repository of all maintained projects (/project/)
  • one namespace for "issue collaboration forks" (/issues/)

Afaict, forks in the issues namespace are automatically generated for every ticket in their issue tracker, so that contributors can collaborate on patches. Patches are automatically (?) submitted as merge requests on the main repository.

I believe that we could archive this forge, but the issues namespace is noise that would be taken into account through the merge-request refs in the main repositories, and that it would be useful to filter it out on the lister side.

In that same vein, on the main gitlab.com instance, there's a bunch of transient repositories generated by the gitlab QA process that could be filtered out.

https://gitlab.com/gitlab-qa-sandbox-group-1
...

Event Timeline

olasd renamed this task from GitLab lister: allow ignoring origins contained in a given namespace to GitLab lister: allow ignoring origins contained in given namespace prefixes.Mon, Nov 14, 11:03 AM
olasd created this task.
olasd updated the task description. (Show Details)
olasd triaged this task as Normal priority.Mon, Nov 14, 4:42 PM
olasd added a project: Easy hack.