It's wip as there remains stuff to add:
- tests
- checking how to filter appropriately (if we need it, i don't seem to find duplicates)
- ensure rate limit policy is ok (doubt about the returned code)
But some review would be welcome already ;)
Local runs are fine:
- origins are created in local swh db (good ones with actual https which loads through git loader)
- local gitlab cache model is also populated (don't seem to find duplicates either)
- tasks are scheduled