this is potentially endless, but here are some big ones that I think should be there:
- [x] when we started crawling (before project announcement)
- [ ] when we started ingesting major forges/instances; ideally we should have an entry for each logo added to our [[ https://archive.softwareheritage.org/ | archive coverage list ]]
- [ ] bitbucket
- [ ] cran
- [ ] debian
- [ ] framagit
- [x] github
- [ ] gitlab.com
- [x] gitorious (T312)
- [x] google code (T673, T617, T682)
- [ ] GNU tarballs (?)
- [ ] HAL
- [ ] Inria's gitlab
- [ ] IPOL
- [ ] npm
- [ ] NixOS
- [ ] PyPI
- [ ] major/notable events that have changes archived coverage or structure
- [ ] what happened on 2020-03-01, explaining why archive started growing much faster (related to github listing/loading)