We want to add a list of notable archival changes to the [[ https://docs.softwareheritage.org/devel/archive-journal.html#archive-journal | journal of archive changes ]].
The list is potentially endless, but here are some big ones that should be there:
- [x] when we started crawling (before project announcement)
- [ ] when we started ingesting major forges/instances; ideally we should have an entry for each logo added to our [[ https://archive.softwareheritage.org/ | archive coverage list ]]
  - [ ] bitbucket
  - [ ] cran (T1709)
  - [ ] debian ([[ https://www.softwareheritage.org/2018/02/20/listing-and-loading-of-debian-repositories-now-live/ | announcement ]])
  - [ ] framagit
  - [x] github
  - [ ] gitlab instances (T1139)
  - [ ] gitlab.com (T989)
  - [ ] Inria (T1243)
  - [x] gitorious (T312)
  - [x] google code (T673, T617, T682)
  - [ ] GNU tarballs (?)
  - [ ] HAL ([[ https://www.softwareheritage.org/2018/09/28/depositing-scientific-software-into-software-heritage/ | announcement ]])
  - [ ] IPOL ([[ https://www.softwareheritage.org/2020/06/11/ipol-and-swh/ | announcement ]])
  - [ ] npm (T1378)
  - [ ] NixOS (T1991, first lister D2025...) ([[ https://www.tweag.io/blog/2020-06-18-software-heritage/ | announcement ]])
  - [ ] Guix (T1352)
  - [ ] PyPI ([[ https://www.softwareheritage.org/2018/10/10/pypi-available-on-software-heritage/ | announcement ]])
- [ ] major/notable events that have changed archive coverage or structure
  - [ ] what happened on 2020-03-01, explaining why the archive started growing much faster (related to github listing/loading)