we want to update the graph dataset export, which is now dating back to January 2019 https://annex.softwareheritage.org/public/dataset/graph/2019-01-28/
ideally, doing so could be the test bed for a first fully automated export (T1847)
we want to update the graph dataset export, which is now dating back to January 2019 https://annex.softwareheritage.org/public/dataset/graph/2019-01-28/
ideally, doing so could be the test bed for a first fully automated export (T1847)
Status | Assigned | Task | ||
---|---|---|---|---|
Migrated | gitlab-migration | T3085 Complete and updated copy of the archive on S3 (objects+graph) | ||
Migrated | gitlab-migration | T1848 refresh graph dataset export | ||
Migrated | gitlab-migration | T1847 fully automate export of the graph dataset | ||
Migrated | gitlab-migration | T3178 document how to export the graph dataset automatically | ||
Migrated | gitlab-migration | T2431 Document how to export the graph edge dataset | ||
Migrated | gitlab-migration | T1741 graph dataset: update to use persistent identifiers everywhere |
Now that there is both a columnar+compressed graph from 2021 and a columnar graph from 2022 that is pending compression, this task about "refreshing the export from January 2019" is resolved.