We want to fully automate the export of the archive graph. For an example of the most recent output, see: https://annex.softwareheritage.org/public/dataset/graph/2019-01-28/
Description
Status | Assigned | Task
---|---|---
Migrated | gitlab-migration | T3085 Complete and updated copy of the archive on S3 (objects+graph)
Migrated | gitlab-migration | T1848 refresh graph dataset export
Migrated | gitlab-migration | T1847 fully automate export of the graph dataset
Migrated | gitlab-migration | T3178 document how to export the graph dataset automatically
Migrated | gitlab-migration | T2431 Document how to export the graph edge dataset
Event Timeline
The ORC exporter is done, and it's likely that we won't provide CSV exports in the future, or we'll generate them from the ORC format.
Reopening: ideally, we'd like to run the entire ORC export to completion once before closing.