fully automate export of the graph dataset
Closed, Migrated (Edits Locked)

Description

We want to fully automate the export of the archive graph. For an example of the most recent output, see: https://annex.softwareheritage.org/public/dataset/graph/2019-01-28/

Event Timeline

zack triaged this task as High priority. Jun 23 2019, 10:20 PM
zack created this task.
zack lowered the priority of this task from High to Normal. Nov 18 2019, 2:48 PM
zack added a project: Compressed graph service.

I think this is (reasonably) done now, please check and close it.

No, only the edge part is done; we still need a Parquet and a CSV exporter :/

zack changed the task status from Open to Work in Progress. Sep 17 2020, 9:04 AM

The ORC exporter is done, and it's likely that we won't provide CSV exports in the future, or we'll generate them from the ORC format.

Reopening: ideally, we'd like to run the entire ORC export to completion once before closing.