HomeSoftware Heritage

Add graph properties compressed from the ORC dataset

Description

Add graph properties compressed from the ORC dataset

This commit adds the handling of graph *properties*, i.e., data attached
to nodes or edges (commit timestamps, commit messages, content lengths,
...) to swh-graph.

The class WriteNodeProperties is used to extract the node properties
from the ORCGraphDataset and write them in separate files, in compressed
format. The properties can then be read using the SwhGraphProperties
class.

The compression pipeline and the tests were all changed to use the new
dataset format.

Details

Provenance
seirlAuthored on Feb 5 2022, 12:24 AM
seirlPushed on Mar 29 2022, 3:38 PM
Differential Revision
D7331: Add graph properties compressed from the ORC dataset
Parents
rDGRPH8307b841e9d9: Merge tag 'v0.5.2' into master
Branches
Unknown
Tags
Unknown
Build Status
Buildable 27941
Build 43753: test-and-buildJenkins console ยท Jenkins