HomeSoftware Heritage

Add graph properties compressed from the ORC dataset

This commit no longer exists in the repository. It may have been part of a branch which was deleted.

Description

Add graph properties compressed from the ORC dataset

This commit adds the handling of graph *properties*, i.e., data attached
to nodes or edges (commit timestamps, commit messages, content lengths,
...) to swh-graph.

The class WriteNodeProperties is used to extract the node properties
from the ORCGraphDataset and write them in separate files, in compressed
format. The properties can then be read using the SwhGraphProperties
class.

The compression pipeline and the tests were all changed to use the new
dataset format.

Details

Provenance
seirlAuthored on Feb 5 2022, 12:24 AM
seirlPushed on Mar 29 2022, 3:38 PM
Differential Revision
D7331: Add graph properties compressed from the ORC dataset
Build Status
Buildable 27941
Build 43753: test-and-buildJenkins console · Jenkins

Commit No Longer Exists

This commit no longer exists in the repository.