Page MenuHomeSoftware Heritage

graph export: handle labels
ClosedPublic

Authored by seirl on Dec 9 2020, 7:05 PM.

Diff Detail

Repository
rDDATASET Datasets
Branch
master
Lint
No Linters Available
Unit
No Unit Test Coverage
Build Status
Buildable 17848
Build 27581: arc lint + arc unit

Event Timeline

Can we get a better commit message? What does this do, what are the implications, why do we do that, etc.

This revision is now accepted and ready to land.Dec 11 2020, 12:12 PM

Better commit message:

Graph export: add labels to the export CSV format

We want to have the labels in the edge dataset to differentiate between
the file names and branch names that have the same src/dst.

This changes the format of the edge files to be:

<SRC> <DST> [LABEL] [PERMISSION]

Where LABEL is an optional base64-encoded label of the file or branch
name, and PERMISSION an integer in base 10 corresponding to the
permission of the file.