Page MenuHomeSoftware Heritage

graph export: handle labels
ClosedPublic

Authored by seirl on Dec 9 2020, 7:05 PM.

Diff Detail

Repository
rDDATASET Datasets
Lint
Automatic diff as part of commit; lint not applicable.
Unit
Automatic diff as part of commit; unit tests not applicable.

Event Timeline

Can we get a better commit message? What does this do, what are the implications, why do we do that, etc.

This revision is now accepted and ready to land.Dec 11 2020, 12:12 PM

Better commit message:

Graph export: add labels to the export CSV format

We want to have the labels in the edge dataset to differentiate between
the file names and branch names that have the same src/dst.

This changes the format of the edge files to be:

<SRC> <DST> [LABEL] [PERMISSION]

Where LABEL is an optional base64-encoded label of the file or branch
name, and PERMISSION an integer in base 10 corresponding to the
permission of the file.