relational exports: add ID field to origin table

Description

The origin table now contains the origin URL and a sha1 of the URL as
the "ID" field. This allows us to join this table more easily with the
SWHIDs retrieved from the compressed graph, and to generate the edge
dataset without having to compute sha1s manually.
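
As a rough illustration of what the new "ID" field holds, the sketch below
hashes an origin URL with sha1 and builds the matching origin SWHID. It
assumes the URL is encoded as UTF-8 and the digest is stored as lowercase
hex; the function names and the example URL are hypothetical and are not
taken from the export code.

```python
import hashlib


def origin_id(url: str) -> str:
    """Return the hex sha1 of an origin URL (assumed UTF-8 encoded)."""
    return hashlib.sha1(url.encode("utf-8")).hexdigest()


def origin_swhid(url: str) -> str:
    """Build an origin SWHID of the form swh:1:ori:<sha1 hex> from the URL."""
    return f"swh:1:ori:{origin_id(url)}"


if __name__ == "__main__":
    # Hypothetical origin URL, used only for illustration.
    url = "https://example.org/user/project.git"
    print(origin_id(url))
    print(origin_swhid(url))
```

With the ID stored in the table, a join against origin SWHIDs from the
graph reduces to comparing the hex digest after the "swh:1:ori:" prefix,
with no hashing needed at query time.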

Details

Provenance
seirl authored on Apr 14 2022, 4:11 PM
seirl pushed on Apr 14 2022, 5:53 PM
Differential Revision
D7585: relational exports: add ID field to origin table
Parents
rDDATASET075b3c3068fe: journalprocessor: save final offsets to a text file
Branches
Unknown
Tags
Unknown
Build Status
Buildable 28490
Build 44545: test-and-build