The origin table now contains the origin URL and a sha1 of the URL as
the "ID" field. Allows us to join this table more easily with the SWHIDs
retrieved from the compressed graph, as well as generate the edge
dataset without having to compute sha1s manually.
Details
Details
- Reviewers
ardumont - Group Reviewers
Reviewers - Commits
- rDDATASET9f342d9994aa: relational exports: add ID field to origin table
Diff Detail
Diff Detail
- Repository
- rDDATASET Datasets
- Branch
- master
- Lint
No Linters Available - Unit
No Unit Test Coverage - Build Status
Buildable 28484 Build 44536: Phabricator diff pipeline on jenkins Jenkins console · Jenkins Build 44535: arc lint + arc unit
Event Timeline
Comment Actions
Build is green
Patch application report for D7585 (id=27465)
Rebasing onto 075b3c3068...
Current branch diff-target is up to date.
Changes applied before test
commit 9f342d9994aaaa406c4f586fe46803c8e60b850d Author: Antoine Pietri <antoine.pietri1@gmail.com> Date: Thu Apr 14 14:11:18 2022 +0000 relational exports: add ID field to origin table The origin table now contains the origin URL and a sha1 of the URL as the "ID" field. Allows us to join this table more easily with the SWHIDs retrieved from the compressed graph, as well as generate the edge dataset without having to compute sha1s manually.
See https://jenkins.softwareheritage.org/job/DDATASET/job/tests-on-diff/129/ for more details.