- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Advanced Search
Jan 8 2023
Jan 6 2023
Dec 22 2022
Future versions will be generated using only code in swh-graph (bash glue code replaced by Python code, some of which shells out to bash for simplicity), so the replication package will simply be replaced by a swh-graph tag.
TODO: deanonymized dataset should be just a <contributor_id,contributor_base64,contributor_escaped> table, rather than repeating the origin<->contributor mapping
Dec 21 2022
blobs-fileinfo.csv.zst: (no changes needed)
Dec 19 2022
Dec 15 2022
Dec 6 2022
Dec 5 2022
Dec 1 2022
Nov 24 2022
Nov 21 2022
Nov 14 2022
the replication/05-earliest-revision.sh script in the replication package mentions the swh-graph version it uses, and the fully qualified class name, so it can be found in the swh-graph code.
Nov 10 2022
Nov 7 2022
It's now available on https://annex.softwareheritage.org/public/dataset/license-blobs/2022-04-25/
Oct 19 2022
Oct 11 2022
Oct 3 2022
Sep 29 2022
Sep 23 2022
Aug 29 2022
May 1 2022
Now that there is both a columnar+compressed graph from 2021 and a columnar graph from 2022 that is pending compression, this task about "refreshing the export from January 2019" is resolved.
Apr 29 2022
Fixed in D7718
Done, this page https://annex.softwareheritage.org/public/dataset/graph/ now contains a link to the detailed list of datasets: https://forge.softwareheritage.org/D7487
Done!