Page MenuHomeSoftware Heritage

DatasetsFolder
ActivePublic

Members

  • This project does not have any members.

Watchers

  • This project does not have any watchers.

Details

Description

datasets maintained by Software Heritage

Recent Activity

Aug 19 2019

seirl triaged T1956: Integrate usage docs of the graph dataset in swh-docs as High priority.
Aug 19 2019, 6:19 PM · Datasets

Jul 14 2019

zack renamed T1914: synchronously write content objects to AWS during ingestion from synchronously write content objects to AWS to synchronously write content objects to AWS during ingestion.
Jul 14 2019, 4:48 PM · Mirror, Datasets
zack triaged T1914: synchronously write content objects to AWS during ingestion as High priority.
Jul 14 2019, 4:47 PM · Mirror, Datasets

Jul 9 2019

zack triaged T1899: complete object storage mirror on AWS as Normal priority.
Jul 9 2019, 10:59 AM · Mirror, Datasets

Jun 30 2019

zack added a parent task for T1848: refresh graph dataset export: T1868: refresh compressed representation of the archive.
Jun 30 2019, 1:58 PM · Datasets

Jun 23 2019

zack added a subtask for T1848: refresh graph dataset export: T1741: graph dataset: update to use persistent identifiers everywhere.
Jun 23 2019, 10:23 PM · Datasets
zack added a parent task for T1741: graph dataset: update to use persistent identifiers everywhere: T1848: refresh graph dataset export.
Jun 23 2019, 10:23 PM · Datasets
zack triaged T1848: refresh graph dataset export as Low priority.
Jun 23 2019, 10:22 PM · Datasets
zack added a parent task for T1847: fully automate export of the graph dataset: T1848: refresh graph dataset export.
Jun 23 2019, 10:22 PM · Datasets
zack added a subtask for T1848: refresh graph dataset export: T1847: fully automate export of the graph dataset.
Jun 23 2019, 10:22 PM · Datasets
zack created T1848: refresh graph dataset export.
Jun 23 2019, 10:21 PM · Datasets
zack triaged T1847: fully automate export of the graph dataset as High priority.
Jun 23 2019, 10:20 PM · Datasets
zack created T1847: fully automate export of the graph dataset.
Jun 23 2019, 10:20 PM · Datasets

Jun 11 2019

seirl triaged T1796: Datasets exported from Spark are missing some rows as Normal priority.
Jun 11 2019, 11:52 PM · Datasets

Jun 5 2019

zack claimed T1742: graph dataset: uniform file names.
Jun 5 2019, 10:07 AM · Datasets
zack closed T1742: graph dataset: uniform file names as Resolved.
Jun 5 2019, 10:07 AM · Datasets

Jun 4 2019

zack closed T1783: edge dataset: re-export rev→rev edges in the right order as Resolved.
Jun 4 2019, 10:33 PM · Datasets
zack triaged T1783: edge dataset: re-export rev→rev edges in the right order as High priority.
Jun 4 2019, 2:33 PM · Datasets

May 23 2019

zack added a project to T1741: graph dataset: update to use persistent identifiers everywhere: Datasets.
May 23 2019, 2:37 PM · Datasets
zack added a project to T1742: graph dataset: uniform file names: Datasets.
May 23 2019, 2:37 PM · Datasets
zack added a comment to T1743: create a nice landing web page for exported dataset.

A nice related work here are the LAW datasets.

May 23 2019, 2:37 PM · Datasets
zack triaged T1743: create a nice landing web page for exported dataset as Low priority.
May 23 2019, 2:36 PM · Datasets
zack created Datasets.
May 23 2019, 2:29 PM