Page MenuHomeSoftware Heritage

Archive contentFolder
ActivePublic

Members

  • This project does not have any members.
  • View All

Watchers

  • This project does not have any watchers.
  • View All

Details

Description

stuff related to content (of all kinds, not only "blobs") that is already stored in the Software Heritage archive

Recent Activity

Wed, Jul 29

vlorentz triaged T2333: Use non-url identifiers for origin url attribute as Normal priority.
Wed, Jul 29, 10:52 AM · Archive content

Jul 6 2020

olasd closed T998: Mercurial loader sometimes creates snapshots that point to revisions that haven't been loaded, a subtask of T995: Investigate and fix snapshots with broken links, as Resolved.
Jul 6 2020, 1:33 PM · Archive content

Jun 19 2020

anlambert closed T2441: Update SWHID regexp used by Zenodo as Resolved.

PR merged. New SWHID qualifiers will be supported in release 1.1.7 of idutils.

Jun 19 2020, 9:53 AM · Archive content

Jun 9 2020

rdicosmo added a comment to T2441: Update SWHID regexp used by Zenodo.

Le mar. 9 juin 2020 à 16:18, anlambert (Antoine Lambert) <
forge@softwareheritage.org> a écrit :

Jun 9 2020, 5:23 PM · Archive content
anlambert added a comment to T2441: Update SWHID regexp used by Zenodo.

I just simplified the regexp to allow qualifiers permutation: https://github.com/inveniosoftware/idutils/blob/cc09640ffb457bab3cfe8d0eeb4822dd521fd36d/idutils/__init__.py#L245-L249

Jun 9 2020, 4:18 PM · Archive content
rdicosmo added a comment to T2441: Update SWHID regexp used by Zenodo.

Is there a way to improve the regex in https://github.com/inveniosoftware/idutils/pull/60 to allow qualifiers to come in any order instead of the canonical one?

Jun 9 2020, 3:59 PM · Archive content
anlambert added a comment to T2441: Update SWHID regexp used by Zenodo.

PR submitted: https://github.com/inveniosoftware/idutils/pull/60

Jun 9 2020, 2:27 PM · Archive content
anlambert triaged T2441: Update SWHID regexp used by Zenodo as Normal priority.
Jun 9 2020, 11:30 AM · Archive content

Mar 24 2020

olasd added a project to T2333: Use non-url identifiers for origin url attribute : Archive content.
Mar 24 2020, 12:44 PM · Archive content

Feb 19 2020

vlorentz claimed T1258: Synthesize release objects for all upstream things that match the concept of a release.
Feb 19 2020, 5:53 PM · Archive content

Jan 29 2020

vlorentz moved T846: Some objects from the original GitHub import have never actually been imported. from Restricted Project Column to Restricted Project Column on the Restricted Project board.
Jan 29 2020, 5:07 PM · Restricted Project, Restricted Project, Archive content

Jan 23 2020

olasd changed the status of T846: Some objects from the original GitHub import have never actually been imported. from Open to Work in Progress.

List of revisions with no parents (1259):

Jan 23 2020, 6:37 PM · Restricted Project, Restricted Project, Archive content
douardda added a project to T846: Some objects from the original GitHub import have never actually been imported.: Restricted Project.
Jan 23 2020, 2:01 PM · Restricted Project, Restricted Project, Archive content
douardda added a parent task for T846: Some objects from the original GitHub import have never actually been imported.: T2207: 0 lag.
Jan 23 2020, 2:01 PM · Restricted Project, Restricted Project, Archive content

Dec 16 2019

anlambert closed T2148: Recreate save code now requests that failed when migrating loaders as Resolved.
Dec 16 2019, 3:43 PM · Archive content
anlambert updated the task description for T2148: Recreate save code now requests that failed when migrating loaders.
Dec 16 2019, 3:43 PM · Archive content
anlambert updated the task description for T2148: Recreate save code now requests that failed when migrating loaders.
Dec 16 2019, 2:33 PM · Archive content
anlambert updated the task description for T2148: Recreate save code now requests that failed when migrating loaders.
Dec 16 2019, 11:01 AM · Archive content

Dec 13 2019

anlambert updated the task description for T2148: Recreate save code now requests that failed when migrating loaders.
Dec 13 2019, 4:34 PM · Archive content
anlambert triaged T2148: Recreate save code now requests that failed when migrating loaders as Normal priority.
Dec 13 2019, 4:31 PM · Archive content

Nov 18 2019

zack added a comment to T1817: À la recherche du content perdu.

I've used swh-graph to lookup the 74 still missing contents, I've managed to find 67 of them, see cnt→ori mapping in (tracing them back to actual origins requires T2045):

Nov 18 2019, 5:54 PM · Archive content

Jul 3 2019

ardumont placed T958: googlecode import: Clean up googlecode origin's origin_visits up for grabs.
Jul 3 2019, 3:26 PM · SVN Loader, Origin-GoogleCode, Archive content

Jun 20 2019

olasd updated the task description for T1817: À la recherche du content perdu.
Jun 20 2019, 6:23 PM · Archive content
olasd updated subscribers of T1817: À la recherche du content perdu.

151 contents have been restored with help from the provenance index, thanks to @grouss.

Jun 20 2019, 6:22 PM · Archive content

Jun 17 2019

olasd triaged T1817: À la recherche du content perdu as Normal priority.
Jun 17 2019, 5:59 PM · Archive content

Apr 24 2019

vlorentz closed T1691: metadata indexer: investigate metadata entries with empty mappings as Resolved.

We should investigate why they are there.

Apr 24 2019, 5:22 PM · Archive content, Indexer
vlorentz closed T1691: metadata indexer: investigate metadata entries with empty mappings, a subtask of T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata, as Resolved.
Apr 24 2019, 5:22 PM · Archive content, Indexer
zack renamed T1691: metadata indexer: investigate metadata entries with empty mappings from metadata indexer: investigate empty mappings to metadata indexer: investigate metadata entries with empty mappings.
Apr 24 2019, 5:21 PM · Archive content, Indexer
zack triaged T1691: metadata indexer: investigate metadata entries with empty mappings as Normal priority.
Apr 24 2019, 5:20 PM · Archive content, Indexer
zack closed T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata as Resolved.

This is now done, aside from a minor issue noted below:

softwareheritage-indexer=# select count(*) from revision_intrinsic_metadata where metadata = '{"@context": "https://doi.org/10.5063/schema/codemeta-2.0"}'::jsonb;
 count 
-------
     0
(1 row)
Apr 24 2019, 5:18 PM · Archive content, Indexer

Apr 3 2019

vlorentz updated the task description for T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata.
Apr 3 2019, 11:14 AM · Archive content, Indexer
vlorentz updated the task description for T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata.
Apr 3 2019, 10:41 AM · Archive content, Indexer
vlorentz added a comment to T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata.
Apr 3 2019, 9:52 AM · Archive content, Indexer

Apr 2 2019

zack updated the task description for T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata.
Apr 2 2019, 4:41 PM · Archive content, Indexer
zack updated the task description for T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata.
Apr 2 2019, 4:40 PM · Archive content, Indexer
zack updated the task description for T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata.
Apr 2 2019, 4:37 PM · Archive content, Indexer
zack updated the task description for T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata.
Apr 2 2019, 4:37 PM · Archive content, Indexer

Mar 25 2019

olasd closed T1534: PostgreSQL replication issues between prado and somerset as Resolved.

The replication process from prado to somerset is now complete, and the archive frontend has been switched over to this database.

Mar 25 2019, 6:08 PM · System administration, Archive content
olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 25 2019, 6:07 PM · System administration, Archive content
olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 25 2019, 10:32 AM · System administration, Archive content

Mar 23 2019

olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 23 2019, 2:29 PM · System administration, Archive content
olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 23 2019, 10:02 AM · System administration, Archive content

Mar 22 2019

olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 22 2019, 11:26 PM · System administration, Archive content
olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 22 2019, 6:32 PM · System administration, Archive content
olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 22 2019, 6:00 PM · System administration, Archive content
olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 22 2019, 5:51 PM · System administration, Archive content
olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 22 2019, 5:31 PM · System administration, Archive content
olasd updated the task description for T1534: PostgreSQL replication issues between prado and somerset.
Mar 22 2019, 4:52 PM · System administration, Archive content
olasd changed the status of T1534: PostgreSQL replication issues between prado and somerset from Open to Work in Progress.

The replicated cluster is now clear to be taken down for a rebuild.

Mar 22 2019, 2:55 PM · System administration, Archive content

Mar 20 2019

zack reassigned T1549: Clean up entries in {origin_intrinsic,revision}_metadata with no metadata from zack to vlorentz.
Mar 20 2019, 12:10 PM · Archive content, Indexer