
Archive content
Active · Public

Members

  • This project does not have any members.

Watchers

  • This project does not have any watchers.

Details

Description

Tasks related to content (of all kinds, not only "blobs") that is already stored in the Software Heritage archive.

Recent Activity

Yesterday

vlorentz added a comment to T75: Check integrity of Revisions and Releases.

Two other sources of mismatched checksums:

Thu, Sep 16, 7:22 PM · Archive content, Restricted Project

Tue, Sep 14

vlorentz claimed T75: Check integrity of Revisions and Releases.
Tue, Sep 14, 4:50 PM · Archive content, Restricted Project

Fri, Sep 3

vlorentz added a parent task for T75: Check integrity of Revisions and Releases: T3552: Fix corrupted releases and revisions in the storage.
Fri, Sep 3, 6:28 PM · Archive content, Restricted Project
vlorentz added a comment to T75: Check integrity of Revisions and Releases.

Old versions of Dulwich (e.g. 0.16.3, the version in Debian stretch) dropped newlines at the end of the gpgsig header.

Fri, Sep 3, 6:25 PM · Archive content, Restricted Project
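
To illustrate why a dropped trailing newline in a gpgsig header surfaces as a mismatched checksum, here is a minimal sketch of how a git object id depends on every byte of the object. The commit body, author, and signature below are made up and simplified; `git_object_id` and `make_commit` are hypothetical helper names, not functions from Dulwich or swh.

```python
import hashlib


def git_object_id(obj_type: str, body: bytes) -> str:
    """SHA-1 over b"<type> <size>\\0" + body, the way git ids its objects."""
    header = f"{obj_type} {len(body)}".encode("ascii") + b"\x00"
    return hashlib.sha1(header + body).hexdigest()


def make_commit(signature: bytes) -> bytes:
    """Build a simplified, made-up commit body carrying a gpgsig header."""
    # Continuation lines of a multi-line header start with one space.
    folded = signature.replace(b"\n", b"\n ")
    return b"".join(
        [
            b"tree " + b"0" * 40 + b"\n",
            b"author A U Thor <author@example.org> 0 +0000\n",
            b"committer A U Thor <author@example.org> 0 +0000\n",
            b"gpgsig " + folded + b"\n",
            b"\n",
            b"example message\n",
        ]
    )


# Placeholder signature (not a real PGP signature), with and without
# its trailing newline.
original_sig = (
    b"-----BEGIN PGP SIGNATURE-----\n\n"
    b"placeholder\n"
    b"-----END PGP SIGNATURE-----\n"
)
stripped_sig = original_sig.rstrip(b"\n")

id_original = git_object_id("commit", make_commit(original_sig))
id_stripped = git_object_id("commit", make_commit(stripped_sig))

# The two ids differ, so an object re-serialized without the newline no
# longer hashes to the id it was stored under.
print(id_original != id_stripped)
```

An integrity check along these lines would recompute the id from the stored fields and compare it with the recorded one; any byte lost during conversion shows up as a mismatch.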

Jul 30 2021

vlorentz removed a revision from T75: Check integrity of Revisions and Releases: D6045: converters: Preserve GPG signatures on releases.
Jul 30 2021, 10:59 AM · Archive content, Restricted Project
vlorentz added a revision to T75: Check integrity of Revisions and Releases: D6045: converters: Preserve GPG signatures on releases.
Jul 30 2021, 10:58 AM · Archive content, Restricted Project
vlorentz added a comment to T75: Check integrity of Revisions and Releases.

Dulwich 0.19.10 (released in January 2019) changed how it handles signatures on annotated tags, so we have silently dropped all signatures since we started using it (probably when we upgraded the loaders to Buster).

Jul 30 2021, 10:58 AM · Archive content, Restricted Project

May 26 2021

vlorentz changed the status of T2564: migrate existing revisions metadata extra_headers to actual extra_headers field, a subtask of T3089: Remove the 'metadata' column of the 'revision' table, from Work in Progress to Open.
May 26 2021, 11:26 AM · Storage manager, Archive content

Apr 28 2021

vlorentz changed the status of T2564: migrate existing revisions metadata extra_headers to actual extra_headers field, a subtask of T3089: Remove the 'metadata' column of the 'revision' table, from Open to Work in Progress.
Apr 28 2021, 12:43 PM · Storage manager, Archive content

Apr 27 2021

vlorentz removed a project from T3246: Document takedown request processing workflow: Roadmap 2021.
Apr 27 2021, 2:15 PM · Archive content

Apr 23 2021

vlorentz assigned T3113: Cold storage archive to douardda.
Apr 23 2021, 4:49 PM · Roadmap 2021, Archive content, meta-task
vlorentz added a subtask for T3089: Remove the 'metadata' column of the 'revision' table: T2564: migrate existing revisions metadata extra_headers to actual extra_headers field.
Apr 23 2021, 9:58 AM · Storage manager, Archive content

Apr 20 2021

douardda added a comment to T3246: Document takedown request processing workflow.

Do we also intend to have a takedown topic on Kafka?

Apr 20 2021, 11:08 AM · Archive content

Apr 19 2021

vlorentz removed a parent task for T3089: Remove the 'metadata' column of the 'revision' table: T2471: NPM package angular-ts-manage fails to be properly loaded.
Apr 19 2021, 12:43 PM · Storage manager, Archive content
rdicosmo moved T3246: Document takedown request processing workflow from Backlog to Work in progress on the Roadmap 2021 board.
Apr 19 2021, 11:53 AM · Archive content
douardda added a comment to T3246: Document takedown request processing workflow.

Also: what about the exports we provide on git-annex?

Apr 19 2021, 10:10 AM · Archive content
douardda added a comment to T3246: Document takedown request processing workflow.

Do we also intend to have a takedown topic on Kafka?

Apr 19 2021, 10:09 AM · Archive content

Apr 15 2021

vlorentz closed T3090: Make loaders not rely on the 'metadata' column of the 'revision' table, a subtask of T3089: Remove the 'metadata' column of the 'revision' table, as Resolved.
Apr 15 2021, 3:15 PM · Storage manager, Archive content

Apr 12 2021

olasd added a comment to T3246: Document takedown request processing workflow.

Knobs to adjust the visibility of origins in the archive and in the web API

Apr 12 2021, 4:52 PM · Archive content
olasd triaged T3246: Document takedown request processing workflow as Normal priority.
Apr 12 2021, 4:33 PM · Archive content

Apr 6 2021

vlorentz added a parent task for T3089: Remove the 'metadata' column of the 'revision' table: T3201: Mirror: unsupported Unicode escape sequence.
Apr 6 2021, 2:20 PM · Storage manager, Archive content

Mar 15 2021

vlorentz added a parent task for T3089: Remove the 'metadata' column of the 'revision' table: T2471: NPM package angular-ts-manage fails to be properly loaded.
Mar 15 2021, 12:32 PM · Storage manager, Archive content
vlorentz triaged T3113: Cold storage archive as Normal priority.
Mar 15 2021, 12:30 PM · Roadmap 2021, Archive content, meta-task

Mar 10 2021

rdicosmo moved T3113: Cold storage archive from Backlog to Work in progress on the Roadmap 2021 board.
Mar 10 2021, 4:29 PM · Roadmap 2021, Archive content, meta-task
rdicosmo created T3113: Cold storage archive.
Mar 10 2021, 4:26 PM · Roadmap 2021, Archive content, meta-task

Mar 5 2021

vlorentz added a subtask for T3089: Remove the 'metadata' column of the 'revision' table: T2513: Copy metadata on revisions to the extrinsic metadata storage.
Mar 5 2021, 3:51 PM · Storage manager, Archive content
vlorentz added a parent task for T3089: Remove the 'metadata' column of the 'revision' table: T2059: Generate (swh) releases from all git tags.
Mar 5 2021, 12:30 PM · Storage manager, Archive content
vlorentz triaged T3089: Remove the 'metadata' column of the 'revision' table as Normal priority.
Mar 5 2021, 12:27 PM · Storage manager, Archive content

Feb 4 2021

vlorentz added a parent task for T75: Check integrity of Revisions and Releases: T3010: Enable the validating storage proxy in production.
Feb 4 2021, 6:13 PM · Archive content, Restricted Project
vlorentz merged T3012: Check all objects in the production storage/journal have a correct hash into T75: Check integrity of Revisions and Releases.
Feb 4 2021, 6:13 PM · Archive content, Restricted Project

Oct 14 2020

ardumont added a comment to T994: origin_visit: distinguish "fetch date" and "injection date".

yes

Oct 14 2020, 1:49 PM · Archive content
olasd updated subscribers of T994: origin_visit: distinguish "fetch date" and "injection date".

I _think_ this use case is solved with the origin_visit_status table (created vs. ongoing vs. completed). @vlorentz?

Oct 14 2020, 12:34 PM · Archive content
olasd closed T829: Remove duplication between fetch_history and origin_visit as Resolved.

The fetch_history table is gone since swh.storage v0.0.155 / swh-storage schema v141.

Oct 14 2020, 12:32 PM · Storage manager, Archive content

Sep 22 2020

olasd placed T995: Investigate and fix snapshots with broken links up for grabs.

I don't think the script to check for these has been put anywhere; I believe it was just a raw SQL query.

Sep 22 2020, 4:46 PM · Archive content

Sep 11 2020

olasd changed the status of T997: Debian loader sometimes thinks a package has been loaded when it has not, a subtask of T995: Investigate and fix snapshots with broken links, from Work in Progress to Open.
Sep 11 2020, 1:56 PM · Archive content

Aug 27 2020

douardda added a comment to T995: Investigate and fix snapshots with broken links.

What's the status of this task today? Is there a probe that tracks these broken links (or a script one can run)?

Aug 27 2020, 4:40 PM · Archive content

Jul 29 2020

vlorentz triaged T2333: Use non-url identifiers for origin url attribute as Normal priority.
Jul 29 2020, 10:52 AM · Archive content

Jul 6 2020

olasd closed T998: Mercurial loader sometimes creates snapshots that point to revisions that haven't been loaded, a subtask of T995: Investigate and fix snapshots with broken links, as Resolved.
Jul 6 2020, 1:33 PM · Archive content

Jun 19 2020

anlambert closed T2441: Update SWHID regexp used by Zenodo as Resolved.

PR merged. New SWHID qualifiers will be supported in release 1.1.7 of idutils.

Jun 19 2020, 9:53 AM · Archive content

Jun 9 2020

rdicosmo added a comment to T2441: Update SWHID regexp used by Zenodo.

On Tue, Jun 9, 2020 at 4:18 PM, anlambert (Antoine Lambert) <forge@softwareheritage.org> wrote:

Jun 9 2020, 5:23 PM · Archive content
anlambert added a comment to T2441: Update SWHID regexp used by Zenodo.

I just simplified the regexp to allow qualifiers in any order: https://github.com/inveniosoftware/idutils/blob/cc09640ffb457bab3cfe8d0eeb4822dd521fd36d/idutils/__init__.py#L245-L249

Jun 9 2020, 4:18 PM · Archive content
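
As an illustration of the idea discussed in this thread (accepting qualifiers in any order rather than only the canonical one), here is a minimal sketch. It is not the regexp used in idutils; the pattern, the `parse_swhid` helper, and the example URL are assumptions made for this example, and the qualifier names (origin, visit, anchor, path, lines) are taken from the SWHID documentation.

```python
import re

# Core identifier: scheme, version 1, object type, 40 hex digits,
# followed by zero or more ";name=value" qualifiers in any order.
# Illustrative pattern only, not the one merged into idutils.
SWHID_RE = re.compile(
    r"^swh:1:(?P<type>cnt|dir|rel|rev|snp):(?P<hash>[0-9a-f]{40})"
    r"(?P<qualifiers>(?:;(?:origin|visit|anchor|path|lines)=[^;\s]+)*)$"
)


def parse_swhid(text):
    """Return (type, hash, qualifiers) or None if text does not match."""
    match = SWHID_RE.match(text)
    if match is None:
        return None
    qualifiers = {}
    for item in match.group("qualifiers").split(";"):
        if not item:
            continue  # skip the empty piece before the first ';'
        key, _, value = item.partition("=")
        if key in qualifiers:
            return None  # this sketch rejects a repeated qualifier
        qualifiers[key] = value
    return match.group("type"), match.group("hash"), qualifiers


core = "swh:1:cnt:" + "a" * 40
canonical = parse_swhid(core + ";origin=https://example.org/repo;lines=9-15")
permuted = parse_swhid(core + ";lines=9-15;origin=https://example.org/repo")

# Both orderings are accepted and yield the same qualifiers.
print(canonical == permuted)
```

Matching each qualifier as a repeated alternative keeps the pattern short; the trade-off is that duplicates have to be rejected after the match, as the helper does here.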
rdicosmo added a comment to T2441: Update SWHID regexp used by Zenodo.

Is there a way to improve the regex in https://github.com/inveniosoftware/idutils/pull/60 to allow qualifiers to come in any order instead of the canonical one?

Jun 9 2020, 3:59 PM · Archive content
anlambert added a comment to T2441: Update SWHID regexp used by Zenodo.

PR submitted: https://github.com/inveniosoftware/idutils/pull/60

Jun 9 2020, 2:27 PM · Archive content
anlambert triaged T2441: Update SWHID regexp used by Zenodo as Normal priority.
Jun 9 2020, 11:30 AM · Archive content

Mar 24 2020

olasd added a project to T2333: Use non-url identifiers for origin url attribute : Archive content.
Mar 24 2020, 12:44 PM · Archive content

Feb 19 2020

vlorentz claimed T1258: Synthesize release objects for all upstream things that match the concept of a release.
Feb 19 2020, 5:53 PM · Archive content

Jan 29 2020

vlorentz moved T846: Some objects from the original GitHub import have never actually been imported. from Backlog to Work in progress on the Roadmap 2020 board.
Jan 29 2020, 5:07 PM · Roadmap 2020, Restricted Project, Archive content

Jan 23 2020

olasd changed the status of T846: Some objects from the original GitHub import have never actually been imported. from Open to Work in Progress.

List of revisions with no parents (1259):

Jan 23 2020, 6:37 PM · Roadmap 2020, Restricted Project, Archive content
douardda added a project to T846: Some objects from the original GitHub import have never actually been imported.: Roadmap 2020.
Jan 23 2020, 2:01 PM · Roadmap 2020, Restricted Project, Archive content
douardda added a parent task for T846: Some objects from the original GitHub import have never actually been imported.: T2207: 0 lag.
Jan 23 2020, 2:01 PM · Roadmap 2020, Restricted Project, Archive content