Data Model
Active · Public

Recent Activity

Sat, May 8

zack updated the task description for T3316: SWHID v2: determine binary-to-text encoding for checksum part.
Sat, May 8, 1:18 PM · Data Model, Roadmap 2021
zack triaged T3316: SWHID v2: determine binary-to-text encoding for checksum part as Normal priority.
Sat, May 8, 11:43 AM · Data Model, Roadmap 2021
zack closed T2210: Data Model as Invalid.

Closing this as it was a vague meta-task from 2020 roadmap (but we'll keep the actual sub-tasks, which were more clearly identified and are still relevant).

Sat, May 8, 11:37 AM · Data Model, Roadmap 2020

Fri, Apr 30

anlambert added a revision to T3298: Consider making SWHID handling case insensitive: D5655: assets/webapp-utils: Add lowercase validator for core SWHIDs.
Fri, Apr 30, 2:43 PM · Data Model, Web app
vlorentz added a revision to T3298: Consider making SWHID handling case insensitive: D5654: docs/persistent-identifiers: Add guidelines for fixing invalid SWHIDs (this time for uppercase).
Fri, Apr 30, 12:57 PM · Data Model, Web app

Thu, Apr 29

anlambert added a revision to T3298: Consider making SWHID handling case insensitive: D5649: identifiers: Add support for resolving core SWHID with uppercase chars.
Thu, Apr 29, 5:41 PM · Data Model, Web app
rdicosmo added a comment to T3298: Consider making SWHID handling case insensitive.

So for SWHID v1, the resolver should turn the core part into lowercase, am I right?

Thu, Apr 29, 1:16 PM · Data Model, Web app
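
The lowercasing step discussed in this comment could look roughly like the sketch below. It is an illustration only, not the code from D5649 or from swh.model: the function name `normalize_core_swhid` and the regular expression are hypothetical, and the sketch assumes the SWHID v1 core syntax `swh:1:<type>:<40 hex digits>` with optional qualifiers after the first `;`.

```python
import re

# Assumed SWHID v1 core syntax: scheme, version 1, object type, 40 lowercase hex digits.
CORE_SWHID_RE = re.compile(r"swh:1:(cnt|dir|rev|rel|snp):[0-9a-f]{40}")


def normalize_core_swhid(swhid: str) -> str:
    """Lowercase the core part of a SWHID and leave any qualifiers untouched.

    Hypothetical helper: raises ValueError when the lowercased core part
    still does not match the assumed v1 syntax.
    """
    # Everything before the first ';' is the core part; the rest are qualifiers.
    core, sep, qualifiers = swhid.partition(";")
    core = core.lower()
    if not CORE_SWHID_RE.fullmatch(core):
        raise ValueError(f"not a valid core SWHID: {core!r}")
    return core + sep + qualifiers
```

Qualifiers are deliberately not lowercased here, since values such as origin URLs can be case sensitive. The strict syntax check still runs on the normalized core part, so the specification itself stays case sensitive and only the resolver becomes lenient.
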
anlambert added a comment to T3298: Consider making SWHID handling case insensitive.

I'm not a fan of changing the spec of SWHID version 1 to make them case insensitive, as it seems to be a significant change (in particular for the code that checks for the syntactic correctness of IDs).

Thu, Apr 29, 12:50 PM · Data Model, Web app
vlorentz added a project to T3298: Consider making SWHID handling case insensitive: Data Model.
Thu, Apr 29, 12:28 PM · Data Model, Web app

Fri, Apr 23

vlorentz assigned T3134: SWHID v2 to zack.
Fri, Apr 23, 4:50 PM · Roadmap 2020, Data Model, Web app, meta-task, Roadmap 2021
vlorentz updated the task description for T3284: Support for multiple revision authors?.
Fri, Apr 23, 2:09 PM · Data Model
ardumont updated the task description for T3284: Support for multiple revision authors?.
Fri, Apr 23, 1:34 PM · Data Model
vlorentz lowered the priority of T3284: Support for multiple revision authors? from Normal to Wishlist.
Fri, Apr 23, 1:22 PM · Data Model
vlorentz triaged T3284: Support for multiple revision authors? as Normal priority.
Fri, Apr 23, 1:22 PM · Data Model

Thu, Apr 22

douardda added a subtask for T1957: Handling missing DAG nodes: T3282: Add support for "uninterpreted upstream object" in SWH model and storage.
Thu, Apr 22, 2:44 PM · Data Model
douardda added a comment to T1957: Handling missing DAG nodes.

Examples of such missing objects are revisions with attributes that cannot fit the current data model, e.g. out-of-range dates. We have examples of such revisions in Kafka, as mentioned in T3200 and T3170.

Thu, Apr 22, 2:39 PM · Data Model

Wed, Apr 21

douardda added a comment to T3170: Revisions in the journal with out of range dates.

Note that none of their parent revisions can be found in the archive either (I suppose one invalid revision in a set of ingested revisions prevents any of them from being inserted in the database, but they are already inserted in Kafka at this point).

Wed, Apr 21, 7:08 PM · Data Model, Journal

Thu, Apr 15

vlorentz closed T3226: swh identify with type=snapshot shows dependency not installed error as Resolved.
Thu, Apr 15, 3:11 PM · Data Model, SWH command line interface

Apr 12 2021

vlorentz added a comment to T3235: Add archival of bug tracker databases as well as an unofficial bug tracker per-project.

You are likely doing a git pull on a periodic basis. Just add git bug bridge pull [<name>] next to it.

Apr 12 2021, 3:37 PM · Archive coverage, Data Model
libEqualizer added a comment to T3235: Add archival of bug tracker databases as well as an unofficial bug tracker per-project.

However, this would require considerable work

Apr 12 2021, 2:48 PM · Archive coverage, Data Model
vlorentz triaged T3235: Add archival of bug tracker databases as well as an unofficial bug tracker per-project as Wishlist priority.

Hi, thanks for the suggestion.

Apr 12 2021, 11:31 AM · Archive coverage, Data Model

Apr 8 2021

pawarhrishi21 added a comment to T3226: swh identify with type=snapshot shows dependency not installed error.

You should install swh.model[cli] instead of swh.model. I added a better error message in D5466 so it's clearer.

Apr 8 2021, 7:31 PM · Data Model, SWH command line interface
vlorentz added a comment to T3226: swh identify with type=snapshot shows dependency not installed error.

And I'm also updating the documentation at https://docs.softwareheritage.org/devel/swh-model/persistent-identifiers.html#computing

Apr 8 2021, 7:28 PM · Data Model, SWH command line interface
vlorentz added a revision to T3226: swh identify with type=snapshot shows dependency not installed error: D5469: docs: Ask readers to install swh.model[cli] to fully use swh-identify.
Apr 8 2021, 7:27 PM · Data Model, SWH command line interface
vlorentz added a comment to T3226: swh identify with type=snapshot shows dependency not installed error.

You should install swh.model[cli] instead of swh.model. I added a better error message in D5466 so it's clearer.

Apr 8 2021, 7:23 PM · Data Model, SWH command line interface
vlorentz added a revision to T3226: swh identify with type=snapshot shows dependency not installed error: D5466: swh-identify: Hide tracebacks if Click or Dulwich is not installed.
Apr 8 2021, 7:20 PM · Data Model, SWH command line interface
vlorentz triaged T3226: swh identify with type=snapshot shows dependency not installed error as Normal priority.
Apr 8 2021, 6:43 PM · Data Model, SWH command line interface
pawarhrishi21 created T3226: swh identify with type=snapshot shows dependency not installed error.
Apr 8 2021, 6:36 PM · Data Model, SWH command line interface
vlorentz closed T3220: Installing swh.model does not install its dependencies as Resolved.

Resolved by D5460; thanks again for the report

Apr 8 2021, 4:36 PM · Data Model, SWH command line interface
vlorentz added a project to T3220: Installing swh.model does not install its dependencies: Data Model.
Apr 8 2021, 4:18 PM · Data Model, SWH command line interface

Apr 7 2021

douardda added a comment to T3214: Restrict accepted timestamps to values that can be processed all along.

looks like there is no revision with date or committer_date > 9999-12-31 in the main storage...

Apr 7 2021, 3:04 PM · Data Model
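
A range check of the kind this task asks for might be sketched as follows. This is an assumption-laden illustration, not the check adopted in swh.model: it takes the limit to be the range of Python's `datetime` type (years 1 through 9999, in UTC), which matches the 9999-12-31 bound mentioned in the comment, and the name `fits_datetime_range` is hypothetical.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)
ONE_SECOND = timedelta(seconds=1)

# Bounds of Python's datetime type, expressed as whole seconds since the Unix epoch.
MIN_SECONDS = (datetime.min.replace(tzinfo=timezone.utc) - EPOCH) // ONE_SECOND
MAX_SECONDS = (
    datetime.max.replace(tzinfo=timezone.utc, microsecond=0) - EPOCH
) // ONE_SECOND


def fits_datetime_range(seconds: int) -> bool:
    """Return True if a Unix timestamp (in seconds) is representable as a datetime.

    Hypothetical helper: the accepted range is an assumption, not the
    restriction actually chosen for the data model.
    """
    return MIN_SECONDS <= seconds <= MAX_SECONDS
```

Comparing integers avoids calling `datetime.fromtimestamp` on out-of-range values, which would raise an exception instead of returning a yes/no answer.
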
douardda triaged T3214: Restrict accepted timestamps to values that can be processed all along as High priority.
Apr 7 2021, 2:30 PM · Data Model

Apr 6 2021

zack closed T1136: swh-identify: support recursive checksumming of directories as Invalid.

Duplicate of T3160.

Apr 6 2021, 11:36 AM · Data Model

Mar 26 2021

DanSeraf closed T2570: swh-identify: support exclusion patterns (e.g., for .git/) as swh-scanner does as Resolved.

Already implemented in D4193

Mar 26 2021, 3:15 PM · Data Model

Mar 24 2021

seirl updated the task description for T3170: Revisions in the journal with out of range dates.
Mar 24 2021, 6:56 PM · Data Model, Journal
seirl updated the task description for T3170: Revisions in the journal with out of range dates.
Mar 24 2021, 4:11 PM · Data Model, Journal
seirl updated the task description for T3170: Revisions in the journal with out of range dates.
Mar 24 2021, 4:11 PM · Data Model, Journal
seirl updated the task description for T3170: Revisions in the journal with out of range dates.
Mar 24 2021, 4:10 PM · Data Model, Journal
seirl triaged T3170: Revisions in the journal with out of range dates as Normal priority.
Mar 24 2021, 1:13 PM · Data Model, Journal

Mar 23 2021

vlorentz added a comment to T2686: Use hashes for all kafka keys.

(and we should keep the origin topic; we already have an ExtSWHID for origins anyway)

Mar 23 2021, 2:55 PM · Data Model, Storage manager
olasd added a comment to T2686: Use hashes for all kafka keys.

The following objects remain:

Mar 23 2021, 2:47 PM · Data Model, Storage manager
vlorentz closed T2703: Use intrinsic identifiers/hashes for RawExtrinsicMetadata objects, a subtask of T2686: Use hashes for all kafka keys, as Resolved.
Mar 23 2021, 2:33 PM · Data Model, Storage manager
vlorentz closed T2703: Use intrinsic identifiers/hashes for RawExtrinsicMetadata objects as Resolved.
Mar 23 2021, 2:33 PM · Data Model, Storage manager, Extrinsic metadata
vlorentz closed T3017: Use hashes as keys in swh.journal.objects.raw_extrinsic_metadata, a subtask of T2703: Use intrinsic identifiers/hashes for RawExtrinsicMetadata objects, as Resolved.
Mar 23 2021, 2:33 PM · Data Model, Storage manager, Extrinsic metadata
vlorentz closed T3017: Use hashes as keys in swh.journal.objects.raw_extrinsic_metadata as Resolved.
Mar 23 2021, 2:33 PM · Data Model, Storage manager, Extrinsic metadata
olasd closed T3022: Deduplicate RawExtrinsicMetadata by hash instead of a subset of their fields, a subtask of T2703: Use intrinsic identifiers/hashes for RawExtrinsicMetadata objects, as Resolved.
Mar 23 2021, 2:25 PM · Data Model, Storage manager, Extrinsic metadata

Mar 20 2021

zack renamed T3160: swh identify: add a -R/--recursive flag from swh identify: add a -R/--recursive to swh identify: add a -R/--recursive flag.
Mar 20 2021, 2:22 PM · Easy hack, Data Model
zack updated the task description for T3160: swh identify: add a -R/--recursive flag.
Mar 20 2021, 2:21 PM · Easy hack, Data Model
zack triaged T3160: swh identify: add a -R/--recursive flag as Normal priority.
Mar 20 2021, 2:20 PM · Easy hack, Data Model

Mar 19 2021

vlorentz added a subtask for T2210: Data Model: T3134: SWHID v2.
Mar 19 2021, 4:23 PM · Data Model, Roadmap 2020