Page MenuHomeSoftware Heritage
Feed Advanced Search

Jun 14 2020

zack added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.

Making explicit a direct answer to one of @lewo's question (hinted at by both @olasd and @rdicosmo): no, we do not want a new type of SWHID (swh:1:tar:...) for source code containers, which from our point of view are ephemeral.

Jun 14 2020, 8:56 AM · Data Model
zack added a comment to T2449: Consider switching timestamp offset storage to strings/byte arrays.

Yeah, for having played with it quite a bit in recent times, the current state of timestamp offsets isn't great. I'm fine with the idea of switching them to bytestrings as proposed.

Jun 14 2020, 8:49 AM · Storage manager, Data Model

Jun 13 2020

ardumont added a revision to T2310: Make origin visits immutable: D3278: in_memory: Make origin-visit-status-add respect "on conflict ignore" policy.
Jun 13 2020, 8:42 AM · Storage manager, Data Model

Jun 12 2020

ardumont updated the task description for T2310: Make origin visits immutable.
Jun 12 2020, 7:15 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3276: storage*: Remove origin-visit-update endpoint.
Jun 12 2020, 6:55 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3275: cassandra.storage: Write origin-visit-status once in journal.
Jun 12 2020, 6:53 PM · Storage manager, Data Model
olasd updated subscribers of T2449: Consider switching timestamp offset storage to strings/byte arrays.

(ping @zack who has done some actual analysis on the timezone-related data in the archive)

Jun 12 2020, 1:03 PM · Storage manager, Data Model
olasd triaged T2449: Consider switching timestamp offset storage to strings/byte arrays as Low priority.
Jun 12 2020, 1:00 PM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Jun 12 2020, 12:04 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3273: replay: Replay origin-visit and origin-visit-status.
Jun 12 2020, 12:02 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3272: Adapt visit_date type from string to datetime.
Jun 12 2020, 11:30 AM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Jun 12 2020, 11:05 AM · Storage manager, Data Model

Jun 11 2020

ardumont added a revision to T2310: Make origin visits immutable: D3267: swh-web: Migrate to new storage.origin_visit_add endpoint.
Jun 11 2020, 7:11 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3265: tests: Migrate to new storage.origin_visit_add endpoint.
Jun 11 2020, 6:57 PM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Jun 11 2020, 5:46 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3262: storage*: Align origin-visit-add to take iterable of OriginVisit objects as input.
Jun 11 2020, 3:26 PM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Jun 11 2020, 10:20 AM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3260: tests: Migrate to use origin visit status add storage endpoint.
Jun 11 2020, 10:15 AM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Jun 11 2020, 10:07 AM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3259: tests: Migrate to use origin visit status add storage endpoint.
Jun 11 2020, 10:05 AM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Jun 11 2020, 9:51 AM · Storage manager, Data Model

Jun 10 2020

ardumont updated the task description for T2310: Make origin visits immutable.
Jun 10 2020, 3:09 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3253: loader: Migrate to use origin visit status add storage endpoint.
Jun 10 2020, 12:36 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3251: origin-visit-upsert: Write origin-visit-status to journal.
Jun 10 2020, 10:39 AM · Storage manager, Data Model

Jun 9 2020

ardumont updated the task description for T2310: Make origin visits immutable.
Jun 9 2020, 5:43 PM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Jun 9 2020, 5:43 PM · Storage manager, Data Model

Jun 8 2020

ardumont added a revision to T2310: Make origin visits immutable: D3244: origin-visit-update: Write visit-status objects to the journal.
Jun 8 2020, 4:28 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3241: Allow journal to deal with origin_visit_status.
Jun 8 2020, 2:28 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3238: origin-visit-add: Write origin-visit-status objects to the journal.
Jun 8 2020, 11:56 AM · Storage manager, Data Model

Jun 6 2020

rdicosmo added a comment to T2430: lookup ingested tarballs (or similar source code containers) by container checksum.

An important issue indeed :-)

Jun 6 2020, 6:08 PM · Data Model

Jun 5 2020

olasd updated subscribers of T2430: lookup ingested tarballs (or similar source code containers) by container checksum.

Thanks for submitting this!

Jun 5 2020, 7:02 PM · Data Model

Jun 3 2020

ardumont added a revision to T2310: Make origin visits immutable: D3212: storage*: Add origin_visit_status_add endpoint.
Jun 3 2020, 12:33 PM · Storage manager, Data Model
olasd added a project to T2430: lookup ingested tarballs (or similar source code containers) by container checksum: Data Model.
Jun 3 2020, 11:50 AM · Data Model

Jun 1 2020

ardumont updated the task description for T2310: Make origin visits immutable.
Jun 1 2020, 2:29 PM · Storage manager, Data Model

May 29 2020

ardumont changed the status of T2310: Make origin visits immutable from Open to Work in Progress.
May 29 2020, 12:40 PM · Storage manager, Data Model

May 28 2020

ardumont updated the task description for T2310: Make origin visits immutable.
May 28 2020, 5:19 PM · Storage manager, Data Model

May 27 2020

ardumont updated the task description for T2310: Make origin visits immutable.
May 27 2020, 4:19 PM · Storage manager, Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3180: pg: Write origin visit updates & status, read from origin_visit_status.
May 27 2020, 2:20 PM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
May 27 2020, 1:35 PM · Storage manager, Data Model
ardumont added a comment to T2421: Make model objects immutable.

Re-reading (and realizing again) the argument on metadata being phased out. So,
we can drop those unused [1] metadata field from origin-visit/origin-visit-status then.

May 27 2020, 1:12 PM · Data Model
ardumont added a comment to T2421: Make model objects immutable.

I think we should do away with the metadata attributes completely.
I'm not sure why we added them to OriginVisitStatus in the end, instead of
removing them, after we checked that they were always empty in production
data.

May 27 2020, 12:22 PM · Data Model
ardumont added a comment to T2310: Make origin visits immutable.

That's been running since 15:30 or so.

May 27 2020, 11:59 AM · Storage manager, Data Model
olasd added a comment to T2421: Make model objects immutable.

@olasd any reason not to do the step 1. ? Wether or not we "fix" the remaining hashability issue by overwriting the __hash__ method, I see no harm and it makes the model (IMHO) cleaner.

May 27 2020, 10:52 AM · Data Model
olasd added a comment to T2421: Make model objects immutable.

Conceptually, both Snapshot.branches and Directory.entries are mappings, not arrays of objects: snapshots and directories must not contain entries with repeated names (which is something that's currently enforced for snapshots, but not for directories).

What about order preservation ?

May 27 2020, 10:18 AM · Data Model
douardda added a comment to T2421: Make model objects immutable.

@olasd any reason not to do the step 1. ? Wether or not we "fix" the remaining hashability issue by overwriting the __hash__ method, I see no harm and it makes the model (IMHO) cleaner.

May 27 2020, 10:09 AM · Data Model
douardda added a comment to T2421: Make model objects immutable.

Conceptually, both Snapshot.branches and Directory.entries are mappings, not arrays of objects: snapshots and directories must not contain entries with repeated names (which is something that's currently enforced for snapshots, but not for directories).

May 27 2020, 9:38 AM · Data Model

May 26 2020

olasd added a comment to T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object.

In the git "specification" (in the git code really), extra headers are a sequence of arbitrary (key: bytes, value: bytes) tuples, that are serialized in the commit object between the common headers and the commit message.

May 26 2020, 7:18 PM · Data Model
olasd added a comment to T2421: Make model objects immutable.

I think there's a conflation of two issues:

May 26 2020, 7:05 PM · Data Model
olasd renamed T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object from Extract the `extra_git_headers` away from `Revision.metadata` into a top-level immutable object to Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object.
May 26 2020, 6:30 PM · Data Model
ardumont added a project to T2422: Add an `object_type` attribute to model classes: Data Model.
May 26 2020, 5:39 PM · Data Model
ardumont added a project to T2421: Make model objects immutable: Data Model.
May 26 2020, 5:39 PM · Data Model
ardumont added a project to T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object: Data Model.
May 26 2020, 5:39 PM · Data Model

May 25 2020

ardumont updated the task description for T2310: Make origin visits immutable.
May 25 2020, 8:18 PM · Storage manager, Data Model
ardumont added a comment to T2310: Make origin visits immutable.
  • migrate remaining data (data that did not get migrated during the first migration, while loaders continued their work)
May 25 2020, 7:10 PM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
May 25 2020, 7:09 PM · Storage manager, Data Model

May 15 2020

ardumont closed D3141: Update swhid according to latest specification.
May 15 2020, 12:40 PM · Data Model, SWORD deposit

May 14 2020

swh-public-ci added a comment to D3141: Update swhid according to latest specification.

Build is green

May 14 2020, 2:40 PM · Data Model, SWORD deposit
ardumont updated the diff for D3141: Update swhid according to latest specification.

Rebase on latest master

May 14 2020, 2:37 PM · Data Model, SWORD deposit
swh-public-ci added a comment to D3141: Update swhid according to latest specification.

Build is green

May 14 2020, 12:44 PM · Data Model, SWORD deposit
ardumont updated the diff for D3141: Update swhid according to latest specification.

Rebase on latest master

May 14 2020, 12:40 PM · Data Model, SWORD deposit

May 13 2020

swh-public-ci added a comment to D3141: Update swhid according to latest specification.

Build is green

May 13 2020, 12:27 PM · Data Model, SWORD deposit
ardumont updated the diff for D3141: Update swhid according to latest specification.

Adapt according to review

May 13 2020, 12:24 PM · Data Model, SWORD deposit
anlambert accepted D3141: Update swhid according to latest specification.

Looks good to me.

May 13 2020, 11:57 AM · Data Model, SWORD deposit
ardumont updated the summary of D3141: Update swhid according to latest specification.
May 13 2020, 11:09 AM · Data Model, SWORD deposit
ardumont updated the summary of D3141: Update swhid according to latest specification.
May 13 2020, 11:07 AM · Data Model, SWORD deposit
ardumont updated the summary of D3141: Update swhid according to latest specification.
May 13 2020, 11:05 AM · Data Model, SWORD deposit

May 6 2020

ardumont updated the task description for T2310: Make origin visits immutable.
May 6 2020, 8:39 PM · Storage manager, Data Model
ardumont added a comment to T2310: Make origin visits immutable.
  • Deploy storage (and migrate data, this now can occur while loaders are running)
May 6 2020, 8:39 PM · Storage manager, Data Model

May 1 2020

ardumont added a comment to T2310: Make origin visits immutable.

After further discussion, for testing purposes, we agreed on dumping and
restoring origins and origin-visits from production to staging. And try to
check what the behavior is (2).

May 1 2020, 9:41 AM · Storage manager, Data Model

Apr 30 2020

ardumont updated the task description for T2310: Make origin visits immutable.
Apr 30 2020, 2:48 PM · Storage manager, Data Model
ardumont added a comment to T2310: Make origin visits immutable.

In the mean time, I've started to work on 1. as i'm a bit uneasy about letting
master in a state we cannot really deploy yet. As there is some work nearly
ready to be deployed (new metadata api endpoints) and i don't want to prevent
its deployment.

Apr 30 2020, 1:58 PM · Storage manager, Data Model
anlambert closed T2387: Clarify visits in SWHIDs as Wontfix.

Alright, closing this then.

Apr 30 2020, 12:31 PM · Data Model
rdicosmo added a comment to T2387: Clarify visits in SWHIDs.

I hear your concerns, but the discussion already took place and we have now already 2 published articles out there using visit, plus documentation shared that uses this new terminology.
So, no, we cannot change this now.

Apr 30 2020, 12:30 PM · Data Model
anlambert added a comment to T2387: Clarify visits in SWHIDs.

Plus numerous visits of a same origin can point to the same snapshot so the confusion here.

Apr 30 2020, 12:22 PM · Data Model
anlambert added a comment to T2387: Clarify visits in SWHIDs.

I agree that turning visit key into a snapshot one is less confusing.

Apr 30 2020, 12:16 PM · Data Model
rdicosmo added a comment to T2387: Clarify visits in SWHIDs.

We considered snapshot vs visit when choosing the qualifier name, and we settled with visit, as it conveys the idea that a repository snapshot is taken when a visit is performed (not necessarily through the same origin).

Apr 30 2020, 12:14 PM · Data Model
ardumont added a revision to T2310: Make origin visits immutable: D3101: pg-storage: Adapt implem to write both origin visit updates and status.
Apr 30 2020, 12:03 PM · Storage manager, Data Model
ardumont added a comment to T2310: Make origin visits immutable.

Status on this.

Apr 30 2020, 11:31 AM · Storage manager, Data Model
vlorentz triaged T2387: Clarify visits in SWHIDs as High priority.
Apr 30 2020, 10:56 AM · Data Model

Apr 29 2020

ardumont added a revision to T2310: Make origin visits immutable: D3080: pg-storage: Add new created state for origin-visit-state enum.
Apr 29 2020, 12:10 PM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Apr 29 2020, 11:27 AM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Apr 29 2020, 11:26 AM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Apr 29 2020, 11:24 AM · Storage manager, Data Model
ardumont updated the task description for T2310: Make origin visits immutable.
Apr 29 2020, 10:47 AM · Storage manager, Data Model
zack closed T1003: Register a SWH uri scheme with IANA as Resolved by committing rDMOD0fad88628e51: doc: check-in IANA registration template for the "swh" URI scheme.
Apr 29 2020, 9:34 AM · Data Model

Apr 28 2020

ardumont updated the task description for T2310: Make origin visits immutable.
Apr 28 2020, 6:00 PM · Storage manager, Data Model
zack changed the status of T1003: Register a SWH uri scheme with IANA from Open to Work in Progress.

I've prepared a draft IANA registration template. We have some work in flux on the SWHID spec, so I'll submit the registration request once it's done.

Apr 28 2020, 5:01 PM · Data Model
ardumont closed T2346: Decide on the semantics of origin-visit status(es) as Resolved.

We are going with 1.

Apr 28 2020, 3:22 PM · Storage manager, Data Model
ardumont closed T2346: Decide on the semantics of origin-visit status(es), a subtask of T2310: Make origin visits immutable, as Resolved.
Apr 28 2020, 3:22 PM · Storage manager, Data Model
ardumont renamed T2346: Decide on the semantics of origin-visit status(es) from Decide on the semantics of origin-visit updates to Decide on the semantics of origin-visit status(es).
Apr 28 2020, 3:21 PM · Storage manager, Data Model

Apr 27 2020

zack closed T2379: SWHID: expand spec to allow IRI characters, a subtask of T2262: Deal with IRIs, as Resolved.
Apr 27 2020, 3:33 PM · Storage manager, Data Model
zack closed T2379: SWHID: expand spec to allow IRI characters as Resolved by committing rDMOD3ef4843c8955: SWHID spec: add support for IRI.
Apr 27 2020, 3:33 PM · Storage manager, Data Model

Apr 26 2020

zack added a comment to T2379: SWHID: expand spec to allow IRI characters.

Upon (admittedly quick) review, I don't think that anything more than D3068 is needed to address this.
Double-checking/feedback welcome!

Apr 26 2020, 4:46 PM · Storage manager, Data Model
zack added a revision to T2379: SWHID: expand spec to allow IRI characters: D3068: SWHID spec: add support for IRI.
Apr 26 2020, 4:45 PM · Storage manager, Data Model

Apr 24 2020

anlambert added a comment to T2262: Deal with IRIs.

I wrote that little script to check the number of origin IRIs and URIs in the archive

Apr 24 2020, 7:23 PM · Storage manager, Data Model
zack triaged T2379: SWHID: expand spec to allow IRI characters as Normal priority.
Apr 24 2020, 3:32 PM · Storage manager, Data Model
vlorentz updated the task description for T2262: Deal with IRIs.
Apr 24 2020, 3:18 PM · Storage manager, Data Model
vlorentz renamed T2262: Deal with IRIs from SWHID: deal with IRIs to Deal with IRIs.
Apr 24 2020, 1:29 PM · Storage manager, Data Model
zack renamed T2262: Deal with IRIs from Dealing with IRIs to SWHID: deal with IRIs.
Apr 24 2020, 10:28 AM · Storage manager, Data Model

Apr 21 2020

ardumont updated the task description for T2310: Make origin visits immutable.
Apr 21 2020, 2:22 PM · Storage manager, Data Model