
Oct 18 2021

dachary changed the status of T3104: Persistent readonly perfect hash table, a subtask of T3054: Scale out object storage design, from Open to Work in Progress.
Oct 18 2021, 9:02 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
moranegg added a subtask for T3128: Improve deposit integration, management and display: T3174: Filter deposit-admin view by deposit client on admin (moderation) page.
Oct 18 2021, 2:53 PM · meta-task, Roadmap 2021, Monitoring, SWORD deposit, Web app
moranegg added a comment to T2624: Create strategy for documentation with a map or a full table of content.

https://docs.softwareheritage.org/devel/contributing/tutorial-docs-contribution.html#doc-contribution

Oct 18 2021, 12:16 PM · Roadmap 2021, meta-task, Documentation
moranegg moved T2624: Create strategy for documentation with a map or a full table of content from Pending validation to Done on the Roadmap 2021 board.
Oct 18 2021, 12:16 PM · Roadmap 2021, meta-task, Documentation

Oct 15 2021

vlorentz updated the task description for T3594: Faithfully store weird git objects.
Oct 15 2021, 2:38 PM · meta-task, Data Model, Storage manager
ardumont closed T1524: save code now: also add new origins for unknown repos, a subtask of T3082: Improve Save Code Now handling, as Resolved.
Oct 15 2021, 12:29 PM · Save Code Now, meta-task, Roadmap 2021, Web app

Oct 14 2021

vlorentz updated the task description for T3609: SWHIDv2: List issues with SWHIDv1 that should be fixed.
Oct 14 2021, 12:15 PM · Roadmap 2020, Data Model, Web app, Roadmap 2021
vlorentz removed a subtask for T3134: SWHID v2: T1957: Handling missing DAG nodes.
Oct 14 2021, 12:14 PM · Roadmap 2022, Roadmap 2020, Data Model, Web app, meta-task, Roadmap 2021
vlorentz added a subtask for T3134: SWHID v2: T1957: Handling missing DAG nodes.
Oct 14 2021, 12:13 PM · Roadmap 2022, Roadmap 2020, Data Model, Web app, meta-task, Roadmap 2021

Oct 11 2021

vlorentz updated the task description for T3595: Support disordered directory entries in git.
Oct 11 2021, 2:49 PM · meta-task, Data Model, Storage manager

Oct 8 2021

ardumont closed T3629: doc: Add a "how to save a forge" as in how it's currently done, a subtask of T1538: Add "forge" now, as Resolved.
Oct 8 2021, 5:55 PM · Add Forge Now, Roadmap 2022, meta-task, Roadmap 2021

Oct 6 2021

ardumont added a subtask for T1538: Add "forge" now: T3629: doc: Add a "how to save a forge" as in how it's currently done.
Oct 6 2021, 6:24 PM · Add Forge Now, Roadmap 2022, meta-task, Roadmap 2021

Sep 24 2021

vlorentz added a parent task for T3594: Faithfully store weird git objects: T3552: Fix corrupted releases, revisions, and directories in the storage.
Sep 24 2021, 3:13 PM · meta-task, Data Model, Storage manager

Sep 23 2021

vlorentz updated the task description for T3609: SWHIDv2: List issues with SWHIDv1 that should be fixed.
Sep 23 2021, 5:01 PM · Roadmap 2020, Data Model, Web app, Roadmap 2021
vlorentz triaged T3609: SWHIDv2: List issues with SWHIDv1 that should be fixed as Normal priority.
Sep 23 2021, 5:00 PM · Roadmap 2020, Data Model, Web app, Roadmap 2021
vlorentz added a subtask for T3604: Document the architecture of all major packages/components: T3607: Document consistency guarantees of the loaders with respect to the storage.
Sep 23 2021, 3:00 PM · meta-task, Documentation
vlorentz added a subtask for T3604: Document the architecture of all major packages/components: T2807: document swh.graph.graph module.
Sep 23 2021, 2:52 PM · meta-task, Documentation
vlorentz renamed T3604: Document the architecture of all major packages/components from Document the architecture of all major packages to Document the architecture of all major packages/components.
Sep 23 2021, 2:51 PM · meta-task, Documentation
vlorentz added a subtask for T3604: Document the architecture of all major packages/components: T3569: Document the RPC architecture.
Sep 23 2021, 2:51 PM · meta-task, Documentation
vlorentz added a subtask for T3604: Document the architecture of all major packages/components: T3333: Document the different storage backends.
Sep 23 2021, 2:51 PM · meta-task, Documentation
vlorentz triaged T3604: Document the architecture of all major packages/components as Normal priority.
Sep 23 2021, 2:50 PM · meta-task, Documentation

Sep 22 2021

vlorentz added a comment to T3596: Support "weird" permissions in directories.

Complete proposal for the above solution:

Sep 22 2021, 2:56 PM · meta-task, Data Model, Storage manager
vlorentz added a comment to T3595: Support disordered directory entries in git.

Complete proposal to implement the above solution:

Sep 22 2021, 2:51 PM · meta-task, Data Model, Storage manager
vlorentz closed T3582: cassandra: Use 'git ordering' for directory entries, a subtask of T3585: Fix inconsistencies of the Cassandra backend with postgres, as Wontfix.
Sep 22 2021, 1:44 PM · meta-task, Storage manager
vlorentz updated the task description for T3594: Faithfully store weird git objects.
Sep 22 2021, 1:42 PM · meta-task, Data Model, Storage manager
vlorentz added a comment to T3596: Support "weird" permissions in directories.

Possible solution: store them as an ascii string instead of an integer.
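
A minimal sketch of why an integer can be lossy here, assuming the "weird" cases include non-canonical spellings such as a zero-padded mode (an assumption; the comment does not list them). Git writes the mode of a tree entry as ASCII octal digits, so parsing it to an integer and formatting it back cannot restore the padding. The helper names are hypothetical and are not part of any Software Heritage package.

```python
def mode_to_int(raw: bytes) -> int:
    """Parse the ASCII octal mode of a git tree entry into an integer."""
    return int(raw, 8)


def int_to_mode(mode: int) -> bytes:
    """Format an integer mode as unpadded ASCII octal digits."""
    return b"%o" % mode


canonical = b"40000"   # usual spelling for a directory entry
padded = b"040000"     # same value, zero-padded spelling (assumed "weird" case)

# Both spellings collapse to the same integer...
same_value = mode_to_int(canonical) == mode_to_int(padded)

# ...so the integer round trip cannot give the padded spelling back.
round_trip = int_to_mode(mode_to_int(padded))

# Keeping the raw ASCII string preserves the original bytes exactly.
stored_as_string = padded
```

Under this assumption, only the string form lets the original tree bytes, and therefore the original hash, be reproduced.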

Sep 22 2021, 1:38 PM · meta-task, Data Model, Storage manager
vlorentz added a comment to T3595: Support disordered directory entries in git.

Possible solution: store a rank along with each directory entry, but ignore it unless we are reconstructing a git object or computing a SWHID (v1?)
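
The rank idea can be sketched as follows. This is an illustration under stated assumptions, not the proposal that was eventually written up: the `Entry` class, its `rank` field, and both ordering helpers are hypothetical names. The only fact relied on is that git orders tree entries by name, comparing directories as if their name ended with "/".

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Entry:
    name: bytes
    is_dir: bool
    rank: Optional[int] = None  # hypothetical: position in the original git tree


def git_sort_key(entry: Entry) -> bytes:
    # Git compares directory names as if they ended with "/".
    return entry.name + (b"/" if entry.is_dir else b"")


def canonical_order(entries: List[Entry]) -> List[Entry]:
    """Order used wherever the rank is ignored."""
    return sorted(entries, key=git_sort_key)


def original_order(entries: List[Entry]) -> List[Entry]:
    """Order used when rebuilding the git object or computing its identifier."""
    if entries and all(e.rank is not None for e in entries):
        return sorted(entries, key=lambda e: e.rank)
    return canonical_order(entries)


# A tree whose entries were written out of order by some tool.
entries = [Entry(b"b.txt", False, rank=0), Entry(b"a.txt", False, rank=1)]
names_canonical = [e.name for e in canonical_order(entries)]
names_original = [e.name for e in original_order(entries)]
```

Entries without a rank fall back to the canonical order, so well-formed directories need no extra data.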

Sep 22 2021, 1:37 PM · meta-task, Data Model, Storage manager
vlorentz triaged T3596: Support "weird" permissions in directories as Normal priority.
Sep 22 2021, 1:36 PM · meta-task, Data Model, Storage manager
vlorentz updated the task description for T3595: Support disordered directory entries in git.
Sep 22 2021, 1:34 PM · meta-task, Data Model, Storage manager
vlorentz triaged T3595: Support disordered directory entries in git as Normal priority.
Sep 22 2021, 1:34 PM · meta-task, Data Model, Storage manager
vlorentz triaged T3594: Faithfully store weird git objects as Normal priority.
Sep 22 2021, 1:31 PM · meta-task, Data Model, Storage manager
zack added a comment to T1805: Public API v2.

It's true these do not come "for free", but I still have the impression there is an "OpenAPI way" of handling these and we should stick to it.

Sep 22 2021, 1:03 PM · meta-task, Web app
douardda added a comment to T1805: Public API v2.

Items 5, 6, 7 aka pagination, auth and batches - I believe these come naturally with item 4 (specification wise)

They don't. OpenAPI is a specification to describe APIs, and it contains absolutely nothing about pagination or batches.

Sep 22 2021, 11:36 AM · meta-task, Web app

Sep 20 2021

douardda closed T1510: Have a look at openAPI and decide whether we want to follow these specs, a subtask of T1805: Public API v2, as Resolved.
Sep 20 2021, 11:54 AM · meta-task, Web app
douardda closed T2196: Batch APIs, a subtask of T2194: Archive Integration (Web API), as Wontfix.
Sep 20 2021, 11:54 AM · Roadmap 2021, meta-task
vlorentz added a revision to T3135: Improve integrity of ingested content: D6281: converters: Recompute hashes and check they match the originals.
Sep 20 2021, 11:05 AM · Storage manager, Roadmap 2021, meta-task

Sep 17 2021

vlorentz placed T3586: Figure out what to do with 'misordered' directories in Cassandra up for grabs.
Sep 17 2021, 11:37 AM · Data Model, Storage manager
vlorentz triaged T3586: Figure out what to do with 'misordered' directories in Cassandra as Normal priority.
Sep 17 2021, 11:37 AM · Data Model, Storage manager
vlorentz added a subtask for T3585: Fix inconsistencies of the Cassandra backend with postgres: T3582: cassandra: Use 'git ordering' for directory entries.
Sep 17 2021, 11:35 AM · meta-task, Storage manager
vlorentz triaged T3585: Fix inconsistencies of the Cassandra backend with postgres as Normal priority.
Sep 17 2021, 11:35 AM · meta-task, Storage manager

Sep 10 2021

vlorentz changed the status of T3504: Make the git-bare cooker publicly available, a subtask of T3096: Efficient and reliable download via the Vault, from Open to Work in Progress.
Sep 10 2021, 11:33 AM · meta-task, Roadmap 2021, Vault
vlorentz moved T3096: Efficient and reliable download via the Vault from Backlog to In progress on the Vault board.
Sep 10 2021, 11:33 AM · meta-task, Roadmap 2021, Vault

Sep 9 2021

moranegg added a subtask for T3481: Coordinate SWH Stories project: T3342: Collect material for software stories prototype.
Sep 9 2021, 12:20 PM · Acquisition Process (SWHAP), meta-task, Software Stories

Sep 6 2021

vlorentz added a project to T3558: Enable the swh-search QL in production: Archive search.
Sep 6 2021, 10:36 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata
vlorentz triaged T3559: Enable the swh-search QL in staging as Normal priority.
Sep 6 2021, 10:36 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata
vlorentz added a project to T3558: Enable the swh-search QL in production: System administration.
Sep 6 2021, 10:36 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata
vlorentz triaged T3558: Enable the swh-search QL in production as Normal priority.
Sep 6 2021, 10:36 AM · Archive search, System administration, Intrinsic metadata, Extrinsic metadata

Sep 3 2021

vlorentz triaged T3551: Fix git-fsck errors in the git-bare cooker as Normal priority.
Sep 3 2021, 6:22 PM · Vault

Aug 30 2021

dachary updated the task description for T3054: Scale out object storage design.
Aug 30 2021, 12:53 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Aug 29 2021

dachary changed the status of T3104: Persistent readonly perfect hash table, a subtask of T3054: Scale out object storage design, from Work in Progress to Open.
Aug 29 2021, 1:08 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary changed the status of T3249: Deleting and erasing an object, a subtask of T3054: Scale out object storage design, from Work in Progress to Open.
Aug 29 2021, 1:05 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Aug 26 2021

vlorentz closed T843: Vault: Add a "git bare" tarball cooker, a subtask of T3096: Efficient and reliable download via the Vault, as Resolved.
Aug 26 2021, 3:06 PM · meta-task, Roadmap 2021, Vault

Aug 23 2021

dachary closed T3422: Running the benchmarks: August 6th, 2021, 9 days, a subtask of T3054: Scale out object storage design, as Resolved.
Aug 23 2021, 12:26 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Aug 19 2021

zack updated the task description for T3490: Collect metadata from ClearlyDefined.
Aug 19 2021, 10:13 AM · Extrinsic metadata
vlorentz removed a subtask for T2202: Collect extrinsic metadata: T2513: Copy metadata on revisions to the extrinsic metadata storage.
Aug 19 2021, 9:20 AM · Roadmap 2022, meta-task, Roadmap 2021, Extrinsic metadata
vlorentz updated the task description for T3490: Collect metadata from ClearlyDefined.
Aug 19 2021, 9:18 AM · Extrinsic metadata
vlorentz placed T3490: Collect metadata from ClearlyDefined up for grabs.
Aug 19 2021, 9:18 AM · Extrinsic metadata
vlorentz triaged T3490: Collect metadata from ClearlyDefined as Normal priority.
Aug 19 2021, 9:16 AM · Extrinsic metadata

Aug 11 2021

moranegg added a project to T3481: Coordinate SWH Stories project: Acquisition Process (SWHAP).
Aug 11 2021, 10:32 PM · Acquisition Process (SWHAP), meta-task, Software Stories
moranegg moved T3481: Coordinate SWH Stories project from Backlog to In progress on the Software Stories board.
Aug 11 2021, 10:24 PM · Acquisition Process (SWHAP), meta-task, Software Stories
moranegg triaged T3481: Coordinate SWH Stories project as High priority.
Aug 11 2021, 10:23 PM · Acquisition Process (SWHAP), meta-task, Software Stories

Aug 10 2021

douardda updated the task description for T3085: Complete and updated copy of the archive on S3 (objects+graph).
Aug 10 2021, 4:00 PM · Roadmap 2022, meta-task, Roadmap 2021, System administration, Object storage
douardda updated the task description for T3085: Complete and updated copy of the archive on S3 (objects+graph).
Aug 10 2021, 3:56 PM · Roadmap 2022, meta-task, Roadmap 2021, System administration, Object storage

Aug 3 2021

ardumont added a comment to T3127: Compute and display distribution of origins by forge.

The computation of those metrics will be executed in production on a regular basis, probably each day, to keep them up to date.

Aug 3 2021, 5:00 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
ardumont added a revision to T3127: Compute and display distribution of origins by forge: D6052: Install update-metrics as a service called daily.
Aug 3 2021, 2:32 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task

Aug 2 2021

dachary added a comment to T3054: Scale out object storage design.

Improve the readability of the graphs

Aug 2 2021, 11:46 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Jul 30 2021

vlorentz added a revision to T3135: Improve integrity of ingested content: D6045: converters: Preserve GPG signatures on releases.
Jul 30 2021, 10:59 AM · Storage manager, Roadmap 2021, meta-task
vlorentz added a project to T3112: Provenance index for the full archive: Provenance database.
Jul 30 2021, 10:15 AM · Roadmap 2022, Provenance database, Roadmap 2021, meta-task

Jul 29 2021

ardumont changed the status of T3402: Deploy swh-counters v0.8.0 and backfill origins, a subtask of T3127: Compute and display distribution of origins by forge, from Wontfix to Resolved.
Jul 29 2021, 1:24 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task

Jul 23 2021

anlambert added a comment to T3127: Compute and display distribution of origins by forge.
In T3127#67581, @anlambert wrote:

    I am a bit puzzled by the numbers shown: really, we have only 200k origins for GitLab.com?

Indeed, there is something weird here, as we have more than one million gitlab.com origins in the database.

softwareheritage=> select count(*) from origin where url like 'https://gitlab.com/%';
  count  
---------
 1023499
(1 row)

Looks like something was missed when computing lister metrics from the scheduler database; this needs further investigation.

Indeed, please do look into this, thanks.

Jul 23 2021, 12:17 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task

Jul 22 2021

anlambert added a comment to T3127: Compute and display distribution of origins by forge.

Thanks for these details. This count is missing the 800k git origins; @ardumont and @olasd should be able to tell you how to find them.

Jul 22 2021, 12:29 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
rdicosmo added a comment to T3127: Compute and display distribution of origins by forge.

I am a bit puzzled by the numbers shown: really, we have only 200k origins for GitLab.com?

Indeed, there is something weird here, as we have more than one million gitlab.com origins in the database.

softwareheritage=> select count(*) from origin where url like 'https://gitlab.com/%';
  count  
---------
 1023499
(1 row)

Looks like something was missed when computing lister metrics from the scheduler database; this needs further investigation.

Jul 22 2021, 9:01 AM · Metrics/monitoring, Web app, Roadmap 2021, meta-task

Jul 21 2021

anlambert added a comment to T3127: Compute and display distribution of origins by forge.

I am a bit puzzled by the numbers shown: really, we have only 200k origins for GitLab.com?

Jul 21 2021, 5:26 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
rdicosmo added a comment to T3127: Compute and display distribution of origins by forge.

I am a bit puzzled by the numbers shown: really, we have only 200k origins for GitLab.com?
And we know we had some 1.5m origins for Google Code, so why are only 700k shown here?

Jul 21 2021, 3:40 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
anlambert added a comment to T3127: Compute and display distribution of origins by forge.

Instead, we could split the coverage widget into two tabs

  • one giving a high level overview of the archived origins, similar to what we have now with logos and counters
  • one giving the details of all forges we archived so far, displayed in a table as you suggested with relevant metrics and links to search origins for a given forge
Jul 21 2021, 3:23 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task

Jul 19 2021

anlambert added a revision to T3127: Compute and display distribution of origins by forge: D6007: common/utils: Wrap deposits list retrieval in a function.
Jul 19 2021, 5:29 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
anlambert added a comment to T3127: Compute and display distribution of origins by forge.

I think we could also get an accurate count of deposit origins (HAL, IPOL) using the swh-deposit API.

Jul 19 2021, 3:54 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
dachary closed T3421: Running the benchmarks: July 16th, 2 days, a subtask of T3054: Scale out object storage design, as Resolved.
Jul 19 2021, 7:19 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Jul 16 2021

anlambert added a comment to T3127: Compute and display distribution of origins by forge.

Only one nit about the display. Using modal windows/popovers means there will be no easy way for a user to see the full list: one will have to click on each logo one by one, which could be quite annoying. Would it be possible to have a page with a rendering of the table above? (Not sure if we want all columns, but at least the last update time and the number of origins per forge instance look relevant and interesting to me.) It could be either in addition to what you propose (e.g., as a "coverage details" link, leading to the full page), or a replacement for it (e.g., by making each forge icon just a link to the relevant anchor within the table on the "coverage details" page).

Jul 16 2021, 11:43 AM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
zack added a comment to T3127: Compute and display distribution of origins by forge.

Thanks for this update, great work!

Jul 16 2021, 11:29 AM · Metrics/monitoring, Web app, Roadmap 2021, meta-task

Jul 15 2021

vlorentz closed T2938: Create API endpoint to access raw_extrinsic_metadata, a subtask of T3097: Expose metadata in the WebApp and make it searchable, as Resolved.
Jul 15 2021, 12:18 PM · Intrinsic metadata, Extrinsic metadata, Roadmap 2021, meta-task

Jul 13 2021

anlambert added a comment to T3127: Compute and display distribution of origins by forge.

A report on what has been done so far, and some future directions regarding the display of this data in swh-web.

Jul 13 2021, 3:39 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task

Jul 12 2021

dachary updated the task description for T3054: Scale out object storage design.
Jul 12 2021, 3:41 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Jul 12 2021, 3:41 PM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Jul 9 2021

olasd changed the status of T3403: Use forge URL network location as default lister instance name, a subtask of T3127: Compute and display distribution of origins by forge, from Open to Work in Progress.
Jul 9 2021, 3:37 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
anlambert closed T3402: Deploy swh-counters v0.8.0 and backfill origins, a subtask of T3127: Compute and display distribution of origins by forge, as Wontfix.
Jul 9 2021, 2:34 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task

Jul 6 2021

dachary closed T3186: Ceph Sepia lab for performance testing, a subtask of T3054: Scale out object storage design, as Wontfix.
Jul 6 2021, 8:26 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3068: Using Sorted String Tables as a file format, a subtask of T3054: Scale out object storage design, as Wontfix.
Jul 6 2021, 8:24 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3066: Using RocksDB SST as a file format, a subtask of T3054: Scale out object storage design, as Wontfix.
Jul 6 2021, 8:23 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Jul 6 2021, 8:22 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Jul 6 2021, 8:21 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary added a subtask for T3054: Scale out object storage design: T3422: Running the benchmarks: August 6th, 2021, 9 days.
Jul 6 2021, 8:18 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Jul 6 2021, 8:13 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Jul 6 2021, 8:12 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary added a subtask for T3054: Scale out object storage design: T3421: Running the benchmarks: July 16th, 2 days.
Jul 6 2021, 7:59 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary updated the task description for T3054: Scale out object storage design.
Jul 6 2021, 7:44 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3149: Benchmark software for the object storage, a subtask of T3054: Scale out object storage design, as Resolved.
Jul 6 2021, 7:42 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task
dachary closed T3048: Using a custom Sorted String Table format, a subtask of T3054: Scale out object storage design, as Wontfix.
Jul 6 2021, 7:37 AM · Roadmap 2022, Object storage (RedHat collaboration), Roadmap 2021, meta-task

Jul 1 2021

anlambert added a revision to T1805: Public API v2: D4629: [POC] OpenAPI and Django REST Framework to specify / implement API v2.
Jul 1 2021, 1:45 PM · meta-task, Web app

Jun 29 2021

moranegg added a subtask for T3128: Improve deposit integration, management and display: T2858: Use keycloak authentication for the deposit.
Jun 29 2021, 11:12 AM · meta-task, Roadmap 2021, Monitoring, SWORD deposit, Web app