
Oct 8 2019

vlorentz accepted D2084: swh.storage.filter: Add filtering storage implementation.

I don't think this is useful anymore. The postgresql storage does the filtering itself now (see D2034). Are you planning on removing this feature from the postgresql storage?

It always did that.

Oct 8 2019, 3:05 PM · Storage manager
swh-public-ci added a comment to D2085: swh.storage.buffer: Add buffering proxy storage implementation.

Build has FAILED

Oct 8 2019, 3:03 PM · Storage manager
ardumont added a comment to D2085: swh.storage.buffer: Add buffering proxy storage implementation.

Also, I understand how that's currently useful, but FYI, batching content has absolutely no effect for Cassandra (the Cassandra backend breaks batches into individual records)

Oct 8 2019, 3:03 PM · Storage manager
vlorentz added a comment to D2085: swh.storage.buffer: Add buffering proxy storage implementation.

Also, I understand how that's currently useful, but FYI, batching content has absolutely no effect for Cassandra (the Cassandra backend breaks batches into individual records)

Oct 8 2019, 3:01 PM · Storage manager
ardumont added a comment to D2084: swh.storage.filter: Add filtering storage implementation.

BUILD has failed

Oct 8 2019, 3:01 PM · Storage manager
vlorentz requested changes to D2085: swh.storage.buffer: Add buffering proxy storage implementation.

s/Sequence/Iterable/ because we don't do random access (see definition of sequences here: https://docs.python.org/3/glossary.html#term-sequence )

Oct 8 2019, 3:00 PM · Storage manager
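
To illustrate the review remark above: a function that only loops over its argument once can be annotated with `Iterable` rather than `Sequence`, since it never indexes or calls `len()`. The sketch below is a minimal illustration; the function name `count_items` is hypothetical and is not taken from D2085.

```python
from typing import Iterable


def count_items(contents: Iterable[bytes]) -> int:
    """Count items in one forward pass (hypothetical helper, for illustration).

    Only iteration is used, so any iterable works: lists, tuples,
    generators. Annotating with Sequence would wrongly suggest that
    random access (contents[i]) or len(contents) is required.
    """
    total = 0
    for _ in contents:
        total += 1
    return total


# Both a list (a sequence) and a generator (not a sequence) are accepted.
from_list = count_items([b"a", b"b", b"c"])
from_generator = count_items(chunk for chunk in (b"a", b"b"))
```

A generator would fail a strict `Sequence` check because it offers no indexing, which is why the looser annotation fits code that only iterates.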
swh-public-ci added a comment to D2084: swh.storage.filter: Add filtering storage implementation.

Build has FAILED

Oct 8 2019, 2:58 PM · Storage manager
ardumont added a comment to D2084: swh.storage.filter: Add filtering storage implementation.

I don't think this is useful anymore. The postgresql storage does the filtering itself now (see D2034). Are you planning on removing this feature from the postgresql storage?

Oct 8 2019, 2:57 PM · Storage manager
vlorentz added a comment to D2084: swh.storage.filter: Add filtering storage implementation.

I don't think this is useful anymore. The postgresql storage does the filtering itself now (see D2034). Are you planning on removing this feature from the postgresql storage?

Oct 8 2019, 2:55 PM · Storage manager
swh-public-ci added a comment to D2085: swh.storage.buffer: Add buffering proxy storage implementation.

Build has FAILED

Oct 8 2019, 2:47 PM · Storage manager
Harbormaster failed remote builds in B8173: Diff 7003 for D2085: swh.storage.buffer: Add buffering proxy storage implementation!
Oct 8 2019, 2:47 PM · Storage manager
ardumont created D2085: swh.storage.buffer: Add buffering proxy storage implementation.
Oct 8 2019, 2:46 PM · Storage manager
swh-public-ci added a comment to D2084: swh.storage.filter: Add filtering storage implementation.

Build has FAILED

Oct 8 2019, 2:44 PM · Storage manager
swh-public-ci added a comment to D2084: swh.storage.filter: Add filtering storage implementation.

Build has FAILED

Oct 8 2019, 2:40 PM · Storage manager
Harbormaster failed remote builds in B8172: Diff 7002 for D2084: swh.storage.filter: Add filtering storage implementation!
Oct 8 2019, 2:40 PM · Storage manager
ardumont created D2084: swh.storage.filter: Add filtering storage implementation.
Oct 8 2019, 2:40 PM · Storage manager
vlorentz updated the task description for T1892: Cassandra as a storage backend.
Oct 8 2019, 11:42 AM · meta-task, Storage manager
vlorentz updated the task description for T1892: Cassandra as a storage backend.
Oct 8 2019, 11:32 AM · meta-task, Storage manager

Oct 1 2019

ardumont added a comment to T2019: race condition during concurrent loading of the same objects from multiple origins.

tagged and deployed (loaders are mostly restarted or in progress)

Oct 1 2019, 1:05 PM · Storage manager
zack renamed T2019: race condition during concurrent loading of the same objects from multiple origins from Investigate hash collision error to race condition during concurrent loading of the same objects from multiple origins.
Oct 1 2019, 10:58 AM · Storage manager

Sep 30 2019

ardumont added a comment to T2019: race condition during concurrent loading of the same objects from multiple origins.

Thanks!

Sep 30 2019, 4:17 PM · Storage manager
olasd added a comment to T2019: race condition during concurrent loading of the same objects from multiple origins.

This is a race condition that happens when two different workers are loading the exact same content in parallel transactions.

Sep 30 2019, 2:14 PM · Storage manager
olasd shifted T2019: race condition during concurrent loading of the same objects from multiple origins from the Restricted Space space to the S1 Public space.
Sep 30 2019, 1:32 PM · Storage manager
olasd closed T2018: origin reference in skipped_content is still an id as Resolved.

Should have been closed by rDSTOe2393243e07f ?

Sep 30 2019, 1:25 PM · Storage manager
ardumont triaged T2019: race condition during concurrent loading of the same objects from multiple origins as High priority.
Sep 30 2019, 10:49 AM · Storage manager
vlorentz added a project to T2018: origin reference in skipped_content is still an id: Storage manager.
Sep 30 2019, 10:39 AM · Storage manager

Sep 3 2019

vlorentz claimed T1910: Redesign origin search using a dedicated component (swh-search).
Sep 3 2019, 1:23 PM · Archive search, Storage manager

Aug 21 2019

vlorentz added a parent task for T1910: Redesign origin search using a dedicated component (swh-search): T1523: Search tools on metadata.
Aug 21 2019, 2:34 PM · Archive search, Storage manager

Aug 20 2019

vlorentz added a subtask for T1617: Experiment with generation numbers to improve revisions walk performance: T1957: Handling missing DAG nodes.
Aug 20 2019, 9:59 AM · Storage manager

Aug 5 2019

vlorentz updated the task description for T1892: Cassandra as a storage backend.
Aug 5 2019, 3:22 PM · meta-task, Storage manager
vlorentz updated the task description for T1892: Cassandra as a storage backend.
Aug 5 2019, 3:21 PM · meta-task, Storage manager

Jul 29 2019

anlambert closed T1934: vault timeout on cooking revision_gitfast for repositories with numerous number of revisions as Resolved by committing rDVAU9f673806699a: cookers.revision_*: Fix storage timeouts when fetching a large revision log.
Jul 29 2019, 11:17 AM · Storage manager, Vault

Jul 26 2019

anlambert added a comment to T1934: vault timeout on cooking revision_gitfast for repositories with numerous number of revisions.

The issue came from the fact that the vault tries to retrieve the whole revisions log in a single call to the storage API.

Jul 26 2019, 3:40 PM · Storage manager, Vault
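
One common way to avoid a timeout of the kind described above is to fetch a long log in bounded batches instead of in a single call. The sketch below shows that general pattern only; it is not the fix committed in rDVAU9f673806699a, and `fetch_batch` is a hypothetical callable standing in for whatever storage call returns a slice of the log.

```python
from typing import Callable, Iterator, List


def iter_in_batches(
    fetch_batch: Callable[[int, int], List[str]],
    batch_size: int = 1000,
) -> Iterator[str]:
    """Yield every item of a long log, requesting batch_size items at a time.

    fetch_batch(offset, limit) is an assumed interface: it returns at most
    `limit` items starting at `offset`, and an empty list when exhausted.
    """
    offset = 0
    while True:
        batch = fetch_batch(offset, batch_size)
        if not batch:
            return
        yield from batch
        offset += len(batch)


# Usage with an in-memory stand-in for the remote log.
fake_log = [f"rev-{i}" for i in range(2500)]
revisions = list(iter_in_batches(lambda off, lim: fake_log[off:off + lim]))
```

Each individual request stays small, so no single call has to return the whole history before a timeout expires.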
ardumont added projects to T1934: vault timeout on cooking revision_gitfast for repositories with numerous number of revisions: Vault, Storage manager.
Jul 26 2019, 3:39 PM · Storage manager, Vault

Jul 22 2019

twitu closed T1633: skipped_content_missing is not implemented by the in-memory storage as Resolved.
Jul 22 2019, 5:12 PM · Easy hack, Storage manager

Jul 21 2019

zack triaged T1926: FUSE filesystem to navigate the archive as Wishlist priority.
Jul 21 2019, 2:05 PM · Software Heritage filesystem

Jul 12 2019

vlorentz removed a subtask for T1892: Cassandra as a storage backend: T1891: Make 'type' an attribute of origin visits, not origins.
Jul 12 2019, 3:22 PM · meta-task, Storage manager
vlorentz added a subtask for T1892: Cassandra as a storage backend: T1891: Make 'type' an attribute of origin visits, not origins.
Jul 12 2019, 3:21 PM · meta-task, Storage manager

Jul 11 2019

vlorentz updated the task description for T1912: Support origin pagination without origin ids.
Jul 11 2019, 4:48 PM · Web app, Storage manager
vlorentz updated the task description for T1912: Support origin pagination without origin ids.
Jul 11 2019, 4:48 PM · Web app, Storage manager
vlorentz added a parent task for T1912: Support origin pagination without origin ids: T1805: Public API v2.
Jul 11 2019, 4:46 PM · Web app, Storage manager
vlorentz triaged T1912: Support origin pagination without origin ids as Normal priority.
Jul 11 2019, 4:46 PM · Web app, Storage manager
vlorentz updated the task description for T1892: Cassandra as a storage backend.
Jul 11 2019, 4:00 PM · meta-task, Storage manager
vlorentz closed T1816: Make all clients of swh-storage use origin URL as identifier and use visit types instead of origin types, a subtask of T1892: Cassandra as a storage backend, as Resolved.
Jul 11 2019, 4:00 PM · meta-task, Storage manager
vlorentz closed T1816: Make all clients of swh-storage use origin URL as identifier and use visit types instead of origin types as Resolved.

Indeed, this task just inherited the tag from its parent task.

Jul 11 2019, 4:00 PM · Storage manager

Jul 10 2019

vlorentz added a project to T1731: Intrinsic identifiers for origins: Storage manager.
Jul 10 2019, 5:01 PM · Storage manager, Data Model
vlorentz reopened T1731: Intrinsic identifiers for origins, a subtask of T1892: Cassandra as a storage backend, as Open.
Jul 10 2019, 4:59 PM · meta-task, Storage manager
vlorentz closed T1731: Intrinsic identifiers for origins, a subtask of T1892: Cassandra as a storage backend, as Resolved.
Jul 10 2019, 4:59 PM · meta-task, Storage manager
vlorentz triaged T1910: Redesign origin search using a dedicated component (swh-search) as Normal priority.
Jul 10 2019, 4:56 PM · Archive search, Storage manager
vlorentz added a parent task for T1816: Make all clients of swh-storage use origin URL as identifier and use visit types instead of origin types: T1892: Cassandra as a storage backend.
Jul 10 2019, 3:07 PM · Storage manager
vlorentz added subtasks for T1892: Cassandra as a storage backend: T1816: Make all clients of swh-storage use origin URL as identifier and use visit types instead of origin types, T1731: Intrinsic identifiers for origins.
Jul 10 2019, 3:07 PM · meta-task, Storage manager

Jul 8 2019

vlorentz updated the task description for T1892: Cassandra as a storage backend.
Jul 8 2019, 3:29 PM · meta-task, Storage manager
vlorentz updated the task description for T1816: Make all clients of swh-storage use origin URL as identifier and use visit types instead of origin types.
Jul 8 2019, 1:51 PM · Storage manager
vlorentz updated the task description for T1892: Cassandra as a storage backend.
Jul 8 2019, 1:28 PM · meta-task, Storage manager
vlorentz updated the task description for T1892: Cassandra as a storage backend.
Jul 8 2019, 1:27 PM · meta-task, Storage manager
vlorentz renamed T1892: Cassandra as a storage backend from Cassandra storage backend (meta-task) to Cassandra as a storage backend (meta-task).
Jul 8 2019, 1:27 PM · meta-task, Storage manager
vlorentz triaged T1892: Cassandra as a storage backend as Normal priority.
Jul 8 2019, 1:25 PM · meta-task, Storage manager

Jul 5 2019

zack added a project to T1816: Make all clients of swh-storage use origin URL as identifier and use visit types instead of origin types: Storage manager.

(not strictly speaking a bug *in* swh-storage, but close enough…)

Jul 5 2019, 3:13 PM · Storage manager

Jun 25 2019

twitu closed T1527: Have comments on all columns of all databases as Resolved.
Jun 25 2019, 6:25 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer

Jun 24 2019

twitu updated the task description for T1527: Have comments on all columns of all databases.
Jun 24 2019, 6:28 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer

Jun 20 2019

twitu added a comment to T1527: Have comments on all columns of all databases.

D1582 has been pushed; the task can be closed.

Jun 20 2019, 10:26 AM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer

Jun 19 2019

olasd closed T1823: make DB/FS transactions nest properly as Resolved by committing rDOBJS67197802d5aa: pathslicing: Make sure data is flushed to disk before renaming the tempfile.
Jun 19 2019, 4:11 PM · Object storage, Storage manager
olasd added a revision to T1823: make DB/FS transactions nest properly: D1611: pathslicing: Make sure data is flushed to disk before renaming the tempfile.
Jun 19 2019, 1:46 PM · Object storage, Storage manager
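
The commit title above names a well-known pattern: write to a temporary file, flush it to disk, and only then rename it into place. The sketch below shows the general technique with the Python standard library; it is not the pathslicing code from D1611, and `atomic_write` is a hypothetical name.

```python
import os
import tempfile


def atomic_write(path: str, data: bytes) -> None:
    """Write data to path so readers never observe a partial file.

    Hypothetical helper illustrating flush-then-rename; not swh code.
    """
    directory = os.path.dirname(os.path.abspath(path))
    # Create the tempfile in the target directory so the final rename
    # stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(data)
            f.flush()             # push Python's buffer to the OS
            os.fsync(f.fileno())  # ask the OS to write it to disk
        # os.replace renames the file and overwrites any existing target.
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the tempfile if anything failed before the rename.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

Without the `fsync` step, a crash shortly after the rename could leave a correctly named file whose contents never reached the disk.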

Jun 18 2019

vlorentz placed T1343: Add type annotations to the in-mem storage up for grabs.
Jun 18 2019, 1:28 PM · Storage manager
zack triaged T1823: make DB/FS transactions nest properly as High priority.
Jun 18 2019, 12:38 PM · Object storage, Storage manager

Jun 17 2019

ardumont closed D1589: storage-db: Fixing migration scripts 135-136.
Jun 17 2019, 10:12 AM · Data Model, Storage manager
vlorentz accepted D1589: storage-db: Fixing migration scripts 135-136.
Jun 17 2019, 10:05 AM · Data Model, Storage manager

Jun 15 2019

twitu added a comment to T1527: Have comments on all columns of all databases.

All columns are commented in swh-scheduler, awaiting review.
Some columns for swh-storage required a small discussion to frame appropriate comments.

Jun 15 2019, 5:22 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
swh-public-ci added a comment to D1589: storage-db: Fixing migration scripts 135-136.

Build is green
See https://jenkins.softwareheritage.org/job/DSTO/job/tox/493/ for more details.

Jun 15 2019, 8:46 AM · Data Model, Storage manager
ardumont added inline comments to D1589: storage-db: Fixing migration scripts 135-136.
Jun 15 2019, 8:45 AM · Data Model, Storage manager
ardumont added projects to D1589: storage-db: Fixing migration scripts 135-136: Storage manager, Data Model.
Jun 15 2019, 8:42 AM · Data Model, Storage manager

Jun 14 2019

ardumont updated the task description for T1527: Have comments on all columns of all databases.
Jun 14 2019, 5:40 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
twitu added a comment to T1527: Have comments on all columns of all databases.

All columns are already commented in swh-indexer

Jun 14 2019, 5:18 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
twitu added a comment to T1527: Have comments on all columns of all databases.

Have added a few comments in D1582

Jun 14 2019, 8:30 AM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer

Jun 13 2019

ardumont added a comment to T1527: Have comments on all columns of all databases.

The latest upgrade is 136.sql while the version in 30-swh-schema.sql is 133. Should I name the next upgrade 137?

Jun 13 2019, 6:59 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
twitu added a comment to T1527: Have comments on all columns of all databases.

There seems to be an inconsistency between sql/upgrades and the latest SQL version in swh-storage. The latest upgrade is 136.sql while the version in 30-swh-schema.sql is 133. Should I name the next upgrade 137?

Jun 13 2019, 6:54 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
ardumont added a project to T1527: Have comments on all columns of all databases: Easy hack.
Jun 13 2019, 12:33 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
ardumont updated the task description for T1527: Have comments on all columns of all databases.
Jun 13 2019, 12:31 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
twitu added a comment to T1527: Have comments on all columns of all databases.

Is there anything left to be done to close the task?

Jun 13 2019, 12:09 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer

Jun 12 2019

twitu added a comment to T1527: Have comments on all columns of all databases.

The modules swh-scheduler, swh-indexer, and swh-storage all seem to have column comments written in 30-swh-schema.sql.

Jun 12 2019, 7:44 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
twitu added a comment to T1527: Have comments on all columns of all databases.

Can you provide a few more details so I can work on this? Maybe which packages will be affected and what is expected in the comments.

Jun 12 2019, 6:21 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer

May 25 2019

zack added a project to T1527: Have comments on all columns of all databases: Documentation.
May 25 2019, 5:30 PM · Easy hack, Documentation, Storage manager, Scheduling utilities, Indexer
zack renamed T1377: in-memory storage: compute all counters from in-memory storage: compute all counters. to in-memory storage: compute all counters.
May 25 2019, 5:10 PM · Easy hack, Storage manager
zack closed T41: add actual content to swh-storage-testdata as Invalid.

swh-storage-testdata is gone, closing

May 25 2019, 5:08 PM · Storage manager
zack closed T523: Figure out what to do with corrupted copies detected by the archiver, a subtask of T240: content archiver, as Invalid.
May 25 2019, 4:58 PM · Storage manager
zack closed T523: Figure out what to do with corrupted copies detected by the archiver as Invalid.

the archiver is gone, closing

May 25 2019, 4:58 PM · Storage manager

May 24 2019

faux added a comment to T1446: Add support for slices in Storage.content_get.

Still open?

May 24 2019, 1:28 PM · Storage manager

May 21 2019

faux closed T1349: Storage.content_find should return all matches, not just one. as Resolved.
May 21 2019, 9:53 AM · Easy hack, Storage manager

May 6 2019

anlambert closed T1677: Revisions walker: Add notification for truncated/shallow histories as Resolved by committing rDSTO2e7051659d3b: algos.revisions_walker: Add methods to query produced history state.
May 6 2019, 2:13 PM · Storage manager

Apr 23 2019

vlorentz triaged T1684: Race condition on person insertion in pgsql storage as Low priority.
Apr 23 2019, 9:38 AM · Storage manager

Apr 19 2019

olasd added a comment to T1675: Fix message and error for artifact history view bug.

The fix has been deployed to moma.

Apr 19 2019, 9:05 AM · Storage manager, Web app

Apr 18 2019

anlambert closed T1675: Fix message and error for artifact history view bug as Resolved by committing rDSTO6510b5ec4d83: algos.revisions_walker: Handle truncated/shallow histories.
Apr 18 2019, 5:55 PM · Storage manager, Web app
anlambert triaged T1677: Revisions walker: Add notification for truncated/shallow histories as Normal priority.
Apr 18 2019, 5:24 PM · Storage manager
anlambert added a project to T1675: Fix message and error for artifact history view bug: Storage manager.
Apr 18 2019, 4:17 PM · Storage manager, Web app

Apr 16 2019

faux added a revision to T1349: Storage.content_find should return all matches, not just one.: D1420: Made changes to adapt it to new content_find return type.
Apr 16 2019, 2:27 PM · Easy hack, Storage manager

Apr 12 2019

ardumont retitled D1345: swh.journal: Add backfiller implementation from swh.journal: Bootstrap backfiller to swh.journal: Add backfiller implementation.
Apr 12 2019, 3:50 PM · Storage manager, Journal
ardumont closed D1345: swh.journal: Add backfiller implementation.
Apr 12 2019, 2:21 PM · Storage manager, Journal
swh-public-ci added a comment to D1345: swh.journal: Add backfiller implementation.

Build is green
See https://jenkins.softwareheritage.org/job/DJNL/job/tox/145/ for more details.

Apr 12 2019, 2:14 PM · Storage manager, Journal
ardumont updated the diff for D1345: swh.journal: Add backfiller implementation.

Rebase to latest master and branch diff to master

Apr 12 2019, 2:10 PM · Storage manager, Journal
olasd accepted D1345: swh.journal: Add backfiller implementation.
Apr 12 2019, 2:08 PM · Storage manager, Journal