Page MenuHomeSoftware Heritage
Feed Advanced Search

Oct 16 2018

olasd added a comment to T830: Remove tables occurrence and occurrence_history.

Oct 16 2018, 12:21 PM · Storage manager, Archive content
olasd closed T830: Remove tables occurrence and occurrence_history as Resolved by committing rDSTO435ebcdbf412: Drop table occurrence_history.
Oct 16 2018, 12:20 PM · Storage manager, Archive content

Oct 15 2018

olasd added a revision to T830: Remove tables occurrence and occurrence_history: D535: Drop table occurrence_history.
Oct 15 2018, 4:22 PM · Storage manager, Archive content
zack claimed T1269: swh-storage: `make distclean` in docs/ deletes committed files.
Oct 15 2018, 11:22 AM · Documentation, Storage manager
seirl triaged T1269: swh-storage: `make distclean` in docs/ deletes committed files as Normal priority.
Oct 15 2018, 10:54 AM · Documentation, Storage manager
seirl created T1269: swh-storage: `make distclean` in docs/ deletes committed files.
Oct 15 2018, 10:53 AM · Documentation, Storage manager

Oct 12 2018

ardumont added a comment to T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package].

well, it will even be simpler than that: let's get rid of those "dumps" and fix the db test fixture to build the db on the fly from the sql files found in the swh-<name>/sql directory.

Oct 12 2018, 5:21 PM · Storage manager
douardda added a comment to T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package].

well, it will even be simpler than that: let's get rid of those "dumps" and fix the db test fixture to build the db on the fly from the sql files found in the swh-<name>/sql directory.

Oct 12 2018, 2:28 PM · Storage manager
zack added a comment to T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package].

yes I'm aware of this logic in the makefile. Not sure yet what's the best to do with this. Move it up in swh-development?

Oct 12 2018, 2:07 PM · Storage manager
douardda added a comment to T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package].

Its main use is the sql generation for our multiple modules using db in their internals (storage, indexer, archiver, scheduler, scheduler-updater)...
See the root Makefile of that repository.

Oct 12 2018, 1:50 PM · Storage manager
ardumont added a comment to T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package].

Its main use is the sql generation for our multiple modules using db in their internals (storage, indexer, archiver, scheduler, scheduler-updater)...
See the root Makefile of that repository.

Oct 12 2018, 1:28 PM · Storage manager
ardumont added a comment to T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package].

For your information, we recently moved out of this repository the loader-svn and loader-tar's testdata.
So yes, this sounds reasonable.

Oct 12 2018, 1:24 PM · Storage manager
douardda renamed T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package] from make swh-storage-testdata a python package to kill swh-storage-testdata [was: make swh-storage-testdata a python package].
Oct 12 2018, 11:09 AM · Storage manager
douardda added a comment to T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package].

in fact, let's kill this blob repo; in there we have:

Oct 12 2018, 11:05 AM · Storage manager
zack added a comment to T757: Memory leak in swh.storage.api.server.

potentially useful related tooling: http://carsonip.me/posts/debugging-memory-usage-python-tracemalloc/

Oct 12 2018, 10:00 AM · Storage manager

Oct 11 2018

zack edited projects for T1265: kill swh-storage-testdata [was: make swh-storage-testdata a python package], added: Storage manager; removed System administration.
Oct 11 2018, 8:26 PM · Storage manager

Oct 10 2018

olasd closed T838: SQL storage: drop the entity tables as Resolved by committing rDSTO65e6b69eddc5: Drop unused entity tables.
Oct 10 2018, 5:00 PM · Storage manager, Archive content
olasd added a revision to T838: SQL storage: drop the entity tables: D509: Drop unused entity tables.
Oct 10 2018, 4:25 PM · Storage manager, Archive content

Oct 4 2018

zack added a comment to T838: SQL storage: drop the entity tables.

agreed, they should be removed (I've updated the task title accordingly)

Oct 4 2018, 12:19 PM · Storage manager, Archive content
zack renamed T838: SQL storage: drop the entity tables from Decide what to do with the entity tables to SQL storage: drop the entity tables.
Oct 4 2018, 12:18 PM · Storage manager, Archive content

Oct 3 2018

olasd added a comment to T830: Remove tables occurrence and occurrence_history.

All the old visits have now been migrated to snapshots.

Oct 3 2018, 3:04 PM · Storage manager, Archive content
olasd claimed T830: Remove tables occurrence and occurrence_history.

There's some work in progress by @zack in branch feature/schema-revamp of rDSTO, I'll pick that up to drive it to completion.

Oct 3 2018, 12:18 PM · Storage manager, Archive content
olasd renamed T830: Remove tables occurrence and occurrence_history from Remove table occurrence to Remove tables occurrence and occurrence_history.
Oct 3 2018, 12:05 PM · Storage manager, Archive content

Oct 2 2018

zack added a comment to T1194: storage API server: nicer landing page.

This is now done.

Oct 2 2018, 10:55 AM · Storage manager
zack closed T1194: storage API server: nicer landing page as Resolved by committing rDSTOf5dd8ac755b5: nicer landing page, pointing to homepage and API doc.
Oct 2 2018, 10:55 AM · Storage manager

Sep 28 2018

zack renamed T818: indexer DB should not use bytea for mimetype and encoding columns from Indexer mimetype: Use text instead of bytea for mimetype and encoding columns? to indexer DB should not use bytea for mimetype and encoding columns.
Sep 28 2018, 10:34 AM · Storage manager, Indexer

Sep 18 2018

zack added a comment to T1117: Origin search is *slow* when you look for very common words.

Thanks for your analysis.

Sep 18 2018, 1:19 PM · Web app, Storage manager
olasd claimed T1117: Origin search is *slow* when you look for very common words.

The main issue with the current query is that the id sort happens after we had to fetch all the indexed results, even if we're only presenting a few of these results to the user. When searching gitbla, the bitmap index scan will find all entries containing the trigram git (so, almost everything), then recheck them for exact match, then finally sort the results.

Sep 18 2018, 10:56 AM · Web app, Storage manager

Sep 17 2018

zack closed T918: document DB encoding requirements as Resolved.

db-init has been updated a while ago to force UTF8 encoding, and it is now the documented way to initialize the DB, so there is no need to further specify DB encoding requirements elsewhere

Sep 17 2018, 10:46 AM · Documentation, Storage manager
zack added a comment to T1194: storage API server: nicer landing page.
In T1194#22202, @olasd wrote:

However I'd be happy for the static default landing pages to tell people that, and to point to the (static) API documentation of the respective components.

Sep 17 2018, 10:15 AM · Storage manager

Sep 14 2018

olasd added a comment to T1194: storage API server: nicer landing page.

The RPC client/server components have been made as trivial as possible and considering their presence in the critical path of everything we do, I think we should keep them that way. They're not meant to conform to REST or any other standards, and are not supposed to support browsing.

Sep 14 2018, 4:51 PM · Storage manager
zack triaged T1194: storage API server: nicer landing page as Low priority.
Sep 14 2018, 4:26 PM · Storage manager

Sep 5 2018

ardumont added a comment to T1117: Origin search is *slow* when you look for very common words.

transforming the db.py function call into a call to a query server side.

Sep 5 2018, 11:28 AM · Web app, Storage manager
ardumont added a comment to T1117: Origin search is *slow* when you look for very common words.

How about just *not* sort by origin ID then?

Sep 5 2018, 11:10 AM · Web app, Storage manager
ardumont added a comment to T1117: Origin search is *slow* when you look for very common words.

How about just *not* sort by origin ID then?

Sep 5 2018, 10:21 AM · Web app, Storage manager
zack added a project to T1117: Origin search is *slow* when you look for very common words: Web app.
Sep 5 2018, 9:10 AM · Web app, Storage manager
zack updated subscribers of T1117: Origin search is *slow* when you look for very common words.

(pinging this issue, because it's 2018, and it really looks bad that we're apparently not capable of quickly returning results in our main search :-))

Sep 5 2018, 9:09 AM · Web app, Storage manager

Jun 26 2018

olasd triaged T1117: Origin search is *slow* when you look for very common words as High priority.
Jun 26 2018, 11:30 AM · Web app, Storage manager

Jun 19 2018

zack raised the priority of T757: Memory leak in swh.storage.api.server from Normal to High.
Jun 19 2018, 2:34 PM · Storage manager

Mar 16 2018

olasd added a subtask for T829: Remove duplication between fetch_history and origin_visit: T994: origin_visit: distinguish "fetch date" and "injection date".
Mar 16 2018, 4:56 PM · Storage manager, Archive content

Mar 2 2018

rdicosmo updated the task description for T980: Identify and fix releases that are stored as revisions.
Mar 2 2018, 5:19 PM · Archive content
rdicosmo created T980: Identify and fix releases that are stored as revisions.
Mar 2 2018, 5:17 PM · Archive content

Feb 28 2018

olasd closed T760: swh api clients often fail with a BadStatusLine exception as Resolved.

So, that was a beast of a puppet refactoring, but the end result is: all our RPC servers are now nicely tucked behind an instance of nginx.

Feb 28 2018, 7:06 PM · Vault, Object storage, Storage manager

Feb 19 2018

olasd closed T962: Counter updates are long queries that bog down the database as Resolved by committing rDSTO6b80cff86bec: Add a new table for "bucketed" object counts.
Feb 19 2018, 7:30 PM · Storage manager

Feb 16 2018

seirl added a comment to T760: swh api clients often fail with a BadStatusLine exception.

Here is a pcap of the issue observed while a Vault cooker was calling set_progress on the Vault backend. The BadStatusLine packet that only contains "\r\n" is packet 306.

Feb 16 2018, 6:26 PM · Vault, Object storage, Storage manager
olasd added a project to T760: swh api clients often fail with a BadStatusLine exception: Vault.

The BadStatusLine error reared back its ugly head on the vault backend. It was reproducible. Putting a nginx in front of the vault backend fixed it. I guess it's time to puppetize the configuration of a http frontend to all our RPC servers.

Feb 16 2018, 6:17 PM · Vault, Object storage, Storage manager

Feb 13 2018

olasd created T962: Counter updates are long queries that bog down the database.
Feb 13 2018, 7:16 PM · Storage manager

Jan 10 2018

olasd added a comment to T918: document DB encoding requirements.

To use the full features of jsonb, we indeed need the database encoding to be UTF8.

Jan 10 2018, 12:05 PM · Documentation, Storage manager

Jan 7 2018

zack created T918: document DB encoding requirements.
Jan 7 2018, 8:43 AM · Documentation, Storage manager

Dec 15 2017

olasd added a comment to T904: Factor-out common DB wrappers in swh.core.

You can add swh.indexer.storage.db.Db to your list.

Dec 15 2017, 6:11 PM · Core & foundations, Storage manager, Scheduling utilities, Vault
ardumont updated the task description for T904: Factor-out common DB wrappers in swh.core.
Dec 15 2017, 5:40 PM · Core & foundations, Storage manager, Scheduling utilities, Vault
seirl created T904: Factor-out common DB wrappers in swh.core.
Dec 15 2017, 5:28 PM · Core & foundations, Storage manager, Scheduling utilities, Vault
olasd closed T567: adapt SQL storage for repository snapshot objects as Resolved by committing rDSTOf4ea97cb1c1e: Add snapshot models.
Dec 15 2017, 3:37 PM · Storage manager

Dec 14 2017

ardumont edited P203 Clean up wrong googlecode origins (googlecode instead of eclipselabs or apache-extras) - T897 related.
Dec 14 2017, 3:10 PM · Origin-GoogleCode, Storage manager
ardumont edited P203 Clean up wrong googlecode origins (googlecode instead of eclipselabs or apache-extras) - T897 related.
Dec 14 2017, 3:09 PM · Origin-GoogleCode, Storage manager
ardumont created P203 Clean up wrong googlecode origins (googlecode instead of eclipselabs or apache-extras) - T897 related.
Dec 14 2017, 2:55 PM · Origin-GoogleCode, Storage manager
ardumont edited P202 Clean up wrong origins (url is filled with the size instead of the actual origin url) - T896 related.
Dec 14 2017, 2:50 PM · Origin-GoogleCode, Storage manager

Dec 13 2017

ardumont added a project to P202 Clean up wrong origins (url is filled with the size instead of the actual origin url) - T896 related: Origin-GoogleCode.
Dec 13 2017, 6:57 PM · Origin-GoogleCode, Storage manager
ardumont created P202 Clean up wrong origins (url is filled with the size instead of the actual origin url) - T896 related.
Dec 13 2017, 6:55 PM · Origin-GoogleCode, Storage manager

Dec 11 2017

ardumont updated the task description for T867: Separate indexers' database model to its own database - meta task.
Dec 11 2017, 3:41 PM · Storage manager, Indexer

Dec 7 2017

ardumont added a project to T494: swh-journal: archiver-client: Keep archiver table in sync with new contents: Journal.
Dec 7 2017, 10:14 PM · Journal, Restricted Project, Storage manager
ardumont renamed T494: swh-journal: archiver-client: Keep archiver table in sync with new contents from Keep archiver table content_archive in sync with new contents to Keep archiver table in sync with new contents.
Dec 7 2017, 10:13 PM · Journal, Restricted Project, Storage manager
ardumont renamed T494: swh-journal: archiver-client: Keep archiver table in sync with new contents from Add a way to update content_archive table when a new content is added to Keep archiver table content_archive in sync with new contents.
Dec 7 2017, 10:13 PM · Journal, Restricted Project, Storage manager
ardumont closed T867: Separate indexers' database model to its own database - meta task as Resolved.
Dec 7 2017, 9:53 PM · Storage manager, Indexer
ardumont closed T873: Clean up indexer's data reference in softwareheritage's main db, a subtask of T867: Separate indexers' database model to its own database - meta task, as Resolved.
Dec 7 2017, 9:52 PM · Storage manager, Indexer
ardumont closed T873: Clean up indexer's data reference in softwareheritage's main db as Resolved.
Dec 7 2017, 9:52 PM · Storage manager, Indexer
ardumont added a comment to T873: Clean up indexer's data reference in softwareheritage's main db.

I'll do this tomorrow when i'm fresh.

Dec 7 2017, 9:39 PM · Storage manager, Indexer
ardumont added a comment to T873: Clean up indexer's data reference in softwareheritage's main db.

I'll do this tomorrow when i'm fresh.

Dec 7 2017, 6:00 PM · Storage manager, Indexer
ardumont created T873: Clean up indexer's data reference in softwareheritage's main db.
Dec 7 2017, 5:40 PM · Storage manager, Indexer
ardumont closed T871: Migrate swh-storage api functions relative to indexers to swh-indexer as Resolved.
Dec 7 2017, 5:28 PM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont closed T871: Migrate swh-storage api functions relative to indexers to swh-indexer, a subtask of T867: Separate indexers' database model to its own database - meta task, as Resolved.
Dec 7 2017, 5:28 PM · Storage manager, Indexer
ardumont closed T872: Deploy and restart indexers as Resolved.
Dec 7 2017, 5:28 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont closed T872: Deploy and restart indexers, a subtask of T871: Migrate swh-storage api functions relative to indexers to swh-indexer, as Resolved.
Dec 7 2017, 5:28 PM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 5:24 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 5:10 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 5:10 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont changed the status of T872: Deploy and restart indexers from Open to Work in Progress.
Dec 7 2017, 4:41 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont changed the status of T872: Deploy and restart indexers, a subtask of T871: Migrate swh-storage api functions relative to indexers to swh-indexer, from Open to Work in Progress.
Dec 7 2017, 4:41 PM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont edited parent tasks for T872: Deploy and restart indexers, added: T871: Migrate swh-storage api functions relative to indexers to swh-indexer; removed: T867: Separate indexers' database model to its own database - meta task.
Dec 7 2017, 4:40 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont removed a subtask for T867: Separate indexers' database model to its own database - meta task: T872: Deploy and restart indexers.
Dec 7 2017, 4:40 PM · Storage manager, Indexer
ardumont added a subtask for T871: Migrate swh-storage api functions relative to indexers to swh-indexer: T872: Deploy and restart indexers.
Dec 7 2017, 4:40 PM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 4:40 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont updated the task description for T871: Migrate swh-storage api functions relative to indexers to swh-indexer.
Dec 7 2017, 4:39 PM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont added projects to T872: Deploy and restart indexers: Core Loader, SWORD deposit.
Dec 7 2017, 3:39 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 3:07 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 1:57 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 12:29 PM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont updated the task description for T871: Migrate swh-storage api functions relative to indexers to swh-indexer.
Dec 7 2017, 12:28 PM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont updated the task description for T871: Migrate swh-storage api functions relative to indexers to swh-indexer.
Dec 7 2017, 11:26 AM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont updated the task description for T871: Migrate swh-storage api functions relative to indexers to swh-indexer.
Dec 7 2017, 10:38 AM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont updated the task description for T871: Migrate swh-storage api functions relative to indexers to swh-indexer.
Dec 7 2017, 10:38 AM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont added projects to T871: Migrate swh-storage api functions relative to indexers to swh-indexer: Core Loader, SWORD deposit.
Dec 7 2017, 10:35 AM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 9:55 AM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont updated the task description for T872: Deploy and restart indexers.
Dec 7 2017, 9:53 AM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont added projects to T872: Deploy and restart indexers: Web app, Storage manager.
Dec 7 2017, 9:53 AM · SWORD deposit, Core Loader, Storage manager, Web app, Puppet recipes, Indexer
ardumont renamed T867: Separate indexers' database model to its own database - meta task from Separate indexers' database model to its own database to Separate indexers' database model to its own database - meta task.
Dec 7 2017, 9:43 AM · Storage manager, Indexer
ardumont created T871: Migrate swh-storage api functions relative to indexers to swh-indexer.
Dec 7 2017, 9:42 AM · SWORD deposit, Core Loader, Web app, Development environment, Storage manager, Indexer

Dec 6 2017

ardumont updated the task description for T867: Separate indexers' database model to its own database - meta task.
Dec 6 2017, 12:49 PM · Storage manager, Indexer

Dec 2 2017

ardumont added a comment to T494: swh-journal: archiver-client: Keep archiver table in sync with new contents.

Well, strike POC, i had something that worked at that time.

Dec 2 2017, 1:01 PM · Journal, Restricted Project, Storage manager
ardumont created T867: Separate indexers' database model to its own database - meta task.
Dec 2 2017, 12:40 PM · Storage manager, Indexer