Page MenuHomeSoftware Heritage
Feed Advanced Search

Jul 19 2016

ardumont claimed T412: Bootstrap archiver's database.
Jul 19 2016, 7:42 PM · Storage manager
ardumont renamed T412: Bootstrap archiver's database from Bootstrap database's archival tables to Bootstrap archiver's database.
Jul 19 2016, 4:00 PM · Storage manager
qcampos added a comment to T494: swh-journal: archiver-client: Keep archiver table in sync with new contents.

Yes, I think we need to split the archival task to separate the first run that ensure we have a copy of each content from the full archiver we will have more time to improve.

Jul 19 2016, 3:13 PM · Journal, Restricted Project, Storage manager
ardumont added a comment to T494: swh-journal: archiver-client: Keep archiver table in sync with new contents.

I'm not yet 100% sure that content_add is the place where we want to update the archive table. Another possibility, for instance, would be relying on the upcoming persistent log (T424) and some watcher for it that will update the archiver table.

Jul 19 2016, 2:26 PM · Journal, Restricted Project, Storage manager
ardumont updated the task description for T494: swh-journal: archiver-client: Keep archiver table in sync with new contents.
Jul 19 2016, 2:14 PM · Journal, Restricted Project, Storage manager
ardumont added a comment to T412: Bootstrap archiver's database.

According to documentation, to defer a constraint, first said constraint must be deferrable (which it is not the default).

Jul 19 2016, 2:07 PM · Storage manager
ardumont added a comment to T412: Bootstrap archiver's database.

and the first part is done ^^

0:41:23 [15.8MiB/s]
Jul 19 2016, 1:35 PM · Storage manager
ardumont added a comment to T412: Bootstrap archiver's database.

Also in regards to db, softwareheritage-archiver has been created with the following schema.

Jul 19 2016, 12:30 PM · Storage manager

Jul 18 2016

ardumont added a comment to T412: Bootstrap archiver's database.

(forgot to add the Related keyword in commits... and already pushed so better luck next time...)

Jul 18 2016, 8:33 PM · Storage manager
zack added a comment to T494: swh-journal: archiver-client: Keep archiver table in sync with new contents.

I'm not yet 100% sure that content_add is the place where we want to update the archive table. Another possibility, for instance, would be relying on the upcoming persistent log (T424) and some watcher for it that will update the archiver table.

Jul 18 2016, 5:18 PM · Journal, Restricted Project, Storage manager
qcampos created T494: swh-journal: archiver-client: Keep archiver table in sync with new contents.
Jul 18 2016, 4:11 PM · Journal, Restricted Project, Storage manager

Jul 17 2016

ardumont added a comment to T412: Bootstrap archiver's database.

For info, i ran another failed attempt yesterday (Saturday the 16th of July 2016).
This stopped before finishing.

Jul 17 2016, 2:16 PM · Storage manager

Jul 16 2016

ardumont changed the status of T412: Bootstrap archiver's database, a subtask of T240: content archiver, from Open to Work in Progress.
Jul 16 2016, 10:21 AM · Storage manager
ardumont changed the status of T412: Bootstrap archiver's database from Open to Work in Progress.
Jul 16 2016, 10:21 AM · Storage manager
ardumont changed the status of T412: Bootstrap archiver's database, a subtask of T482: First swh-storage-archiver run to catch up uffizi, from Open to Work in Progress.
Jul 16 2016, 10:21 AM · Storage manager
ardumont added a comment to T412: Bootstrap archiver's database.

Moving softwareheritage-log from the ssd to hdd (T487), we reclaimed 1.1T of data on the ssd (which were the initial blocking point).
So now, we can try to inject back the archiver's bootstrap data to finally... run it ^^

Jul 16 2016, 10:20 AM · Storage manager
ardumont closed T487: Moving the log DB from the SSD-based DB to the spinning-drive one as Resolved.
Jul 16 2016, 10:09 AM · Storage manager
ardumont closed T487: Moving the log DB from the SSD-based DB to the spinning-drive one, a subtask of T240: content archiver, as Resolved.
Jul 16 2016, 10:09 AM · Storage manager
ardumont closed T487: Moving the log DB from the SSD-based DB to the spinning-drive one, a subtask of T486: DB storage cleanup, as Resolved.
Jul 16 2016, 10:09 AM · Storage manager

Jul 14 2016

ardumont changed the status of T489: Reduce current log configuration of various services from Open to Work in Progress.
Jul 14 2016, 12:33 PM · Storage manager
ardumont changed the status of T489: Reduce current log configuration of various services, a subtask of T486: DB storage cleanup, from Open to Work in Progress.
Jul 14 2016, 12:33 PM · Storage manager
ardumont added a comment to T489: Reduce current log configuration of various services.

It's partially done (worker01, worker02, worker03, worker08).
Some workers are stopped so those one need to be booted first,
I think they were stopped for the ram issue we had on louvre the 4th of July (to restart prado).

Jul 14 2016, 12:32 PM · Storage manager
ardumont added a comment to T489: Reduce current log configuration of various services.

Looks like this part is not yet puppetized

Jul 14 2016, 12:26 PM · Storage manager
ardumont added a comment to T489: Reduce current log configuration of various services.

For me, this sums up to change the line ExecStart in /etc/systemd/system/swh-worker.service in our multiple workers.

Jul 14 2016, 12:07 PM · Storage manager

Jul 13 2016

jbertran added a revision to T486: DB storage cleanup: D76: Remove dead code related to file upload (T468).
Jul 13 2016, 4:17 PM · Storage manager
ardumont updated the task description for T486: DB storage cleanup.
Jul 13 2016, 2:02 PM · Storage manager
ardumont added a comment to T486: DB storage cleanup.

But if you start looking into this (which I recommend you do), it'd be nice to fix all of it, while the context is clear.

Jul 13 2016, 11:39 AM · Storage manager
ardumont closed T488: Removing old verbose log entries (level > info), a subtask of T486: DB storage cleanup, as Resolved.
Jul 13 2016, 11:37 AM · Storage manager
ardumont closed T488: Removing old verbose log entries (level > info) as Resolved.
Jul 13 2016, 11:37 AM · Storage manager
ardumont changed the status of T487: Moving the log DB from the SSD-based DB to the spinning-drive one from Open to Work in Progress.
Jul 13 2016, 11:36 AM · Storage manager
ardumont changed the status of T487: Moving the log DB from the SSD-based DB to the spinning-drive one, a subtask of T240: content archiver, from Open to Work in Progress.
Jul 13 2016, 11:36 AM · Storage manager
ardumont changed the status of T487: Moving the log DB from the SSD-based DB to the spinning-drive one, a subtask of T486: DB storage cleanup, from Open to Work in Progress.
Jul 13 2016, 11:36 AM · Storage manager
ardumont updated the task description for T486: DB storage cleanup.
Jul 13 2016, 11:23 AM · Storage manager
ardumont removed a parent task for T486: DB storage cleanup: T240: content archiver.
Jul 13 2016, 10:55 AM · Storage manager
ardumont added a parent task for T487: Moving the log DB from the SSD-based DB to the spinning-drive one: T240: content archiver.
Jul 13 2016, 10:55 AM · Storage manager
ardumont edited subtasks for T240: content archiver, added: T487: Moving the log DB from the SSD-based DB to the spinning-drive one; removed: T486: DB storage cleanup.
Jul 13 2016, 10:55 AM · Storage manager
ardumont renamed T486: DB storage cleanup from db storage cleanup to DB storage cleanup.
Jul 13 2016, 10:55 AM · Storage manager
ardumont created T489: Reduce current log configuration of various services.
Jul 13 2016, 10:54 AM · Storage manager
ardumont created T488: Removing old verbose log entries (level > info).
Jul 13 2016, 10:53 AM · Storage manager
ardumont created T487: Moving the log DB from the SSD-based DB to the spinning-drive one.
Jul 13 2016, 10:52 AM · Storage manager
zack updated subscribers of T486: DB storage cleanup.

Thanks for opening this task, I agree this is the way to go. Once done we can easily deploy the archiver as it is supposed to work in the long-term, reducing the need of further migrations down the road.

Jul 13 2016, 9:01 AM · Storage manager

Jul 12 2016

ardumont updated subscribers of T412: Bootstrap archiver's database.
As we said on irc, the foreign key from content_archive.id to content.sha1 makes the creation of an archiver single db quite uneasy.
Jul 12 2016, 7:00 PM · Storage manager
ardumont created T486: DB storage cleanup.
Jul 12 2016, 6:42 PM · Storage manager
ardumont renamed T485: Synchronously catchup backup from uffizi to banco from Synchroneously catchup backup from uffizi to banco to Synchronously catchup backup from uffizi to banco.
Jul 12 2016, 6:30 PM · Object storage, Storage manager
ardumont added a comment to T485: Synchronously catchup backup from uffizi to banco.

Currently running in uffizi in a tmux session

Jul 12 2016, 1:07 PM · Object storage, Storage manager
ardumont created T485: Synchronously catchup backup from uffizi to banco.
Jul 12 2016, 1:06 PM · Object storage, Storage manager

Jul 11 2016

ardumont added a subtask for T412: Bootstrap archiver's database: Unknown Object (Maniphest Task).
Jul 11 2016, 2:27 PM · Storage manager
ardumont added a comment to T412: Bootstrap archiver's database.

'status' is of type archive_status which is already an enum. I guess that Postgres do the right thing with integers.

Jul 11 2016, 12:18 PM · Storage manager
qcampos added a comment to T412: Bootstrap archiver's database.
In T412#7853, @ardumont wrote:
  • status is also TEXT and could be replaced with an enum or something
Jul 11 2016, 12:04 PM · Storage manager
ardumont updated subscribers of T412: Bootstrap archiver's database.

This is a failure for now.
After multiple attempts in the week-end, there is not enough space on disk for the process to finish.

Jul 11 2016, 9:40 AM · Storage manager

Jul 9 2016

ardumont added a comment to T412: Bootstrap archiver's database.

Running on banco:

Jul 9 2016, 10:05 AM · Storage manager
ardumont closed T484: List banco's current sha1s for injection in archiver db as Invalid.
Jul 9 2016, 9:59 AM · Storage manager
ardumont closed T484: List banco's current sha1s for injection in archiver db, a subtask of T412: Bootstrap archiver's database, as Invalid.
Jul 9 2016, 9:59 AM · Storage manager
ardumont added a comment to T484: List banco's current sha1s for injection in archiver db.

Irrelevant since the file is in banco
/srv/storage/space/lists/content-id-by-ctime.after-T7.txt.gz

Jul 9 2016, 9:58 AM · Storage manager

Jul 8 2016

ardumont created T484: List banco's current sha1s for injection in archiver db.
Jul 8 2016, 4:11 PM · Storage manager
ardumont closed T481: Deploying archiver on uffizi, a subtask of T240: content archiver, as Resolved.
Jul 8 2016, 4:08 PM · Storage manager
ardumont closed T481: Deploying archiver on uffizi, a subtask of T482: First swh-storage-archiver run to catch up uffizi, as Resolved.
Jul 8 2016, 4:08 PM · Storage manager
ardumont closed T481: Deploying archiver on uffizi as Resolved.
Jul 8 2016, 4:08 PM · Storage manager
ardumont closed T405: Deploying objstorage API server as Resolved.
Jul 8 2016, 4:07 PM · System administration, Language-Puppet, Storage manager
ardumont closed T405: Deploying objstorage API server, a subtask of T240: content archiver, as Resolved.
Jul 8 2016, 4:07 PM · Storage manager

Jul 7 2016

ardumont added subtasks for T482: First swh-storage-archiver run to catch up uffizi: T481: Deploying archiver on uffizi, T412: Bootstrap archiver's database.
Jul 7 2016, 4:20 PM · Storage manager
ardumont added a parent task for T412: Bootstrap archiver's database: T482: First swh-storage-archiver run to catch up uffizi.
Jul 7 2016, 4:20 PM · Storage manager
ardumont added a parent task for T481: Deploying archiver on uffizi: T482: First swh-storage-archiver run to catch up uffizi.
Jul 7 2016, 4:20 PM · Storage manager
ardumont updated the task description for T482: First swh-storage-archiver run to catch up uffizi.
Jul 7 2016, 4:19 PM · Storage manager
ardumont created T482: First swh-storage-archiver run to catch up uffizi.
Jul 7 2016, 4:19 PM · Storage manager
ardumont added a subtask for T240: content archiver: T481: Deploying archiver on uffizi.
Jul 7 2016, 12:31 PM · Storage manager
ardumont added a parent task for T481: Deploying archiver on uffizi: T240: content archiver.
Jul 7 2016, 12:31 PM · Storage manager
ardumont removed a subtask for T481: Deploying archiver on uffizi: T240: content archiver.
Jul 7 2016, 12:31 PM · Storage manager
ardumont removed a parent task for T240: content archiver: T481: Deploying archiver on uffizi.
Jul 7 2016, 12:31 PM · Storage manager
ardumont added a parent task for T240: content archiver: T481: Deploying archiver on uffizi.
Jul 7 2016, 12:29 PM · Storage manager
ardumont added a subtask for T481: Deploying archiver on uffizi: T240: content archiver.
Jul 7 2016, 12:29 PM · Storage manager
ardumont raised the priority of T405: Deploying objstorage API server from Normal to High.
Jul 7 2016, 12:28 PM · System administration, Language-Puppet, Storage manager
ardumont created T481: Deploying archiver on uffizi.
Jul 7 2016, 12:28 PM · Storage manager

Jun 27 2016

qcampos closed T460: Extract the objstorage component from storage module to an independent module as Resolved by committing rDSTO92e6cd845775: Update debian packaging configuration files.
Jun 27 2016, 2:57 PM · Development environment, Object storage, Storage manager
qcampos added a revision to T460: Extract the objstorage component from storage module to an independent module: D67: Recreate DOBJS commit history.
Jun 27 2016, 1:37 PM · Development environment, Object storage, Storage manager

Jun 23 2016

qcampos renamed T460: Extract the objstorage component from storage module to an independent module from Extract the objstorage component from storage module to an independant module to Extract the objstorage component from storage module to an independent module.
Jun 23 2016, 4:31 PM · Development environment, Object storage, Storage manager
qcampos added a revision to T460: Extract the objstorage component from storage module to an independent module: D64: Extract the object storage to its own package and do the required modifications on the imports.
Jun 23 2016, 2:34 PM · Development environment, Object storage, Storage manager
qcampos added a revision to T460: Extract the objstorage component from storage module to an independent module: D63: Extract objstorage from swh.storage module and recreate its commit history..
Jun 23 2016, 12:44 PM · Development environment, Object storage, Storage manager

Jun 22 2016

qcampos added projects to T460: Extract the objstorage component from storage module to an independent module: Storage manager, Object storage, Development environment.
Jun 22 2016, 4:08 PM · Development environment, Object storage, Storage manager

Jun 21 2016

qcampos closed T459: Add an instantiation example to the MultiplexerObjStorage main docstring as Resolved by committing rDSTOa623ce836f2d: objstorage.multiplexer: add some use case example to the….
Jun 21 2016, 4:06 PM · Documentation, Storage manager
qcampos added a revision to T459: Add an instantiation example to the MultiplexerObjStorage main docstring: D57: objstorage.multiplexer: add some use case example to the MultiplexerObjStorage main docstring.
Jun 21 2016, 3:49 PM · Documentation, Storage manager
qcampos added a revision to T445: Create an object storage that demultiplex inputs to multiple object storages: D56: Add the new created packages into objstorage to debian packaging lists.
Jun 21 2016, 3:05 PM · Storage manager
qcampos created T459: Add an instantiation example to the MultiplexerObjStorage main docstring.
Jun 21 2016, 2:41 PM · Documentation, Storage manager
qcampos closed T445: Create an object storage that demultiplex inputs to multiple object storages as Resolved.

Closed by D53.

Jun 21 2016, 2:38 PM · Storage manager
ardumont closed T457: Add tests for revision_log_by, a subtask of T432: Expose revision_log_by in client and server, as Resolved.
Jun 21 2016, 11:25 AM · Storage manager
ardumont closed T457: Add tests for revision_log_by as Resolved by committing rDSTOcbaed54c97d9: Open (and fix) swh.storage.storage.revision_log_by.
Jun 21 2016, 11:25 AM · Storage manager
ardumont closed T432: Expose revision_log_by in client and server as Resolved by committing rDSTOcbaed54c97d9: Open (and fix) swh.storage.storage.revision_log_by.
Jun 21 2016, 11:25 AM · Storage manager

Jun 20 2016

ardumont added a comment to T432: Expose revision_log_by in client and server.

Well, this implementation seems to be some dead code since it's:

  • neither tested
  • nor correct (it's calling inexistant code db.revision_log_by does not exist)
Jun 20 2016, 11:30 PM · Storage manager
olasd created T457: Add tests for revision_log_by.
Jun 20 2016, 5:24 PM · Storage manager
qcampos added a revision to T432: Expose revision_log_by in client and server: D54: api: Storage method "revision_log_by" is now accessible via the http API.
Jun 20 2016, 3:46 PM · Storage manager
qcampos claimed T432: Expose revision_log_by in client and server.
Jun 20 2016, 3:37 PM · Storage manager
zack updated the task description for T454: object storage backend that can read from/write to S3.
Jun 20 2016, 10:03 AM · Storage manager
zack created T454: object storage backend that can read from/write to S3.
Jun 20 2016, 10:03 AM · Storage manager

Jun 16 2016

zack triaged T445: Create an object storage that demultiplex inputs to multiple object storages as Normal priority.
Jun 16 2016, 9:03 PM · Storage manager
qcampos added a revision to T445: Create an object storage that demultiplex inputs to multiple object storages: D53: Implement the object storage multiplexer.
Jun 16 2016, 4:49 PM · Storage manager
qcampos closed T433: Refactor objstorage module as Resolved by committing rDSTO68abde3f243d: archiver: change objstorage initialization into the archiver.
Jun 16 2016, 3:11 PM · Storage manager

Jun 15 2016

qcampos created T445: Create an object storage that demultiplex inputs to multiple object storages.
Jun 15 2016, 3:58 PM · Storage manager
qcampos added a project to T433: Refactor objstorage module: Storage manager.
Jun 15 2016, 1:46 PM · Storage manager

Jun 13 2016

olasd closed T436: Content archiver director tests failure as Invalid.

Gah. ._. I had some local changes in swh-scheduler and the tasks wouldn't be registered in celery. Sorry for the hassle...

Jun 13 2016, 4:36 PM · Storage manager
qcampos added a revision to T401: Content archiver - Asynchronous version: D46: Correct an error in the ArchiveWorker initialization.
Jun 13 2016, 4:00 PM · Restricted Project, Storage manager