
Jan 21 2016

zack placed T274: investigate slow uffizi read performances from the object store up for grabs.
Jan 21 2016, 10:24 PM
zack added a comment to T274: investigate slow uffizi read performances from the object store.

Here are some bonnie++ tests on both uffizi and banco. They are consistent with reads from the object storage being much slower on uffizi (by a factor of about 3) than on banco, but further investigation is needed.

Jan 21 2016, 10:21 PM
zack created T274: investigate slow uffizi read performances from the object store.
Jan 21 2016, 10:18 PM
zack changed the status of T7: backup: object storage — 2nd copy after first large batch import from Work in Progress to Open.

This is back on hold now, as we discovered that read performance on uffizi from the object store is not as good as it should be.

Jan 21 2016, 10:16 PM
olasd added a comment to T272: Fix double-encoded release.name entries.

Currently running

Jan 21 2016, 8:51 PM · Data Model, Storage manager
olasd added a comment to T75: Check integrity of directories, revisions, and releases.

I just noticed that empty messages with empty lines are stored as an empty bytea, whereas empty messages without the empty line are stored as NULL. So there's that.

Jan 21 2016, 8:40 PM · Archive content, Restricted Project
olasd added a comment to T75: Check integrity of directories, revisions, and releases.

Some example releases:

Jan 21 2016, 8:22 PM · Archive content, Restricted Project
olasd changed the status of T75: Check integrity of directories, revisions, and releases from Open to Work in Progress.

I have done some investigations on this in light of T272. Bottom line: not good: git is very proficient in the corner cases department.

Jan 21 2016, 7:41 PM · Archive content, Restricted Project
zack added a comment to T273: Darcs loader.

As a starting point, I've briefly discussed with Darcs developers how the push/pull protocol works in Darcs. Unfortunately the protocol doesn't seem to be documented anywhere. The relevant entry points in the code are:

Jan 21 2016, 5:17 PM · General
zack created T273: Darcs loader.
Jan 21 2016, 5:03 PM · General
olasd created T272: Fix double-encoded release.name entries.
Jan 21 2016, 4:38 PM · Data Model, Storage manager
ardumont added a subtask for T271: Update clients on impacts + upgrade respective package dependencies: T270: TEXT → BYTEA conversion for columns that represent VCS tags/branches.
Jan 21 2016, 4:33 PM · Git loader, Directory loader, Data Model, Web app
ardumont added a parent task for T270: TEXT → BYTEA conversion for columns that represent VCS tags/branches: T271: Update clients on impacts + upgrade respective package dependencies.
Jan 21 2016, 4:33 PM · Language-SQL, Storage manager
ardumont removed a subtask for T270: TEXT → BYTEA conversion for columns that represent VCS tags/branches: T271: Update clients on impacts + upgrade respective package dependencies.
Jan 21 2016, 4:32 PM · Language-SQL, Storage manager
ardumont removed a parent task for T271: Update clients on impacts + upgrade respective package dependencies: T270: TEXT → BYTEA conversion for columns that represent VCS tags/branches.
Jan 21 2016, 4:32 PM · Git loader, Directory loader, Data Model, Web app
ardumont created T271: Update clients on impacts + upgrade respective package dependencies.
Jan 21 2016, 4:30 PM · Git loader, Directory loader, Data Model, Web app
zack created T270: TEXT → BYTEA conversion for columns that represent VCS tags/branches.
Jan 21 2016, 4:24 PM · Language-SQL, Storage manager
zack renamed T268: ontology - document on the wiki the data model terminology from Ontology - Define common terminology to ontology - document on the wiki the data model terminology.
Jan 21 2016, 12:16 PM · General, Restricted Project
ardumont updated the task description for T268: ontology - document on the wiki the data model terminology.
Jan 21 2016, 12:16 PM · General, Restricted Project
ardumont renamed T268: ontology - document on the wiki the data model terminology from Ontology to Ontology - Define common terminology.
Jan 21 2016, 12:11 PM · General, Restricted Project
olasd added a comment to T72: occurrence history table: use an array of timestamps instead of intervals.

Our main query on occurrences is looking for occurrences that are

  • from a given origin
  • on a given branch (or all branches)
  • that are the newest, or the closest to a given timestamp.
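
The three criteria above can be sketched in memory as follows. This is only an illustration of the selection logic, not the actual swh-storage query: the `Occurrence` record, its field names, and `select_occurrences` are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Dict, Iterable, Optional


@dataclass(frozen=True)
class Occurrence:
    # Hypothetical record shape, for illustration only.
    origin: int
    branch: str
    visited: datetime
    target: str


def _is_better(candidate: Occurrence, current: Occurrence,
               at: Optional[datetime]) -> bool:
    """Prefer the newest occurrence, or the one closest to `at` if given."""
    if at is None:
        return candidate.visited > current.visited
    return abs(candidate.visited - at) < abs(current.visited - at)


def select_occurrences(occurrences: Iterable[Occurrence], origin: int,
                       branch: Optional[str] = None,
                       at: Optional[datetime] = None) -> Dict[str, Occurrence]:
    """Return one occurrence per branch for `origin`.

    With branch=None every branch of the origin is considered.
    """
    best: Dict[str, Occurrence] = {}
    for occ in occurrences:
        if occ.origin != origin:
            continue
        if branch is not None and occ.branch != branch:
            continue
        current = best.get(occ.branch)
        if current is None or _is_better(occ, current, at):
            best[occ.branch] = occ
    return best
```

Calling `select_occurrences(rows, origin=1)` keeps the newest occurrence of every branch, while passing `branch="master"` and `at=some_datetime` narrows the result to the single occurrence on that branch closest to the timestamp.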
Jan 21 2016, 12:08 PM · Storage manager
olasd claimed T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM
olasd closed T262: Upgrade postgresql to 9.5 as Resolved.

PostgreSQL has now been updated to 9.5 (and split into three clusters).

Jan 21 2016, 12:08 PM
ardumont added a comment to T51: smart, all-in-one git cloner/loader/ (+ dealing with updates too).

For information, the sample test_update.py has been adapted in swh-loader-git (https://forge.softwareheritage.org/diffusion/DLDG/browse/master/swh/loader/git/updater.py) to use swh-storage.

Jan 21 2016, 12:08 PM · Git cloner, Git loader
ardumont closed T269: Deal nicely with swh-storage's backend down time in clients as Resolved.
Jan 21 2016, 12:08 PM · Web app, Storage manager
olasd closed T267: Migrate databases to postgres 9.5 as Resolved.
Jan 21 2016, 12:08 PM
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM
ardumont changed the status of T51: smart, all-in-one git cloner/loader/ (+ dealing with updates too), a subtask of T66: clone and load fork GitHub repositories, from Open to Work in Progress.
Jan 21 2016, 12:08 PM · Restricted Project, General
ardumont changed the status of T51: smart, all-in-one git cloner/loader/ (+ dealing with updates too) from Open to Work in Progress.
Jan 21 2016, 12:08 PM · Git cloner, Git loader
ardumont renamed T51: smart, all-in-one git cloner/loader/ (+ dealing with updates too) from smart, all-in-one git cloner/loader to smart, all-in-one git cloner/loader/ (+ dealing with updates too).
Jan 21 2016, 12:08 PM · Git cloner, Git loader
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM
olasd closed T267: Migrate databases to postgres 9.5, a subtask of T262: Upgrade postgresql to 9.5, as Resolved.
Jan 21 2016, 12:08 PM
ardumont added a comment to T269: Deal nicely with swh-storage's backend down time in clients.

+ v0.0.21 deployed on archive

Jan 21 2016, 12:08 PM · Web app, Storage manager
ardumont added a comment to T269: Deal nicely with swh-storage's backend down time in clients.
  • 68a8283 Deal nicely with communication downtime with storage
  • 3afbd2d Deal more appropriately with storage error
Jan 21 2016, 12:08 PM · Web app, Storage manager
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM
ardumont created T269: Deal nicely with swh-storage's backend down time in clients.
Jan 21 2016, 12:08 PM · Web app, Storage manager
ardumont added a comment to T51: smart, all-in-one git cloner/loader/ (+ dealing with updates too).

Related but not limited to:
58903e5 * origin/master origin/HEAD Open occurrence_get(origin_id) to retrieve latest occurrences per origin
bc23eb9 * sql/upgrades/043: add 042→043 upgrade script
d05afde * revision_log from multiple root revisions
3a40f00 * sql/upgrades/042: add 041→042 upgrade script
f54fd8d * Open release_get_by to retrieve a release by origin.
5dc4244 * revision_get_by: branch name filtering is optional
7e623c8 * sql/upgrades/040: add 040→041 upgrade script
7e2dcbc * Open directory_get to retrieve information on directory by id

Jan 21 2016, 12:08 PM · Git cloner, Git loader
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM
ardumont created T268: ontology - document on the wiki the data model terminology.
Jan 21 2016, 12:08 PM · General, Restricted Project
olasd closed T266: Setup pgbouncer on prado as Resolved.

pgbouncer is now listening on port 5432, and postgres 9.4 on port 5439.

Jan 21 2016, 12:08 PM
olasd added a comment to T267: Migrate databases to postgres 9.5.

Database cluster initialization and credentials sync (-g: dump only tablespaces and users):
pg_dumpall -g -p <old db port> | psql -p <new db port>

Jan 21 2016, 12:08 PM
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM
olasd closed T266: Setup pgbouncer on prado, a subtask of T262: Upgrade postgresql to 9.5, as Resolved.
Jan 21 2016, 12:08 PM
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 21 2016, 12:08 PM

Jan 18 2016

olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 18 2016, 10:58 PM
olasd updated the task description for T267: Migrate databases to postgres 9.5.
Jan 18 2016, 6:17 PM
olasd changed the status of T267: Migrate databases to postgres 9.5 from Open to Work in Progress.
Jan 18 2016, 5:59 PM
olasd changed the status of T267: Migrate databases to postgres 9.5, a subtask of T262: Upgrade postgresql to 9.5, from Open to Work in Progress.
Jan 18 2016, 5:59 PM
olasd created T267: Migrate databases to postgres 9.5.
Jan 18 2016, 5:59 PM
ardumont added a comment to T266: Setup pgbouncer on prado.

With your changes regarding secure connection:

Jan 18 2016, 4:59 PM
olasd added a comment to T266: Setup pgbouncer on prado.

Added opportunistic TLS for client and server connections:

Jan 18 2016, 4:57 PM
ardumont added a comment to T266: Setup pgbouncer on prado.

Configuration changed in ~/.pg_service.conf and ~/.pg_pass (for the port)

Jan 18 2016, 4:57 PM
olasd changed the status of T266: Setup pgbouncer on prado, a subtask of T262: Upgrade postgresql to 9.5, from Open to Work in Progress.
Jan 18 2016, 4:37 PM
olasd changed the status of T266: Setup pgbouncer on prado from Open to Work in Progress.

pgbouncer has been set up on prado (port 6432) and should proxy all connections to postgres 9.4.

Jan 18 2016, 4:37 PM
olasd created T266: Setup pgbouncer on prado.
Jan 18 2016, 4:37 PM
zack changed the status of T7: backup: object storage — 2nd copy after first large batch import from Open to Work in Progress.
Jan 18 2016, 4:23 PM
zack added a comment to T7: backup: object storage — 2nd copy after first large batch import.

Now that the first batch import (github + snapshot.debian.org + gnu.org) is done and we won't be importing other sources for a while, a full object store backup from uffizi to banco has started.

Jan 18 2016, 4:22 PM
ardumont closed T163: Add entry points to create / request entities in swh.storage as Resolved.
Jan 18 2016, 4:09 PM · Storage manager
ardumont closed T163: Add entry points to create / request entities in swh.storage, a subtask of T164: Port GitHub lister to swh.storage instead of sqlalchemy, as Resolved.
Jan 18 2016, 4:09 PM · GitHub lister
ardumont closed T166: functional tests for the web UI, with actual data as Resolved.
Jan 18 2016, 4:08 PM · Web app
zack claimed T7: backup: object storage — 2nd copy after first large batch import.
Jan 18 2016, 3:25 PM
zack renamed T7: backup: object storage — 2nd copy after first large batch import from backup: object storage (AKA file content) to backup: object storage — 2nd copy after first large batch import.
Jan 18 2016, 3:25 PM
olasd closed T265: Increase storage space on uffizi as Resolved.
louvre# for i in `seq 0 9` a b c d e f; do lvextend -L +5T vg-data/$i; done
Jan 18 2016, 3:18 PM
olasd created T265: Increase storage space on uffizi.
Jan 18 2016, 3:12 PM
ardumont created T264: directory_get: Retrieve information on a directory per hash..
Jan 18 2016, 2:44 PM · Web app, Storage manager
ardumont added a comment to T264: directory_get: Retrieve information on a directory per hash..

storage:

  • 7e623c8 * master origin/master origin/HEAD sql/upgrades/040: add 040→041 upgrade script
  • 7e2dcbc * Open directory_get to retrieve information on directory by id
  • ac380c9 * Rename directory_get to directory_ls
Jan 18 2016, 2:39 PM · Web app, Storage manager
ardumont closed T116: Add storage endpoints to help with repository updates, a subtask of T51: smart, all-in-one git cloner/loader/ (+ dealing with updates too), as Resolved.
Jan 18 2016, 2:39 PM · Git cloner, Git loader
ardumont closed T116: Add storage endpoints to help with repository updates as Resolved.
Jan 18 2016, 2:39 PM · Storage manager
ardumont changed the status of T264: directory_get: Retrieve information on a directory per hash. from Open to Work in Progress.
Jan 18 2016, 2:39 PM · Web app, Storage manager
ardumont closed T264: directory_get: Retrieve information on a directory per hash., a subtask of T116: Add storage endpoints to help with repository updates, as Resolved.
Jan 18 2016, 2:39 PM · Storage manager
ardumont changed the status of T264: directory_get: Retrieve information on a directory per hash., a subtask of T116: Add storage endpoints to help with repository updates, from Open to Work in Progress.
Jan 18 2016, 2:39 PM · Storage manager
ardumont closed T264: directory_get: Retrieve information on a directory per hash. as Resolved.
Jan 18 2016, 2:39 PM · Web app, Storage manager
ardumont closed T206: /api/1/revision/?timestamp=<TIMESTAMP>&origin=<ORIGIN> as Invalid.
Jan 18 2016, 2:39 PM · Web app
ardumont closed T219: Package swh-web-ui with flask-api declared as requirement as Resolved.
Jan 18 2016, 2:39 PM · Web app
ardumont added a comment to T163: Add entry points to create / request entities in swh.storage.
  • 'Single entity query' done in 9fe94d3, ebe3a29 (entity_get)
Jan 18 2016, 2:39 PM · Storage manager
ardumont added a comment to T116: Add storage endpoints to help with repository updates.

Current status on this:

  • find objects of any type by sha1_git (release/revision/directory/content)
Jan 18 2016, 2:39 PM · Storage manager

Jan 15 2016

ardumont closed T247: Open /api/1/entity/<uuid>, a subtask of T246: Open /browse/entity/<uuid> view, as Resolved.
Jan 15 2016, 4:44 PM · Web app
ardumont closed T247: Open /api/1/entity/<uuid> as Resolved by committing rDWAPPS9b59b77bd9d0: Open /api/1/entity/<string:uuid>/.
Jan 15 2016, 4:44 PM · Web app
ardumont changed the status of T247: Open /api/1/entity/<uuid> from Open to Work in Progress.
Jan 15 2016, 3:44 PM · Web app
ardumont changed the status of T247: Open /api/1/entity/<uuid>, a subtask of T246: Open /browse/entity/<uuid> view, from Open to Work in Progress.
Jan 15 2016, 3:44 PM · Web app
zack added a comment to T78: Unable to perfectly round-trip releases pointed at by occurrences.

What about having "occurrences" for all kinds of objects in a VCS, releases, tags, revisions, etc. ?

Jan 15 2016, 9:28 AM · Storage manager
rdicosmo added a comment to T78: Unable to perfectly round-trip releases pointed at by occurrences.

What about having "occurrences" for all kinds of objects in a VCS, releases, tags, revisions, etc. ?
We would definitely need to look at other VCSs to get a general model; for example, in Darcs patches are first-class citizens: do we have a way of accommodating this in our data model?

Jan 15 2016, 9:08 AM · Storage manager

Jan 14 2016

zack added a comment to T236: make website counters resilient to webapp downtime.

It's not visible in the commit, but I've also just added the following (horrible) anacrontab line to our Gandi simple-paas instance

4@hourly 0 statcache wget -qO /srv/data/web/vhosts/www.softwareheritage.org/htdocs/stat-counters.js.new https://archive.softwareheritage.org/api/1/stat/counters/\?callback=swh_load_counters && mv /srv/data/web/vhosts/www.softwareheritage.org/htdocs/stat-counters.js.new /srv/data/web/vhosts/www.softwareheritage.org/htdocs/stat-counters.js

to keep the cached copy of stat-counters up to date (every 4 hours).
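
The line above downloads to a ".new" file and only then moves it over the live one, so readers never see a partial or missing cache. A minimal Python sketch of the same write-then-rename pattern follows; `refresh_cache` and its `fetch` callable are hypothetical names, and the 0644 permission is an assumption about what a web server would need.

```python
import os
import tempfile
from typing import Callable


def refresh_cache(fetch: Callable[[], bytes], destination: str) -> None:
    """Replace `destination` with fresh content, or leave it untouched.

    The new content is written to a temporary file in the same directory,
    then renamed over the destination. If `fetch` fails, the old cached
    copy stays in place.
    """
    directory = os.path.dirname(os.path.abspath(destination))
    # Same directory as the destination, so the rename stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".new")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(fetch())
        # mkstemp creates the file as 0600; assumed here: readers need 0644.
        os.chmod(tmp_path, 0o644)
        os.replace(tmp_path, destination)
    except BaseException:
        # Clean up the partial file and keep the previous cache.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

Here `fetch` could wrap an HTTP request to the counters endpoint; it is kept as a parameter so that the replacement logic can be exercised without network access.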

Jan 14 2016, 5:49 PM · Website
zack closed T236: make website counters resilient to webapp downtime as Resolved by committing Restricted Diffusion Commit.
Jan 14 2016, 5:35 PM · Website

Jan 13 2016

ardumont closed T253: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/directory/[<PATH>] as Resolved by committing rDWAPPS9dc48ee7ee11: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>]….
Jan 13 2016, 12:59 PM · Web app
ardumont claimed T253: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/directory/[<PATH>] .
Jan 13 2016, 11:29 AM · Web app
ardumont closed T251: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/ , a subtask of T253: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/directory/[<PATH>] , as Resolved.
Jan 13 2016, 11:29 AM · Web app
ardumont closed T251: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/ as Resolved by committing rDWAPPS9f07eb8ba99e: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>]….
Jan 13 2016, 11:29 AM · Web app
ardumont changed the status of T251: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/ , a subtask of T253: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/directory/[<PATH>] , from Open to Work in Progress.
Jan 13 2016, 10:11 AM · Web app
ardumont changed the status of T251: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/ from Open to Work in Progress.
Jan 13 2016, 10:11 AM · Web app

Jan 12 2016

ardumont closed T252: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/directory/[<PATH>] as Resolved by committing rDWAPPS980e08b7038f: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>]….
Jan 12 2016, 6:00 PM · Web app
ardumont changed the status of T252: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/directory/[<PATH>] from Open to Work in Progress.
Jan 12 2016, 5:59 PM · Web app
ardumont changed the status of T251: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/ , a subtask of T253: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/directory/[<PATH>] , from Work in Progress to Open.
Jan 12 2016, 5:59 PM · Web app
ardumont changed the status of T251: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/ from Work in Progress to Open.
Jan 12 2016, 5:59 PM · Web app
ardumont changed the status of T251: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/ from Open to Work in Progress.
Jan 12 2016, 4:32 PM · Web app
ardumont changed the status of T251: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/ , a subtask of T253: Open /api/1/revision/origin/<ORIG_ID>[/branch/<BRANCH>][/ts/<TIMESTAMP>]/history/<SHA1>/directory/[<PATH>] , from Open to Work in Progress.
Jan 12 2016, 4:32 PM · Web app
zack added a comment to T262: Upgrade postgresql to 9.5.

I'd be in favor of working with three clusters then:

Jan 12 2016, 4:22 PM
olasd added a comment to T262: Upgrade postgresql to 9.5.

That sounds sensible. We should be able to export a LV from our spinning rust storage to prado for that "mirrors" postgresql cluster.

Jan 12 2016, 4:18 PM