Oct 2 2015

zack added a project to T45: Fix swh.storage.storage.occurrence_add for overlapping intervals: Storage manager.
Oct 2 2015, 11:03 PM · Storage manager
zack added a project to T3: Discuss the project <-> origin mapping: Storage manager.
Oct 2 2015, 11:03 PM · Storage manager
zack added a comment to T22: DB schema: add metadata field to revision table.

see also T62

Oct 2 2015, 11:02 PM · Storage manager
zack added a comment to T62: DB schema: add directory→tarball provenance information.

see also T22

Oct 2 2015, 11:02 PM · Storage manager
zack added a project to T22: DB schema: add metadata field to revision table: Storage manager.
Oct 2 2015, 11:02 PM · Storage manager
zack added a project to T41: add actual content to swh-storage-testdata: Storage manager.
Oct 2 2015, 11:00 PM · Storage manager
zack updated subscribers of T48: upload-based content search.
Oct 2 2015, 10:59 PM · Web app
zack added a project to T49: DB schema: add missing unicity constraint on origin (type, url): Storage manager.
Oct 2 2015, 10:59 PM · Restricted Project, Storage manager
zack added projects to T51: smart, all-in-one git cloner/loader/ (+ dealing with updates too): Git loader, Git cloner.
Oct 2 2015, 10:59 PM · Git cloner, Git loader
zack added a project to T60: deploy webapp at http://base.softwareheritage.org: Web app.
Oct 2 2015, 10:58 PM · Web app
zack added a project to T61: loading: trigger to update occurrence table: Storage manager.
Oct 2 2015, 10:58 PM · Storage manager
zack added a project to T62: DB schema: add directory→tarball provenance information: Storage manager.
Oct 2 2015, 10:58 PM · Storage manager
zack added a project to T9: directory (= extracted archive) loader - 1st deployable version: Directory loader.
Oct 2 2015, 10:58 PM · Directory loader
zack added a project to T36: performance estimation: how long will it take to git-bulk-load all the GitHub repos we have: Git loader.
Oct 2 2015, 10:58 PM · Git loader
zack added a project to T29: evaluate conffile/argparse Python module: Core & foundations.
Oct 2 2015, 10:57 PM · Core & foundations
zack added a project to T30: port ghlister to swh.* namespace and conventions: GitHub lister.
Oct 2 2015, 10:56 PM · GitHub lister
zack added a project to T38: port ghlister to swh task interface - list all / catch up: GitHub lister.
Oct 2 2015, 10:56 PM · GitHub lister
zack added a project to T37: port ghlister to swh task interface - daily updates: GitHub lister.
Oct 2 2015, 10:56 PM · GitHub lister
olasd added a comment to T36: performance estimation: how long will it take to git-bulk-load all the GitHub repos we have.

This makes me think that we are now I/O-bound on writes to our storage.

Oct 2 2015, 6:19 PM · Git loader
olasd added a comment to T36: performance estimation: how long will it take to git-bulk-load all the GitHub repos we have.

This task made good progress today. I spent a little while going through our logs to understand how much margin we have for performance.

Oct 2 2015, 5:56 PM · Git loader

Oct 1 2015

ardumont added a comment to T60: deploy webapp at http://base.softwareheritage.org.
  • /revision/<SHA1_GIT>: show commit information
  • /directory/<SHA1_GIT>: show directory information (including ls)
  • /directory/<SHA1_GIT>/path/to/file-or-dir: ditto, but for dir pointed by path
  • /content/[<HASH_ALGO>:]<HASH>: show content information
  • /release/<SHA1_GIT>: show release information
  • /person/<PERSON_ID>: show person information
  • /origin/<ORIGIN_ID>: show origin information
  • /project/<PROJECT_ID>: show project information
  • /organization/<ORGANIZATION_ID>: show organization information
  • /directory/<TIMESTAMP>/<ORIGIN>|/<BRANCH>|/path/to/file-or-dir: show directory information at timestamp/origin/branch
  • /revision/<TIMESTAMP>/<ORIGIN>|/<BRANCH>: show revision information at origin/branch/timestamp
  • /revision/<TIMESTAMP>/<ORIGIN>|: show all branches of origin at a given timestamp
  • /revision/<TIMESTAMP>/<ORIGIN>|/<BRANCH>|: show all revisions (~git log) of origin and branch at a given timestamp
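
To make the optional `<HASH_ALGO>:` prefix of the /content/ route concrete, here is a minimal sketch of how such a path could be split into its parts. The function name `parse_content_path` and the fallback algorithm `sha1` are assumptions for illustration only; the list above does not say which algorithm applies when the prefix is omitted.

```python
def parse_content_path(path, default_algo="sha1"):
    """Split '/content/[<HASH_ALGO>:]<HASH>' into (algo, hash).

    default_algo is a hypothetical fallback used when no prefix is given.
    """
    prefix = "/content/"
    if not path.startswith(prefix):
        raise ValueError("not a content path: %r" % path)
    spec = path[len(prefix):]
    # partition() returns (head, sep, tail); sep is empty when ':' is absent.
    algo, sep, digest = spec.partition(":")
    if not sep:
        algo, digest = default_algo, spec
    if not algo or not digest:
        raise ValueError("missing hash in content path: %r" % path)
    return algo, digest
```

For example, `parse_content_path("/content/sha256:abcd")` yields the pair `("sha256", "abcd")`, while a path without a prefix falls back to the assumed default.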
Oct 1 2015, 5:03 PM · Web app
zack created T62: DB schema: add directory→tarball provenance information.
Oct 1 2015, 4:38 PM · Storage manager
ardumont claimed T9: directory (= extracted archive) loader - 1st deployable version.
Oct 1 2015, 4:02 PM · Directory loader
zack created T61: loading: trigger to update occurrence table.
Oct 1 2015, 3:48 PM · Storage manager
zack added a project to T60: deploy webapp at http://base.softwareheritage.org: System administrators.
Oct 1 2015, 3:45 PM · Web app
zack created T60: deploy webapp at http://base.softwareheritage.org.
Oct 1 2015, 3:45 PM · Web app
zack raised the priority of T36: performance estimation: how long will it take to git-bulk-load all the GitHub repos we have from Normal to High.
Oct 1 2015, 3:44 PM · Git loader
zack renamed T9: directory (= extracted archive) loader - 1st deployable version from Debian dir loader - 1st deployable version to directory (= extracted archive) loader - 1st deployable version.
Oct 1 2015, 3:41 PM · Directory loader
zack renamed T9: directory (= extracted archive) loader - 1st deployable version from Debian (.dsc) loader - 1st deployable version to Debian dir loader - 1st deployable version.
Oct 1 2015, 3:41 PM · Directory loader
zack closed T58: move last batch of github clones (~3M) from /incoming to /data as Resolved.
Oct 1 2015, 2:25 PM
ardumont updated the task description for T59: Build and upload one swh-web-ui debian package.
Oct 1 2015, 11:30 AM
ardumont updated the task description for T59: Build and upload one swh-web-ui debian package.
Oct 1 2015, 11:29 AM
ardumont closed T59: Build and upload one swh-web-ui debian package as Resolved.
Oct 1 2015, 11:28 AM
ardumont updated the task description for T59: Build and upload one swh-web-ui debian package.
Oct 1 2015, 11:27 AM
ardumont updated the task description for T59: Build and upload one swh-web-ui debian package.
Oct 1 2015, 11:13 AM
ardumont triaged T59: Build and upload one swh-web-ui debian package as Normal priority.
Oct 1 2015, 11:08 AM
zack closed T21: gzip antelink content on sesi-pv-lc2 as Resolved.
Oct 1 2015, 10:00 AM
zack closed T21: gzip antelink content on sesi-pv-lc2, a subtask of T19: transfer antelink content from sesi-pv-lc2 to SWH infra, as Resolved.
Oct 1 2015, 10:00 AM
ardumont created T59: Build and upload one swh-web-ui debian package.
Oct 1 2015, 9:49 AM
ardumont closed T47: lookup one hash and returns information about it (origin, revision, etc...), a subtask of T32: web UI: checksum search, as Resolved.
Oct 1 2015, 9:43 AM
ardumont closed T47: lookup one hash and returns information about it (origin, revision, etc...) as Resolved.
Oct 1 2015, 9:43 AM

Sep 30 2015

zack added a project to T30: port ghlister to swh.* namespace and conventions: Developers.
Sep 30 2015, 6:28 PM · GitHub lister
olasd moved T36: performance estimation: how long will it take to git-bulk-load all the GitHub repos we have from Backlog to This week on the Staff board.
Sep 30 2015, 4:35 PM · Git loader
olasd closed T35: port git-bulk-loader to generic task scheduling API as Resolved.

Done as of rDLDG69a5070

Sep 30 2015, 4:34 PM
zack added a comment to T58: move last batch of github clones (~3M) from /incoming to /data.

mv started on uffizi, in a screen session

Sep 30 2015, 2:33 PM
zack added a project to T58: move last batch of github clones (~3M) from /incoming to /data: Developers.
Sep 30 2015, 1:35 PM
zack closed T55: DB stored procedures: use LANGUAGE sql for pure-SQL functions instead of LANGUAGE plpgsql as Resolved by committing rDSTO3b67e58d1e2d: swh-func.sql: use LANGUAGE sql wherever possible.
Sep 30 2015, 11:58 AM

Sep 29 2015

olasd closed T43: Convention for error passing from storage "backend" to storage "API server" to storage "API client" as Resolved.

Resolved as of rDSTO2b46e6941afe

Sep 29 2015, 5:02 PM
zack added a comment to T19: transfer antelink content from sesi-pv-lc2 to SWH infra.

As discussed on swh-private, this is no longer required. We will reassess after having injected all the content we already have, selectively transferring only what we want/need.

Sep 29 2015, 4:34 PM
zack closed T19: transfer antelink content from sesi-pv-lc2 to SWH infra as Wontfix.
Sep 29 2015, 4:33 PM
ardumont closed T32: web UI: checksum search as Resolved.
Sep 29 2015, 11:09 AM
ardumont added a comment to T32: web UI: checksum search.
  • Done once with basic API
  • Refactor to use a unified API call
  • Keep up with latest changes on swh-storage
Sep 29 2015, 11:09 AM
ardumont closed T33: Git cloner: catch up with new GitHub repositories after the summer as Resolved.
Sep 29 2015, 11:08 AM
olasd closed T44: Rename swh.storage.api to swh.storage.api.server, swh.storage.remote_storage to swh.storage.api.client as Resolved by committing rDSTO1ddeff4be42d: Move remote_storage and api to api.client and api.server.
Sep 29 2015, 10:22 AM

Sep 28 2015

zack closed T54: DB schema: merge directory_list_* tables into directory as Resolved.
Sep 28 2015, 4:36 PM
zack added a comment to T54: DB schema: merge directory_list_* tables into directory.

done in rDSTObe3910ecff368967cbef7f803dbdf191c1510c3d (and subsequent fixups by olasd)

Sep 28 2015, 4:36 PM

Sep 27 2015

zack created T55: DB stored procedures: use LANGUAGE sql for pure-SQL functions instead of LANGUAGE plpgsql.
Sep 27 2015, 7:12 PM
zack created T54: DB schema: merge directory_list_* tables into directory.
Sep 27 2015, 4:50 PM
zack created T52: swh-cron: manifest-based scheduler for recurring tasks.
Sep 27 2015, 11:21 AM
zack created T51: smart, all-in-one git cloner/loader/ (+ dealing with updates too).
Sep 27 2015, 11:16 AM · Git cloner, Git loader
olasd created T49: DB schema: add missing unicity constraint on origin (type, url).
Sep 27 2015, 11:02 AM · Restricted Project, Storage manager
olasd closed T8: Git loader - 1st deployable version as Resolved.

python3-swh.loader.git is installed and running on worker0{5..8}

Sep 27 2015, 9:51 AM

Sep 26 2015

zack added a comment to T21: gzip antelink content on sesi-pv-lc2.

gzip/checksumming restarted, after fixing the /etc/fstab mess on the machine

Sep 26 2015, 9:28 PM
zack added a comment to T19: transfer antelink content from sesi-pv-lc2 to SWH infra.

priority lowered as, for better or worse, we have already freed enough space on the machine for DB backups without having to transfer the data

Sep 26 2015, 3:36 PM
zack lowered the priority of T19: transfer antelink content from sesi-pv-lc2 to SWH infra from Normal to Low.
Sep 26 2015, 3:35 PM

Sep 25 2015

zack removed a parent task for T19: transfer antelink content from sesi-pv-lc2 to SWH infra: T6: backup: postgres DB.
Sep 25 2015, 5:07 PM

Sep 24 2015

ardumont created T48: upload-based content search.
Sep 24 2015, 3:16 PM · Web app
ardumont created T47: lookup one hash and returns information about it (origin, revision, etc...).
Sep 24 2015, 3:15 PM
ardumont closed T46: lookup one hash to check presence, a subtask of T32: web UI: checksum search, as Resolved.
Sep 24 2015, 3:07 PM
ardumont closed T46: lookup one hash to check presence as Resolved.
Sep 24 2015, 3:07 PM
ardumont created T46: lookup one hash to check presence.
Sep 24 2015, 3:07 PM
olasd triaged T43: Convention for error passing from storage "backend" to storage "API server" to storage "API client" as High priority.
Sep 24 2015, 3:03 PM
olasd created T45: Fix swh.storage.storage.occurrence_add for overlapping intervals.
Sep 24 2015, 3:02 PM · Storage manager
zack added a comment to T43: Convention for error passing from storage "backend" to storage "API server" to storage "API client".

the {exception:.../args:....} proposal + re-raising in the client looks good to me

Sep 24 2015, 2:58 PM
olasd created T44: Rename swh.storage.api to swh.storage.api.server, swh.storage.remote_storage to swh.storage.api.client.
Sep 24 2015, 2:56 PM
olasd added a comment to T43: Convention for error passing from storage "backend" to storage "API server" to storage "API client".

We should standardize on:

  • an HTTP error code (400 / Bad Request ?)
  • a serialization format for the answer (JSON, probably serializing the error type and error args), e.g.
{
    'exception': e.__class__.__name__,
    'args': e.args,
}

and then deserialize that and raise in the client.
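
A minimal sketch of that round trip, assuming a 400 status and a JSON body with the two keys proposed above. The helper names (`encode_error`, `raise_from_response`), the `KNOWN_EXCEPTIONS` registry and the `RemoteError` fallback are hypothetical and are not part of swh.storage.

```python
import json

# Hypothetical allow-list: the client only rebuilds exception types it knows.
KNOWN_EXCEPTIONS = {
    "ValueError": ValueError,
    "KeyError": KeyError,
}


class RemoteError(Exception):
    """Fallback raised when the server names an unknown exception type."""


def encode_error(exc):
    """Server side: turn an exception into a (status, JSON body) pair.

    Assumes exc.args holds only JSON-serializable values.
    """
    body = json.dumps({
        "exception": exc.__class__.__name__,
        "args": list(exc.args),
    })
    return 400, body


def raise_from_response(status, body):
    """Client side: re-raise the exception described by an error response."""
    if status != 400:
        return
    payload = json.loads(body)
    exc_class = KNOWN_EXCEPTIONS.get(payload["exception"], RemoteError)
    raise exc_class(*payload["args"])
```

Looking the class up in an explicit registry, rather than resolving the name dynamically, keeps the client from instantiating arbitrary types named by the server.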

Sep 24 2015, 2:49 PM
olasd created T43: Convention for error passing from storage "backend" to storage "API server" to storage "API client".
Sep 24 2015, 2:45 PM
olasd closed T42: swh-git-loader: merge bulk-git-loader into master as Resolved by committing rDLDG62aa44fb0097: Merge branch 'bulk-git-loader'.
Sep 24 2015, 12:12 PM
olasd closed T42: swh-git-loader: merge bulk-git-loader into master, a subtask of T41: add actual content to swh-storage-testdata, as Resolved.
Sep 24 2015, 12:12 PM · Storage manager
olasd moved T42: swh-git-loader: merge bulk-git-loader into master from Backlog to Today on the Staff board.
Sep 24 2015, 11:35 AM
zack added a parent task for T42: swh-git-loader: merge bulk-git-loader into master: T41: add actual content to swh-storage-testdata.
Sep 24 2015, 11:21 AM
zack added a subtask for T41: add actual content to swh-storage-testdata: T42: swh-git-loader: merge bulk-git-loader into master.
Sep 24 2015, 11:21 AM · Storage manager
zack created T42: swh-git-loader: merge bulk-git-loader into master.
Sep 24 2015, 11:21 AM
ardumont added a comment to T33: Git cloner: catch up with new GitHub repositories after the summer.
Sep 24 2015, 11:15 AM
zack renamed T41: add actual content to swh-storage-testdata from add actual content to swh-content-testdata to add actual content to swh-storage-testdata.
Sep 24 2015, 11:10 AM · Storage manager
zack added a comment to T41: add actual content to swh-storage-testdata.

As a first step to address this, in rDSTOTb841c72f6a240eac8131e40614c9ce20d75b1c96 I've checked into swh-storage-testdata several sample repositories, both as submodules and fast-export dumps.
Starting from those, we now need a Makefile target that batch-imports them into the SWH DB and recreates the test database dumps.

Sep 24 2015, 11:10 AM · Storage manager
ardumont closed T34: Git cloner: port to generic task scheduling interface as Resolved.
Sep 24 2015, 11:08 AM

Sep 23 2015

zack created T41: add actual content to swh-storage-testdata.
Sep 23 2015, 5:54 PM · Storage manager
zack moved T39: git conf: allow git push --force on all repos referenced by swh-environment (at least) from Today to Done on the Staff board.
Sep 23 2015, 12:09 PM
zack closed T39: git conf: allow git push --force on all repos referenced by swh-environment (at least) as Resolved.
Sep 23 2015, 12:09 PM
zack moved T23: Storage API: object writing methods from This week to Done on the Staff board.
Sep 23 2015, 12:09 PM
zack moved T25: Storage API: occurrence writing method from Today to Done on the Staff board.
Sep 23 2015, 12:08 PM
zack moved T24: Storage API: release writing method from Today to Done on the Staff board.
Sep 23 2015, 12:08 PM
zack moved T10: Statistics from Git loading testdrive from Today to Done on the Staff board.
Sep 23 2015, 12:08 PM
zack created T40: prevent git push --force to master.
Sep 23 2015, 12:08 PM · Phabricator
zack created T39: git conf: allow git push --force on all repos referenced by swh-environment (at least).
Sep 23 2015, 12:02 PM
zack moved T21: gzip antelink content on sesi-pv-lc2 from Today to Backlog on the Staff board.
Sep 23 2015, 12:02 PM
zack added a comment to T21: gzip antelink content on sesi-pv-lc2.

now blocked due to the issues on sesi-pv-lc2 after the reboot :-/

Sep 23 2015, 12:02 PM

Sep 22 2015

zack closed T31: shw.core: generic API for (celery) task scheduling, a subtask of T34: Git cloner: port to generic task scheduling interface, as Resolved.
Sep 22 2015, 2:21 PM
zack closed T31: shw.core: generic API for (celery) task scheduling as Resolved.
Sep 22 2015, 2:21 PM