Nov 6 2020

douardda updated the diff for D4429: doc: add an introduction paragraph in blueprint.rst.

rebase + review comments

Nov 6 2020, 11:48 AM
douardda added inline comments to D4429: doc: add an introduction paragraph in blueprint.rst.
Nov 6 2020, 11:46 AM
douardda added inline comments to D4429: doc: add an introduction paragraph in blueprint.rst.
Nov 6 2020, 11:42 AM
douardda committed rDDEP04c11145081f: doc: improve the spec-loading doc (authored by douardda).
doc: improve the spec-loading doc
Nov 6 2020, 11:36 AM
douardda closed D4428: doc: improve the spec-loading doc.
Nov 6 2020, 11:36 AM
douardda updated the diff for D4428: doc: improve the spec-loading doc.

split code blocks in 2 as requested by vlorentz

Nov 6 2020, 11:32 AM
douardda added inline comments to D4428: doc: improve the spec-loading doc.
Nov 6 2020, 11:27 AM
douardda committed rDDEP61653ddfc7c3: doc: improve the doc of API endpoints (authored by douardda).
doc: improve the doc of API endpoints
Nov 6 2020, 11:26 AM
douardda closed D4427: doc: improve the doc of API endpoints.
Nov 6 2020, 11:26 AM
douardda committed rDDEP1736371f2c6d: doc: rename docs/specs/specs.rst as docs/specs/index.rst (authored by douardda).
doc: rename docs/specs/specs.rst as docs/specs/index.rst
Nov 6 2020, 11:25 AM
douardda closed D4426: doc: rename docs/specs/specs.rst as docs/specs/index.rst.
Nov 6 2020, 11:25 AM
douardda created D4433: Reformat optional arguments passed cli tests to make them easier to read.
Nov 6 2020, 11:16 AM
douardda created D4432: Simplify `swh deposit upload` cli options.
Nov 6 2020, 11:15 AM
douardda created D4431: Refactor BaseApiDepositClient to get rid of the _client argument.
Nov 6 2020, 11:15 AM
douardda created D4430: doc: rename Getting Started as User Manual and update the content.
Nov 6 2020, 11:00 AM
douardda created D4429: doc: add an introduction paragraph in blueprint.rst.
Nov 6 2020, 10:59 AM
douardda created D4428: doc: improve the spec-loading doc.
Nov 6 2020, 10:58 AM
douardda created D4427: doc: improve the doc of API endpoints.
Nov 6 2020, 10:58 AM
douardda created D4426: doc: rename docs/specs/specs.rst as docs/specs/index.rst.
Nov 6 2020, 10:57 AM
douardda committed rDDEP0f4ec31168f6: Fix the sphinx config (authored by douardda).
Fix the sphinx config
Nov 6 2020, 10:29 AM
douardda closed D4425: Fix the sphinx config.
Nov 6 2020, 10:29 AM
douardda created D4425: Fix the sphinx config.
Nov 6 2020, 10:26 AM
douardda created P859 (An Untitled Masterwork).
Nov 6 2020, 10:02 AM
douardda created P858 (An Untitled Masterwork).
Nov 6 2020, 10:01 AM

Nov 5 2020

douardda added inline comments to D4356: common/archive: Avoid db timeouts in lookup_snapshot_sizes.
Nov 5 2020, 2:10 PM
douardda added a comment to D4356: common/archive: Avoid db timeouts in lookup_snapshot_sizes.

It remains unclear to me how this diff addresses the original timeout problem. I see the benefit of having a lazy loading mechanism and a cache to reduce the load, but not how they prevent the timeouts from occurring in the first place (a timeout would also mean the cache never gets filled).

Nov 5 2020, 11:45 AM

Nov 4 2020

douardda updated the task description for T2757: the Slug header should not be mandatory.
Nov 4 2020, 1:16 PM · SWORD deposit
douardda triaged T2757: the Slug header should not be mandatory as Normal priority.
Nov 4 2020, 1:15 PM · SWORD deposit
douardda created T2757: the Slug header should not be mandatory.
Nov 4 2020, 1:14 PM · SWORD deposit
douardda added a comment to T2752: Update the usage of --slug or external_identifier in the deposit cli.

I also think this external_identifier should go away; the spec is rich (aka complicated) enough without us adding more layers :-)

Nov 4 2020, 12:49 PM · SWORD deposit
douardda added a comment to T2752: Update the usage of --slug or external_identifier in the deposit cli.

also note that making the slug a MUST (server-side) is not valid w.r.t. the specs ("The client MAY supply a Slug header")

Nov 4 2020, 12:47 PM · SWORD deposit
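
As an illustration of the point above, here is a minimal sketch of a server-side helper that treats the Slug header as optional. The function name `get_slug` and the plain mapping of headers are assumptions made for this example; this is not the deposit server's actual code.

```python
from typing import Mapping, Optional


def get_slug(headers: Mapping[str, str]) -> Optional[str]:
    """Return the Slug header value if the client sent one, else None.

    Hypothetical helper: HTTP header names are case-insensitive, so the
    lookup compares lowercased names.
    """
    for name, value in headers.items():
        if name.lower() == "slug":
            return value
    # The client MAY omit the header, so absence is not an error.
    return None


# A request without a Slug header is accepted; the caller decides what to do.
slug = get_slug({"Content-Type": "application/zip"})
print("no slug supplied" if slug is None else slug)
```

With this shape, a missing header yields `None` rather than a rejection, and what happens next is left to the caller.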
douardda added inline comments to D4356: common/archive: Avoid db timeouts in lookup_snapshot_sizes.
Nov 4 2020, 10:10 AM
douardda accepted D4313: Add new example repository generated from script.

same as the previous one, ok once green

Nov 4 2020, 9:29 AM
douardda accepted D4311: Add testing repository builder.

I'm fine with this, can be landed as soon as tests are ok.

Nov 4 2020, 9:25 AM
douardda added a comment to D4216: add swh-hg-identify a cli to identify hg objects.

Any reason for not landing this? (@marmoute ? do you keep your request for changes?)

Nov 4 2020, 9:22 AM

Nov 2 2020

douardda accepted D4082: Make the type of values of JournalWriter generic, so it works with types not from swh-model..
Nov 2 2020, 12:06 PM

Oct 30 2020

douardda accepted D4216: add swh-hg-identify a cli to identify hg objects.
Oct 30 2020, 5:03 PM
douardda triaged T2744: Add a support@softwareheritage.org email address that creates tasks in the forge as Normal priority.
Oct 30 2020, 3:31 PM · System administration

Oct 29 2020

douardda accepted D4374: cli.db: Open init-admin subcmd to initialize superuser-level scripts.
Oct 29 2020, 4:18 PM

Oct 28 2020

douardda created P843 (An Untitled Masterwork).
Oct 28 2020, 4:40 PM
douardda requested changes to D4216: add swh-hg-identify a cli to identify hg objects.

see comments

Oct 28 2020, 11:14 AM
douardda added a comment to D4216: add swh-hg-identify a cli to identify hg objects.

So the test fails on Jenkins because it uses the hg command from the system (since mercurial is, oddly enough, not a dependency of swh-loader-mercurial), and on stretch, mercurial is 4.8.
Using mercurial 5.5 is ok.

Oct 28 2020, 11:14 AM
douardda added inline comments to D4311: Add testing repository builder.
Oct 28 2020, 10:09 AM
douardda committed rDDOC4533ec80080d: Remove deprecated chapter on SWH infrastrucuture (authored by douardda).
Remove deprecated chapter on SWH infrastrucuture
Oct 28 2020, 10:04 AM
douardda closed D4354: Remove deprecated chapter on SWH infrastrucuture.
Oct 28 2020, 10:04 AM
douardda added a comment to D4354: Remove deprecated chapter on SWH infrastrucuture.

Is this chapter saved anywhere? or is it so deprecated it shouldn't be saved?

Oct 28 2020, 10:04 AM
douardda committed rDDEP605bfda59483: Add a description of the deposit in the docs' index page (authored by douardda).
Add a description of the deposit in the docs' index page
Oct 28 2020, 10:00 AM
douardda closed D4358: Add a description of the deposit in the docs' index page.
Oct 28 2020, 10:00 AM
douardda updated the diff for D4358: Add a description of the deposit in the docs' index page.

rebase

Oct 28 2020, 9:59 AM

Oct 27 2020

douardda updated the diff for D4358: Add a description of the deposit in the docs' index page.

Define and use the SWH acronym

Oct 27 2020, 5:14 PM
douardda updated the diff for D4358: Add a description of the deposit in the docs' index page.

fixes and improvements suggested by moranegg (big thx)

Oct 27 2020, 2:43 PM
douardda added inline comments to D4358: Add a description of the deposit in the docs' index page.
Oct 27 2020, 2:37 PM
douardda accepted D4362: Update swh.storage.validate for swh.model 0.7.2.
Oct 27 2020, 2:17 PM
douardda updated the diff for D4358: Add a description of the deposit in the docs' index page.

typos (thx ardumont)

Oct 27 2020, 11:54 AM
douardda added a reviewer for D4358: Add a description of the deposit in the docs' index page: moranegg.
Oct 27 2020, 11:52 AM
douardda added a comment to D4313: Add new example repository generated from script.

I would expect the commit message to be a bit more explanatory: either this new test case covers aspects that were not tested before, and it should say so, or it does not, and it should say that too, explaining that this new test is the base for future extended ones in a more manageable way (which is what "updatable" stands for, if I get this right).

Oct 27 2020, 11:46 AM
douardda created D4358: Add a description of the deposit in the docs' index page.
Oct 27 2020, 10:50 AM

Oct 26 2020

douardda created D4354: Remove deprecated chapter on SWH infrastrucuture.
Oct 26 2020, 3:09 PM
douardda closed T2421: Make model objects immutable as Resolved.

should be ok now (even if via ImmutableDict :-) )

Oct 26 2020, 2:51 PM · Data Model
douardda closed T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object as Resolved.
Oct 26 2020, 2:45 PM · Data Model
douardda closed T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object, a subtask of T2421: Make model objects immutable, as Resolved.
Oct 26 2020, 2:45 PM · Data Model
douardda updated the task description for T2423: Extract the `extra_headers` away from `Revision.metadata` into a top-level immutable object.
Oct 26 2020, 2:44 PM · Data Model
douardda added projects to T2585: Properly handle deleted/no longer available origins: Core Loader, Lister.
Oct 26 2020, 2:41 PM · Lister, Core Loader
douardda created P836 (An Untitled Masterwork).
Oct 26 2020, 2:29 PM
douardda added a comment to T1577: Compare/benchmark objstorage backends.

See also T2706

Oct 26 2020, 12:30 PM · Object storage
douardda closed T1370: Report key code metrics in prometheus as Resolved.
Oct 26 2020, 12:30 PM · Metrics/monitoring, Restricted Project, Continuous Integration, System administration
douardda closed T1370: Report key code metrics in prometheus, a subtask of T1024: Proper continuous integration setup, as Resolved.
Oct 26 2020, 12:30 PM · Restricted Project, Continuous Integration, System administration
douardda added a comment to D4082: Make the type of values of JournalWriter generic, so it works with types not from swh-model..

Wouldn't it make things a bit easier to name the generic version of the journal writer something like GenericKafkaJournalWriter and have KafkaJournalWriter = GenericKafkaJournalWriter[BaseModel]? (for bw compat)

Why? This change won't break any code using KafkaJournalWriter

Oct 26 2020, 12:25 PM
douardda committed rDOBJSRPLca21ed290d25: Normalize the expected config entry for the journal_client (authored by douardda).
Normalize the expected config entry for the journal_client
Oct 26 2020, 12:25 PM
douardda closed D4333: Normalize the expected config entry for the journal_client.
Oct 26 2020, 12:25 PM
douardda triaged T2735: Add / expose metrics on BW usage for authenticated users as Normal priority.
Oct 26 2020, 12:16 PM · Web app, System administration
douardda triaged T2733: Explore / install a varnish prometheus probe as High priority.
Oct 26 2020, 11:01 AM · Metrics/monitoring, System administration

Oct 23 2020

douardda closed D4193: swh identify: add --exclude.

closed by 2b869aa7d30d099ed6146d9f8dc667cd7a8eefc3

Oct 23 2020, 11:52 AM
douardda commandeered D4193: swh identify: add --exclude.
Oct 23 2020, 11:52 AM
douardda requested changes to D4311: Add testing repository builder.
Oct 23 2020, 11:25 AM
douardda added a comment to D4311: Add testing repository builder.

Also the commit message should give a bit more information on what this new script is needed for, maybe with a usage example.
This defines a bunch of commands. When and how should "I" use them?

Oct 23 2020, 11:25 AM
douardda added a comment to D4311: Add testing repository builder.

please do not put the "depends on Dxxx" line in the git commit message.

Oct 23 2020, 11:21 AM
douardda requested changes to D4313: Add new example repository generated from script.

ok on the diff itself, but what is this new example repo needed for? This should be explained in the commit message. (The "why"! Always insist on the "why" rather than (or in addition to) the "what" in your commit messages, please.)

Oct 23 2020, 11:20 AM
douardda accepted D4193: swh identify: add --exclude.

ok but please properly document arguments in docstrings.

Oct 23 2020, 11:00 AM
douardda updated the task description for T2645: Add listing tasks for gitea instances.
Oct 23 2020, 10:20 AM · Origin-Gitea/Gogs, Archive coverage, Lister

Oct 22 2020

douardda created D4333: Normalize the expected config entry for the journal_client.
Oct 22 2020, 4:01 PM
douardda accepted D4312: Add `Hg20BundleLoader` tests from json files.

globally ok, but please add a README file as suggested in the previous comment

Oct 22 2020, 12:32 PM
douardda added a comment to D4312: Add `Hg20BundleLoader` tests from json files.

Would be nice to have a README file in tests/data explaining what these json files are and how to produce them.

Oct 22 2020, 11:52 AM
douardda added inline comments to D4082: Make the type of values of JournalWriter generic, so it works with types not from swh-model..
Oct 22 2020, 11:46 AM
douardda added a comment to D4082: Make the type of values of JournalWriter generic, so it works with types not from swh-model..

Wouldn't it make things a bit easier to name the generic version of the journal writer something like GenericKafkaJournalWriter and have KafkaJournalWriter = GenericKafkaJournalWriter[BaseModel]? (for bw compat)

Oct 22 2020, 11:43 AM
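
The aliasing suggested in the comment above could look roughly like the following sketch. The class bodies are placeholders written for this example: `BaseModel` here is a stand-in, not the swh-model class, and the `write` method is hypothetical rather than the real journal writer API.

```python
from typing import Generic, List, TypeVar


class BaseModel:
    """Hypothetical stand-in for the swh-model base class."""


TValue = TypeVar("TValue")


class GenericKafkaJournalWriter(Generic[TValue]):
    """Illustrative writer that only records the values it is given."""

    def __init__(self) -> None:
        self.written: List[TValue] = []

    def write(self, value: TValue) -> None:
        # A real writer would serialize and send the value; this one stores it.
        self.written.append(value)


# Backward-compatible name: existing callers keep using KafkaJournalWriter,
# which is the generic writer specialized for BaseModel values.
KafkaJournalWriter = GenericKafkaJournalWriter[BaseModel]

writer = KafkaJournalWriter()
writer.write(BaseModel())
print(len(writer.written))
```

The alias only affects static typing; at runtime `KafkaJournalWriter()` still creates a `GenericKafkaJournalWriter` instance.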
douardda requested changes to D4193: swh identify: add --exclude.

This globally LGTM but there is this path encoding issue. The 2 new functions in from_disk.py should take a bytes argument instead of a str one.

Oct 22 2020, 11:25 AM
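
To illustrate the bytes-versus-str remark above, here is a minimal sketch of an exclusion check that works on bytes paths. The function `is_excluded` is hypothetical and is not one of the functions in from_disk.py; it only shows that standard library helpers such as `os.path.basename` and `fnmatch.fnmatchcase` accept bytes, so file names that are not valid UTF-8 can be handled without decoding.

```python
import fnmatch
import os
from typing import Iterable


def is_excluded(path: bytes, patterns: Iterable[bytes]) -> bool:
    """Return True if the last component of `path` matches any glob pattern.

    Hypothetical helper: both the path and the patterns are bytes, so no
    assumption is made about the encoding of file names.
    """
    name = os.path.basename(path)
    return any(fnmatch.fnmatchcase(name, pattern) for pattern in patterns)


# A str path coming from the command line can be converted once, at the edge.
raw_path = os.fsencode("project/.git")
print(is_excluded(raw_path, [b".git", b"*.pyc"]))
```

Converting with `os.fsencode` at the boundary keeps the rest of the code working on bytes only.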

Oct 21 2020

douardda created P830 (An Untitled Masterwork).
Oct 21 2020, 2:39 PM
douardda updated the task description for T2645: Add listing tasks for gitea instances.
Oct 21 2020, 12:18 PM · Origin-Gitea/Gogs, Archive coverage, Lister

Oct 19 2020

douardda created P828 (An Untitled Masterwork).
Oct 19 2020, 5:48 PM
douardda added inline comments to D4216: add swh-hg-identify a cli to identify hg objects.
Oct 19 2020, 4:49 PM
douardda triaged T2717: Write an end-user documentation on how to use the authenticated stack as High priority.
Oct 19 2020, 1:26 PM · Web app, Documentation

Oct 16 2020

douardda added a comment to T2706: Benchmark objstorage for mirror (uffizi vs. azure vs. s3).

Same as before but with 1M (fresh) sha1s:

Oct 16 2020, 1:02 PM · Object storage, Mirror
douardda added a comment to T2706: Benchmark objstorage for mirror (uffizi vs. azure vs. s3).

Since the results on uffizi above did suffer from a few caveats, I've made a few more tests:

  • a first result has been obtained with a dataset that had only objects stored on the XFS part of the objstorage
  • a second dataset has been created (with the order by sha256 part to spread the sha1s)
  • but the results are a mix of hot/cold cache tests
Oct 16 2020, 11:59 AM · Object storage, Mirror

Oct 15 2020

douardda added a comment to T2706: Benchmark objstorage for mirror (uffizi vs. azure vs. s3).

Some results:

Oct 15 2020, 1:02 PM · Object storage, Mirror
douardda added a comment to T2706: Benchmark objstorage for mirror (uffizi vs. azure vs. s3).

Current benchmark scenario:

Oct 15 2020, 12:43 PM · Object storage, Mirror
douardda triaged T2706: Benchmark objstorage for mirror (uffizi vs. azure vs. s3) as High priority.
Oct 15 2020, 12:36 PM · Object storage, Mirror

Oct 14 2020

douardda created P821 (An Untitled Masterwork).
Oct 14 2020, 3:43 PM
douardda created P820 bench objstorage.
Oct 14 2020, 3:39 PM

Oct 13 2020

douardda accepted D4089: Add tests and fix behavior of scanner cli.

I'm mostly OK with this now, so I'll make it "accepted", but please refactor a bit the cli_run_[n]ok() helper functions before landing it.

Oct 13 2020, 1:24 PM
douardda set the repository for D4193: swh identify: add --exclude to rDMOD Data model.
Oct 13 2020, 1:04 PM