Page MenuHomeSoftware Heritage
Feed Advanced Search

Nov 30 2020

douardda accepted D4627: Support multiple log level specifications in the swh cli.

LGTM. I was wondering if a regexp would not be simpler for the validation step, but it would make error message more generic and less user friendly.

Nov 30 2020, 10:39 AM
douardda updated the task description for T2827: Deploy an instance of hedgedoc.
Nov 30 2020, 10:07 AM · System administration
douardda triaged T2827: Deploy an instance of hedgedoc as Wishlist priority.
Nov 30 2020, 9:54 AM · System administration

Nov 27 2020

douardda requested changes to D3435: Add mercurial.from_disk.HgLoaderFromDisk.

ok we are getting close I think. Just a few more comments to be handled and we are done.

Nov 27 2020, 4:38 PM
douardda raised the priority of T2818: Document the journal topics from Normal to High.
Nov 27 2020, 10:18 AM · Documentation
douardda added a comment to T2818: Document the journal topics.

I think the best would be to put this directly in swh-docs ( rDDOC ) because it's kind of "transverse" between swh-journal, swh-storage and swh-indexer.

Nov 27 2020, 9:59 AM · Documentation
douardda triaged T2818: Document the journal topics as Normal priority.
Nov 27 2020, 9:46 AM · Documentation

Nov 25 2020

douardda accepted D4595: api: Fix endpoint responses that must not be cached.

Thanks!
it's not completely clear why the @never_cache did not work any more, but meh.

Nov 25 2020, 4:14 PM
douardda created P880 (An Untitled Masterwork).
Nov 25 2020, 10:43 AM
douardda updated subscribers of T2810: API endpoint /vault/directory/<dirhash>/ should not be cached by varnish.

This may be better suited to a fix in the web API, via proper cache config headers, as @olasd mentioned on IRC (probably via https://docs.djangoproject.com/en/3.1/topics/cache/#downstream-caches )

Nov 25 2020, 10:42 AM · Web app, System administration
douardda added a project to T2810: API endpoint /vault/directory/<dirhash>/ should not be cached by varnish: Web app.
Nov 25 2020, 10:41 AM · Web app, System administration
douardda triaged T2810: API endpoint /vault/directory/<dirhash>/ should not be cached by varnish as High priority.
Nov 25 2020, 10:15 AM · Web app, System administration

Nov 24 2020

douardda edited P877 (An Untitled Masterwork).
Nov 24 2020, 2:19 PM
douardda edited P877 (An Untitled Masterwork).
Nov 24 2020, 2:19 PM
douardda created P877 (An Untitled Masterwork).
Nov 24 2020, 2:18 PM
douardda accepted D4568: Add support for large negative integers in msgpack encoding.

bit sad indeed, but LGTM

Nov 24 2020, 10:04 AM

Nov 23 2020

douardda accepted D4463: Remove the <external_identifier> tag from the protocol..

lgtm (besides the retries_left=3 stuff)

Nov 23 2020, 12:47 PM
douardda accepted D4550: requirements-test.txt: Explicit swh.core[db] as test requirement.
Nov 23 2020, 12:44 PM
douardda accepted D4554: requirements-test.txt: Drop no longer needed test dependency.

ok ok ok

Nov 23 2020, 12:42 PM
douardda accepted D4548: requirements-test.txt: Drop no longer needed pytest-postgresql requirement.
Nov 23 2020, 12:36 PM
douardda added a comment to D4554: requirements-test.txt: Drop no longer needed test dependency.

swh.core[db] is already a dep in requirements-swh.txt, why would is be needed here?

But more importantly, I disagree with the validity of the diff because pytest-posgtgresql is used directly in this package (in conftest.py), not only by the mean of the swh.core.tests.db package.

Nov 23 2020, 10:53 AM
douardda requested changes to D4554: requirements-test.txt: Drop no longer needed test dependency.

swh.core[db] is already a dep in requirements-swh.txt, why would is be needed here?

Nov 23 2020, 10:40 AM
douardda added inline comments to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 23 2020, 10:34 AM

Nov 20 2020

douardda added a comment to D4463: Remove the <external_identifier> tag from the protocol..

I disagree. We don't want to re-use it, but to create a new tag entirely.

Nov 20 2020, 4:04 PM
douardda accepted D4541: Add content lru cache to HgLoaderFromDisk.

Besides @marmoute 's comments, lgtm

Nov 20 2020, 3:37 PM
douardda accepted D4527: Fix SWORD XMLNS (http://purl.org/net/sword/ -> http://purl.org/net/sword/terms/).

if we are confident this really is the correct URI, lgtm

Nov 20 2020, 3:32 PM
douardda accepted D4547: Move the XML parser function to a module that does not import Django..

(maybe with a better ci msg)

Nov 20 2020, 3:30 PM
douardda accepted D4544: Clarify names around the swh.core.db.pytest_plugin tests.

I would put (at least part of) the diff's message in the commit message, otherwise lgtm

Nov 20 2020, 3:15 PM
douardda accepted D4529: Fix XML parsing of the client..

it looks good to me, but, yeah...

Nov 20 2020, 3:06 PM
douardda accepted D4530: Use a XML prefix for dublincore tags, instead of merging them with atom tags..
Nov 20 2020, 2:32 PM
douardda accepted D4533: Explicitly use the atom: prefix internally..

lgtm (besides the red CI flag)

Nov 20 2020, 2:29 PM
douardda added a project to T2800: Add support for webhook-like callback for the vault: Vault.
Nov 20 2020, 11:18 AM · Vault
douardda triaged T2800: Add support for webhook-like callback for the vault as Wishlist priority.
Nov 20 2020, 11:17 AM · Vault
douardda updated subscribers of T2799: Add support for SWHID as source of repository for jupyterhub.
Nov 20 2020, 11:12 AM
douardda triaged T2799: Add support for SWHID as source of repository for jupyterhub as Wishlist priority.
Nov 20 2020, 11:12 AM

Nov 18 2020

douardda added a comment to T2794: Open a sword v3 compliant deposit implementation.

Note that we will need to keep the support for SWORD v2 (at least for a while), so this open question must be tackled with this in mind.

Nov 18 2020, 2:10 PM · SWORD deposit
douardda added a comment to T2537: Extend new deposit endpoint to support metadata-only deposits.

Is this related to T1021?

I'd say yes, i added that task as parent task.

Nov 18 2020, 11:46 AM · Roadmap 2020, SWORD deposit, Scientific Community Building
douardda added a comment to D4491: assets/revision/diff-utils: Fix text selection in revision view.

why store the current tab in a variable instead of just asking it when needed (using something similar to e.currentTarget.text.trim())?

Nov 18 2020, 10:05 AM
douardda accepted D4498: add journal of notable archival changes.
Nov 18 2020, 10:00 AM

Nov 17 2020

douardda added a comment to D4490: swh.deposit.parsers: Ensure SWHIDs with context are strictly valid.

IMO the SWHID class should store the parsed core SWHIDs for its qualifiers, so they don't need to be parsed twice (+ error handling)

Nov 17 2020, 12:36 PM · SWORD deposit
douardda added a comment to T2537: Extend new deposit endpoint to support metadata-only deposits.

Is this related to T1021?

Nov 17 2020, 12:34 PM · Roadmap 2020, SWORD deposit, Scientific Community Building
douardda added a comment to T2767: Make the Slug header optional for the deposit server.

It seems to make perfectly sense to use the same logic as the metadata deposit to handle this problem, I think.

Nov 17 2020, 12:25 PM · SWORD deposit
douardda added a comment to T2779: Put information (client, collection and deposit-id) inside metadata for metadata-only deposit.

I may have missed something (several actually) but where is this swh:deposit namespace specified?

Nov 17 2020, 12:13 PM · Metadata workflow, SWORD deposit
douardda added a comment to T2786: UI: wrong usage of the "go to origin" icon next to the origin URL that redirect to SWH.

We need to think about a better UI for this, but I have no solution for now.

Nov 17 2020, 11:53 AM · Web app
douardda added a comment to T2779: Put information (client, collection and deposit-id) inside metadata for metadata-only deposit.

To solve this discrepancy, deposit message should be added in the xml.

Nov 17 2020, 11:52 AM · Metadata workflow, SWORD deposit
douardda added a comment to T2779: Put information (client, collection and deposit-id) inside metadata for metadata-only deposit.

I may have missed something (several actually) but where is this swh:deposit namespace specified?

Nov 17 2020, 11:48 AM · Metadata workflow, SWORD deposit
douardda committed rDLDBASEcbd9c82c571b: Small docstring improvements in the deposit loader code (authored by douardda).
Small docstring improvements in the deposit loader code
Nov 17 2020, 11:39 AM
douardda added a comment to T2781: Make it obvious that services are the staging version.

Also I believe this will have its priority to be elevated once the staging is officially made public

Nov 17 2020, 11:32 AM · Easy hack, SWORD deposit, Web app
douardda added a comment to T2781: Make it obvious that services are the staging version.

I vote for the big fat red banner

Nov 17 2020, 11:31 AM · Easy hack, SWORD deposit, Web app
douardda triaged T2786: UI: wrong usage of the "go to origin" icon next to the origin URL that redirect to SWH as Normal priority.
Nov 17 2020, 11:11 AM · Web app
douardda added a comment to T2778: Change origin on deposit checks to use the same origin.

I don't understand this. I guess https://www.softwareheritage.org/check-deposit-2020-11-15T21:58:29.744061 is the Origin URL of this deposit https://archive.softwareheritage.org/browse/origin/directory/?origin_url=https://www.softwareheritage.org/check-deposit-2020-11-15T21:58:29.744061 right?

Nov 17 2020, 11:01 AM · SWORD deposit
douardda added a comment to D4463: Remove the <external_identifier> tag from the protocol..

What bugs me is that said url does not make sense (when that slug is generated), that url means nothing...

It does mean something, it's an URI (resource identifier) but not an URL (resource locator)

Nov 17 2020, 10:41 AM
douardda added a comment to T2782: add a "Filter Pull Requests" checkbox (or similar) in the Branches view of an origin in the web UI .

I don't mind a "quick" solution that fixes the problem right now, and revisit it once we have T2459 fixed. Not sure I agree with the statement that my proposal in the comment is "fine-grained" though. I mean I suggest nothing fancy, just a few entries in a dropbox which content is built from the "directories" of the branch names.

Nov 17 2020, 10:17 AM · Web app

Nov 16 2020

douardda added a comment to T2782: add a "Filter Pull Requests" checkbox (or similar) in the Branches view of an origin in the web UI .

maybe a generic widget allowing to select the "directory" to show in the branches view, For example, in this DGtal repository, it would propose a dropbox with the following items:

Nov 16 2020, 5:07 PM · Web app
douardda triaged T2782: add a "Filter Pull Requests" checkbox (or similar) in the Branches view of an origin in the web UI as High priority.
Nov 16 2020, 5:03 PM · Web app

Nov 13 2020

douardda added a comment to D4463: Remove the <external_identifier> tag from the protocol..

if the slug is not sent, then the id is generated (like we currently do in the cli) but server side.

Let me clarify, that's a hack that was installed for a specific user (which does not use it in the end).

I don't want that hack from the cli ending up server side...

We should probably rework the origin logic first then.

Nov 13 2020, 3:00 PM
douardda accepted D4464: Remove the <client> tag from the protocol..
Nov 13 2020, 2:54 PM
douardda added a comment to D4463: Remove the <external_identifier> tag from the protocol..

Something is amiss for me somewhere.

I thought we wanted to relax the slug's mandatory property (to respect the sword v2 spec).
If that's the case, we still need the external_identifier in the loop.

Because that's what's used at some point to build the origin for the loader.

What did i miss?

Because, from my understanding of another discussion with @douardda
or @moranegg, i thought it was the other way around.
Keep the external_identifier and use it if not provided by the slug.

So that way, we can go the optional slug road.

Nov 13 2020, 10:42 AM

Nov 12 2020

douardda committed rDDEP125c4831b443: Customize the user-agent header in deposit client classes (authored by douardda).
Customize the user-agent header in deposit client classes
Nov 12 2020, 10:18 AM
douardda committed rDDEP3a48807eb2b0: Use a requests.Session object in deposit client classes (authored by douardda).
Use a requests.Session object in deposit client classes
Nov 12 2020, 10:18 AM
douardda closed D4459: Customize the user-agent header in deposit client classes.
Nov 12 2020, 10:18 AM

Nov 10 2020

douardda created D4459: Customize the user-agent header in deposit client classes.
Nov 10 2020, 6:43 PM
douardda committed rDDEPb5da4cdd68db: Reformat optional arguments passed cli tests to make them easier to read (authored by douardda).
Reformat optional arguments passed cli tests to make them easier to read
Nov 10 2020, 6:42 PM
douardda closed D4433: Reformat optional arguments passed cli tests to make them easier to read.
Nov 10 2020, 6:42 PM
douardda closed D4432: Simplify `swh deposit upload` cli options.
Nov 10 2020, 6:42 PM
douardda committed rDDEP7421887510ba: Simplify `swh deposit upload` cli options (authored by douardda).
Simplify `swh deposit upload` cli options
Nov 10 2020, 6:42 PM
douardda committed rDDEP1129a092e671: Refactor BaseApiDepositClient to get rid of the _client argument (authored by douardda).
Refactor BaseApiDepositClient to get rid of the _client argument
Nov 10 2020, 6:42 PM
douardda closed D4431: Refactor BaseApiDepositClient to get rid of the _client argument.
Nov 10 2020, 6:42 PM
douardda updated the diff for D4433: Reformat optional arguments passed cli tests to make them easier to read.

rebase

Nov 10 2020, 5:31 PM
douardda updated the diff for D4432: Simplify `swh deposit upload` cli options.

keep the cli options but mark them as deprecated (and ignored)

Nov 10 2020, 5:30 PM
douardda updated the diff for D4431: Refactor BaseApiDepositClient to get rid of the _client argument.

rebase

Nov 10 2020, 5:30 PM
douardda added inline comments to D4432: Simplify `swh deposit upload` cli options.
Nov 10 2020, 5:18 PM
douardda added a comment to D4432: Simplify `swh deposit upload` cli options.

As i mentioned in irc during the week, I'm pretty sure those are to be used alongside
the --replace flag. So not sure if it's wise or not to remove those...

But i guess your argument about deduction out of --archive and --metadata flags stands.

Also, kinda in the same vein as vlorentz's comment, i just don't know who is using those
nor how to actually check that.

Maybe deprecating those flags, then actually explains that they
are redundant with --archive and --metadata would be best as a first step?

Nov 10 2020, 5:14 PM
douardda committed rDDEPc815bbf19a61: Add an annotated table of content at the end of the introduction (authored by douardda).
Add an annotated table of content at the end of the introduction
Nov 10 2020, 3:50 PM
douardda committed rDDEPa252e005acb1: doc: improve the user manual documentation (authored by douardda).
doc: improve the user manual documentation
Nov 10 2020, 3:50 PM
douardda closed D4450: Add an annotated table of content at the end of the introduction.
Nov 10 2020, 3:50 PM
douardda committed rDDEP91f92a12d455: doc: rename Getting Starter as User Manual and update the content (authored by douardda).
doc: rename Getting Starter as User Manual and update the content
Nov 10 2020, 3:50 PM
douardda closed D4430: doc: rename Getting Started as User Manual and update the content.
Nov 10 2020, 3:50 PM
douardda added a comment to D4438: migrate_extrinsic_metadata: don't crash when deb revisions aren't referenced by any snapshot.

As said on IRC, I think I'd rather prefer erroneous hashed to be logged somewhere rather than using an assertion.

Nov 10 2020, 3:42 PM
douardda accepted D4083: Make the indexer storage write to the journal..

overall I'm ok, but I find it really lacks some documentation/explanations of how this works, especially the JournalWriter collaborator object

Nov 10 2020, 3:12 PM
douardda updated the diff for D4450: Add an annotated table of content at the end of the introduction.

rebase

Nov 10 2020, 2:52 PM
douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

rebase

Nov 10 2020, 2:51 PM
douardda updated the diff for D4433: Reformat optional arguments passed cli tests to make them easier to read.

rebase

Nov 10 2020, 2:47 PM
douardda updated the diff for D4432: Simplify `swh deposit upload` cli options.

rebase

Nov 10 2020, 2:46 PM
douardda updated the diff for D4431: Refactor BaseApiDepositClient to get rid of the _client argument.

rebase

Nov 10 2020, 2:46 PM
douardda updated the diff for D4450: Add an annotated table of content at the end of the introduction.

rebase

Nov 10 2020, 2:44 PM
douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

remove remaining mentions of the "simplified metadata deposit"

Nov 10 2020, 2:43 PM
douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

typos and some fixes according to comments

Nov 10 2020, 2:35 PM
douardda added inline comments to D4430: doc: rename Getting Started as User Manual and update the content.
Nov 10 2020, 2:31 PM
douardda added a comment to T2767: Make the Slug header optional for the deposit server.

I need to reflect on this.
There were two major reasons for which we use the slug:

  1. create an origin
Nov 10 2020, 9:56 AM · SWORD deposit
douardda added a comment to T2767: Make the Slug header optional for the deposit server.

I've merged T2757 in this since there indeed identical

Nov 10 2020, 9:45 AM · SWORD deposit
douardda merged T2757: the Slug header should not be mandatory into T2767: Make the Slug header optional for the deposit server.
Nov 10 2020, 9:44 AM · SWORD deposit
douardda merged task T2757: the Slug header should not be mandatory into T2767: Make the Slug header optional for the deposit server.
Nov 10 2020, 9:44 AM · SWORD deposit

Nov 9 2020

douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

adapt according to vlorentz' comment

Nov 9 2020, 4:51 PM
douardda updated the diff for D4450: Add an annotated table of content at the end of the introduction.

typo (thx vlorentz)

Nov 9 2020, 4:44 PM
douardda added a revision to T2751: Clarify in documentation the specs status: D4450: Add an annotated table of content at the end of the introduction.
Nov 9 2020, 4:11 PM · Documentation
douardda updated the summary of D4450: Add an annotated table of content at the end of the introduction.
Nov 9 2020, 4:10 PM
douardda added a revision to T2751: Clarify in documentation the specs status: D4430: doc: rename Getting Started as User Manual and update the content.
Nov 9 2020, 4:10 PM · Documentation
douardda updated the summary of D4430: doc: rename Getting Started as User Manual and update the content.
Nov 9 2020, 4:10 PM
douardda retitled D4430: doc: rename Getting Started as User Manual and update the content from [WIP] doc: rename Getting Starter as User Manual and update the content to doc: rename Getting Starter as User Manual and update the content.
Nov 9 2020, 4:09 PM