Page MenuHomeSoftware Heritage
Feed Advanced Search

Nov 23 2020

douardda requested changes to D4554: requirements-test.txt: Drop no longer needed test dependency.

swh.core[db] is already a dep in requirements-swh.txt, why would is be needed here?

Nov 23 2020, 10:40 AM
douardda added inline comments to D3435: Add mercurial.from_disk.HgLoaderFromDisk.
Nov 23 2020, 10:34 AM

Nov 20 2020

douardda added a comment to D4463: Remove the <external_identifier> tag from the protocol..

I disagree. We don't want to re-use it, but to create a new tag entirely.

Nov 20 2020, 4:04 PM
douardda accepted D4541: Add content lru cache to HgLoaderFromDisk.

Besides @marmoute 's comments, lgtm

Nov 20 2020, 3:37 PM
douardda accepted D4527: Fix SWORD XMLNS (http://purl.org/net/sword/ -> http://purl.org/net/sword/terms/).

if we are confident this really is the correct URI, lgtm

Nov 20 2020, 3:32 PM
douardda accepted D4547: Move the XML parser function to a module that does not import Django..

(maybe with a better ci msg)

Nov 20 2020, 3:30 PM
douardda accepted D4544: Clarify names around the swh.core.db.pytest_plugin tests.

I would put (at least part of) the diff's message in the commit message, otherwise lgtm

Nov 20 2020, 3:15 PM
douardda accepted D4529: Fix XML parsing of the client..

it looks good to me, but, yeah...

Nov 20 2020, 3:06 PM
douardda accepted D4530: Use a XML prefix for dublincore tags, instead of merging them with atom tags..
Nov 20 2020, 2:32 PM
douardda accepted D4533: Explicitly use the atom: prefix internally..

lgtm (besides the red CI flag)

Nov 20 2020, 2:29 PM
douardda added a project to T2800: Add support for webhook-like callback for the vault: Vault.
Nov 20 2020, 11:18 AM · Vault
douardda triaged T2800: Add support for webhook-like callback for the vault as Wishlist priority.
Nov 20 2020, 11:17 AM · Vault
douardda updated subscribers of T2799: Add support for SWHID as source of repository for jupyterhub.
Nov 20 2020, 11:12 AM
douardda triaged T2799: Add support for SWHID as source of repository for jupyterhub as Wishlist priority.
Nov 20 2020, 11:12 AM

Nov 18 2020

douardda added a comment to T2794: Open a sword v3 compliant deposit implementation.

Note that we will need to keep the support for SWORD v2 (at least for a while), so this open question must be tackled with this in mind.

Nov 18 2020, 2:10 PM · SWORD deposit
douardda added a comment to T2537: Extend new deposit endpoint to support metadata-only deposits.

Is this related to T1021?

I'd say yes, i added that task as parent task.

Nov 18 2020, 11:46 AM · Roadmap 2020, SWORD deposit, Scientific Community Building
douardda added a comment to D4491: assets/revision/diff-utils: Fix text selection in revision view.

why store the current tab in a variable instead of just asking it when needed (using something similar to e.currentTarget.text.trim())?

Nov 18 2020, 10:05 AM
douardda accepted D4498: add journal of notable archival changes.
Nov 18 2020, 10:00 AM

Nov 17 2020

douardda added a comment to D4490: swh.deposit.parsers: Ensure SWHIDs with context are strictly valid.

IMO the SWHID class should store the parsed core SWHIDs for its qualifiers, so they don't need to be parsed twice (+ error handling)

Nov 17 2020, 12:36 PM · SWORD deposit
douardda added a comment to T2537: Extend new deposit endpoint to support metadata-only deposits.

Is this related to T1021?

Nov 17 2020, 12:34 PM · Roadmap 2020, SWORD deposit, Scientific Community Building
douardda added a comment to T2767: Make the Slug header optional for the deposit server.

It seems to make perfectly sense to use the same logic as the metadata deposit to handle this problem, I think.

Nov 17 2020, 12:25 PM · SWORD deposit
douardda added a comment to T2779: Put information (client, collection and deposit-id) inside metadata for metadata-only deposit.

I may have missed something (several actually) but where is this swh:deposit namespace specified?

Nov 17 2020, 12:13 PM · Metadata workflow, SWORD deposit
douardda added a comment to T2786: UI: wrong usage of the "go to origin" icon next to the origin URL that redirect to SWH.

We need to think about a better UI for this, but I have no solution for now.

Nov 17 2020, 11:53 AM · Web app
douardda added a comment to T2779: Put information (client, collection and deposit-id) inside metadata for metadata-only deposit.

To solve this discrepancy, deposit message should be added in the xml.

Nov 17 2020, 11:52 AM · Metadata workflow, SWORD deposit
douardda added a comment to T2779: Put information (client, collection and deposit-id) inside metadata for metadata-only deposit.

I may have missed something (several actually) but where is this swh:deposit namespace specified?

Nov 17 2020, 11:48 AM · Metadata workflow, SWORD deposit
douardda committed rDLDBASEcbd9c82c571b: Small docstring improvements in the deposit loader code (authored by douardda).
Small docstring improvements in the deposit loader code
Nov 17 2020, 11:39 AM
douardda added a comment to T2781: Make it obvious that services are the staging version.

Also I believe this will have its priority to be elevated once the staging is officially made public

Nov 17 2020, 11:32 AM · Easy hack, SWORD deposit, Web app
douardda added a comment to T2781: Make it obvious that services are the staging version.

I vote for the big fat red banner

Nov 17 2020, 11:31 AM · Easy hack, SWORD deposit, Web app
douardda triaged T2786: UI: wrong usage of the "go to origin" icon next to the origin URL that redirect to SWH as Normal priority.
Nov 17 2020, 11:11 AM · Web app
douardda added a comment to T2778: Change origin on deposit checks to use the same origin.

I don't understand this. I guess https://www.softwareheritage.org/check-deposit-2020-11-15T21:58:29.744061 is the Origin URL of this deposit https://archive.softwareheritage.org/browse/origin/directory/?origin_url=https://www.softwareheritage.org/check-deposit-2020-11-15T21:58:29.744061 right?

Nov 17 2020, 11:01 AM · SWORD deposit
douardda added a comment to D4463: Remove the <external_identifier> tag from the protocol..

What bugs me is that said url does not make sense (when that slug is generated), that url means nothing...

It does mean something, it's an URI (resource identifier) but not an URL (resource locator)

Nov 17 2020, 10:41 AM
douardda added a comment to T2782: add a "Filter Pull Requests" checkbox (or similar) in the Branches view of an origin in the web UI .

I don't mind a "quick" solution that fixes the problem right now, and revisit it once we have T2459 fixed. Not sure I agree with the statement that my proposal in the comment is "fine-grained" though. I mean I suggest nothing fancy, just a few entries in a dropbox which content is built from the "directories" of the branch names.

Nov 17 2020, 10:17 AM · Web app

Nov 16 2020

douardda added a comment to T2782: add a "Filter Pull Requests" checkbox (or similar) in the Branches view of an origin in the web UI .

maybe a generic widget allowing to select the "directory" to show in the branches view, For example, in this DGtal repository, it would propose a dropbox with the following items:

Nov 16 2020, 5:07 PM · Web app
douardda triaged T2782: add a "Filter Pull Requests" checkbox (or similar) in the Branches view of an origin in the web UI as High priority.
Nov 16 2020, 5:03 PM · Web app

Nov 13 2020

douardda added a comment to D4463: Remove the <external_identifier> tag from the protocol..

if the slug is not sent, then the id is generated (like we currently do in the cli) but server side.

Let me clarify, that's a hack that was installed for a specific user (which does not use it in the end).

I don't want that hack from the cli ending up server side...

We should probably rework the origin logic first then.

Nov 13 2020, 3:00 PM
douardda accepted D4464: Remove the <client> tag from the protocol..
Nov 13 2020, 2:54 PM
douardda added a comment to D4463: Remove the <external_identifier> tag from the protocol..

Something is amiss for me somewhere.

I thought we wanted to relax the slug's mandatory property (to respect the sword v2 spec).
If that's the case, we still need the external_identifier in the loop.

Because that's what's used at some point to build the origin for the loader.

What did i miss?

Because, from my understanding of another discussion with @douardda
or @moranegg, i thought it was the other way around.
Keep the external_identifier and use it if not provided by the slug.

So that way, we can go the optional slug road.

Nov 13 2020, 10:42 AM

Nov 12 2020

douardda committed rDDEP125c4831b443: Customize the user-agent header in deposit client classes (authored by douardda).
Customize the user-agent header in deposit client classes
Nov 12 2020, 10:18 AM
douardda committed rDDEP3a48807eb2b0: Use a requests.Session object in deposit client classes (authored by douardda).
Use a requests.Session object in deposit client classes
Nov 12 2020, 10:18 AM
douardda closed D4459: Customize the user-agent header in deposit client classes.
Nov 12 2020, 10:18 AM

Nov 10 2020

douardda created D4459: Customize the user-agent header in deposit client classes.
Nov 10 2020, 6:43 PM
douardda committed rDDEPb5da4cdd68db: Reformat optional arguments passed cli tests to make them easier to read (authored by douardda).
Reformat optional arguments passed cli tests to make them easier to read
Nov 10 2020, 6:42 PM
douardda closed D4433: Reformat optional arguments passed cli tests to make them easier to read.
Nov 10 2020, 6:42 PM
douardda closed D4432: Simplify `swh deposit upload` cli options.
Nov 10 2020, 6:42 PM
douardda committed rDDEP7421887510ba: Simplify `swh deposit upload` cli options (authored by douardda).
Simplify `swh deposit upload` cli options
Nov 10 2020, 6:42 PM
douardda committed rDDEP1129a092e671: Refactor BaseApiDepositClient to get rid of the _client argument (authored by douardda).
Refactor BaseApiDepositClient to get rid of the _client argument
Nov 10 2020, 6:42 PM
douardda closed D4431: Refactor BaseApiDepositClient to get rid of the _client argument.
Nov 10 2020, 6:42 PM
douardda updated the diff for D4433: Reformat optional arguments passed cli tests to make them easier to read.

rebase

Nov 10 2020, 5:31 PM
douardda updated the diff for D4432: Simplify `swh deposit upload` cli options.

keep the cli options but mark them as deprecated (and ignored)

Nov 10 2020, 5:30 PM
douardda updated the diff for D4431: Refactor BaseApiDepositClient to get rid of the _client argument.

rebase

Nov 10 2020, 5:30 PM
douardda added inline comments to D4432: Simplify `swh deposit upload` cli options.
Nov 10 2020, 5:18 PM
douardda added a comment to D4432: Simplify `swh deposit upload` cli options.

As i mentioned in irc during the week, I'm pretty sure those are to be used alongside
the --replace flag. So not sure if it's wise or not to remove those...

But i guess your argument about deduction out of --archive and --metadata flags stands.

Also, kinda in the same vein as vlorentz's comment, i just don't know who is using those
nor how to actually check that.

Maybe deprecating those flags, then actually explains that they
are redundant with --archive and --metadata would be best as a first step?

Nov 10 2020, 5:14 PM
douardda committed rDDEPc815bbf19a61: Add an annotated table of content at the end of the introduction (authored by douardda).
Add an annotated table of content at the end of the introduction
Nov 10 2020, 3:50 PM
douardda committed rDDEPa252e005acb1: doc: improve the user manual documentation (authored by douardda).
doc: improve the user manual documentation
Nov 10 2020, 3:50 PM
douardda closed D4450: Add an annotated table of content at the end of the introduction.
Nov 10 2020, 3:50 PM
douardda committed rDDEP91f92a12d455: doc: rename Getting Starter as User Manual and update the content (authored by douardda).
doc: rename Getting Starter as User Manual and update the content
Nov 10 2020, 3:50 PM
douardda closed D4430: doc: rename Getting Started as User Manual and update the content.
Nov 10 2020, 3:50 PM
douardda added a comment to D4438: migrate_extrinsic_metadata: don't crash when deb revisions aren't referenced by any snapshot.

As said on IRC, I think I'd rather prefer erroneous hashed to be logged somewhere rather than using an assertion.

Nov 10 2020, 3:42 PM
douardda accepted D4083: Make the indexer storage write to the journal..

overall I'm ok, but I find it really lacks some documentation/explanations of how this works, especially the JournalWriter collaborator object

Nov 10 2020, 3:12 PM
douardda updated the diff for D4450: Add an annotated table of content at the end of the introduction.

rebase

Nov 10 2020, 2:52 PM
douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

rebase

Nov 10 2020, 2:51 PM
douardda updated the diff for D4433: Reformat optional arguments passed cli tests to make them easier to read.

rebase

Nov 10 2020, 2:47 PM
douardda updated the diff for D4432: Simplify `swh deposit upload` cli options.

rebase

Nov 10 2020, 2:46 PM
douardda updated the diff for D4431: Refactor BaseApiDepositClient to get rid of the _client argument.

rebase

Nov 10 2020, 2:46 PM
douardda updated the diff for D4450: Add an annotated table of content at the end of the introduction.

rebase

Nov 10 2020, 2:44 PM
douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

remove remaining mentions of the "simplified metadata deposit"

Nov 10 2020, 2:43 PM
douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

typos and some fixes according to comments

Nov 10 2020, 2:35 PM
douardda added inline comments to D4430: doc: rename Getting Started as User Manual and update the content.
Nov 10 2020, 2:31 PM
douardda added a comment to T2767: Make the Slug header optional for the deposit server.

I need to reflect on this.
There were two major reasons for which we use the slug:

  1. create an origin
Nov 10 2020, 9:56 AM · SWORD deposit
douardda added a comment to T2767: Make the Slug header optional for the deposit server.

I've merged T2757 in this since there indeed identical

Nov 10 2020, 9:45 AM · SWORD deposit
douardda merged T2757: the Slug header should not be mandatory into T2767: Make the Slug header optional for the deposit server.
Nov 10 2020, 9:44 AM · SWORD deposit
douardda merged task T2757: the Slug header should not be mandatory into T2767: Make the Slug header optional for the deposit server.
Nov 10 2020, 9:44 AM · SWORD deposit

Nov 9 2020

douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

adapt according to vlorentz' comment

Nov 9 2020, 4:51 PM
douardda updated the diff for D4450: Add an annotated table of content at the end of the introduction.

typo (thx vlorentz)

Nov 9 2020, 4:44 PM
douardda added a revision to T2751: Clarify in documentation the specs status: D4450: Add an annotated table of content at the end of the introduction.
Nov 9 2020, 4:11 PM · Documentation
douardda updated the summary of D4450: Add an annotated table of content at the end of the introduction.
Nov 9 2020, 4:10 PM
douardda added a revision to T2751: Clarify in documentation the specs status: D4430: doc: rename Getting Started as User Manual and update the content.
Nov 9 2020, 4:10 PM · Documentation
douardda updated the summary of D4430: doc: rename Getting Started as User Manual and update the content.
Nov 9 2020, 4:10 PM
douardda retitled D4430: doc: rename Getting Started as User Manual and update the content from [WIP] doc: rename Getting Starter as User Manual and update the content to doc: rename Getting Starter as User Manual and update the content.
Nov 9 2020, 4:09 PM
douardda created D4450: Add an annotated table of content at the end of the introduction.
Nov 9 2020, 4:08 PM
douardda updated the diff for D4430: doc: rename Getting Started as User Manual and update the content.

Rebase, updates and fix according to moranegg's comments

Nov 9 2020, 4:08 PM
douardda added inline comments to D4430: doc: rename Getting Started as User Manual and update the content.
Nov 9 2020, 3:59 PM
douardda added a comment to T2751: Clarify in documentation the specs status.

I also think that if we use the "blueprint" terminology, then the blueprint section should list all the specifications, implemented or not, but make it very clear what's done, in-progress or only planned.

Nov 9 2020, 3:49 PM · Documentation
douardda added a comment to T2751: Clarify in documentation the specs status.
  • I wonder if it would not make more sense to put spec-meta-deposit.rst and spec-sparse-deposit.rst within blueprint.rst since these are other "use cases"
  • I don't really get the point of the specs/spec-technical.rst file - it should be deleted (maybe keep one or 2 paragraphs in there)
  • I think dev-info.rst and sys-info.rst should be moved in a subsection ("Reference" or "Technical Doc" or something)
  • I think the tests/tests_HAL.rst file should not be in the docs
Nov 9 2020, 3:47 PM · Documentation
douardda updated the diff for D4432: Simplify `swh deposit upload` cli options.

fix formatting in the docstring of client_command_parse_input()

Nov 9 2020, 11:57 AM
douardda added a comment to D4442: deposit.parser: Add parse_swh_reference function.

Agree with @vlorentz, we should not clean a given SWHID, it's either valid or not

Nov 9 2020, 10:59 AM
douardda updated the diff for D4433: Reformat optional arguments passed cli tests to make them easier to read.

rebase

Nov 9 2020, 10:46 AM
douardda updated the diff for D4432: Simplify `swh deposit upload` cli options.

rebase

Nov 9 2020, 10:46 AM
douardda updated the diff for D4431: Refactor BaseApiDepositClient to get rid of the _client argument.

fix a few more oopsies reported by vlorentz and ardumont

Nov 9 2020, 10:45 AM
douardda added inline comments to D4431: Refactor BaseApiDepositClient to get rid of the _client argument.
Nov 9 2020, 10:42 AM
douardda added inline comments to D4431: Refactor BaseApiDepositClient to get rid of the _client argument.
Nov 9 2020, 10:01 AM
douardda updated the diff for D4431: Refactor BaseApiDepositClient to get rid of the _client argument.

Fixes according to vlorentz' comments

Nov 9 2020, 9:54 AM
douardda updated the diff for D4433: Reformat optional arguments passed cli tests to make them easier to read.

rebase

Nov 9 2020, 9:43 AM
douardda updated the diff for D4432: Simplify `swh deposit upload` cli options.

rebas

Nov 9 2020, 9:43 AM
douardda updated the diff for D4431: Refactor BaseApiDepositClient to get rid of the _client argument.

rebase

Nov 9 2020, 9:43 AM
douardda updated the diff for D4433: Reformat optional arguments passed cli tests to make them easier to read.

rebase

Nov 9 2020, 9:40 AM
douardda committed rDDEP7148a257b296: doc: add an introduction paragraph in blueprint.rst (authored by douardda).
doc: add an introduction paragraph in blueprint.rst
Nov 9 2020, 9:38 AM
douardda closed D4429: doc: add an introduction paragraph in blueprint.rst.
Nov 9 2020, 9:38 AM
douardda updated the diff for D4429: doc: add an introduction paragraph in blueprint.rst.

rebase

Nov 9 2020, 9:38 AM

Nov 6 2020

douardda added inline comments to D4429: doc: add an introduction paragraph in blueprint.rst.
Nov 6 2020, 3:27 PM