Page MenuHomeSoftware Heritage
Feed All Stories

Dec 8 2022

ardumont added a comment to D8941: replay: Simplify FileEditor implementation.

Neat simplification indeed, congrats \m/.

Dec 8 2022, 11:57 AM
ardumont added inline comments to D8941: replay: Simplify FileEditor implementation.
Dec 8 2022, 11:56 AM
ardumont accepted D8882: replay: Do not ignore externals in copyfrom operations.
Dec 8 2022, 11:53 AM
vlorentz closed D8935: Add dataset name to the export id.
Dec 8 2022, 11:47 AM
vlorentz committed rDGRPH94b1d2c14fe8: Add dataset name to the export id (authored by vlorentz).
Add dataset name to the export id
Dec 8 2022, 11:47 AM
ardumont created P1542 (An Untitled Masterwork).
Dec 8 2022, 11:46 AM
vlorentz closed T4354: Contribute terms to ForgeFed as Resolved.
Dec 8 2022, 11:44 AM · Archive search, Metadata workflow
vlorentz closed T4354: Contribute terms to ForgeFed, a subtask of T4249: Choose/define an ontology to use for indexed extrinsic origin metadata, as Resolved.
Dec 8 2022, 11:44 AM · Archive search, Metadata workflow
vlorentz updated the task description for T4354: Contribute terms to ForgeFed.
Dec 8 2022, 11:44 AM · Archive search, Metadata workflow
vlorentz accepted D8944: replay: Copy dir states and external paths in copy_from operations.
Dec 8 2022, 11:39 AM
anlambert added a comment to D8944: replay: Copy dir states and external paths in copy_from operations.

Are you sure the path argument to add_directory cannot start with a / or contain ..?

Dec 8 2022, 11:38 AM
vlorentz accepted D8882: replay: Do not ignore externals in copyfrom operations.
Dec 8 2022, 11:38 AM
olasd added inline comments to D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.
Dec 8 2022, 11:38 AM
vlorentz accepted D8941: replay: Simplify FileEditor implementation.

huh, nice

Dec 8 2022, 11:38 AM
vlorentz added a comment to D8944: replay: Copy dir states and external paths in copy_from operations.

Are you sure the path argument to add_directory cannot start with a / or contain ..?

Dec 8 2022, 11:36 AM
ardumont added a comment to D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.

For --staging, we just want to create a single oneshot full listing task.

Why bother with a full listing?

Since we now got the means to limit the listing, i'd use it. That's a faster feedback
loop and without stressing twice the upstream forge (one round for staging and another
for production at a relatively small intervals of time).

Maybe I need to configure the default to a more sensible defaults though. Like 3 pages
with 10 results (so that we can see that the pagination works too). Currently it's a tad
small (1 page of 15 or something).

Sure, that's what I meant: for staging, schedule a oneshot full listing task, but with the limiting and "origin disabling" options ("full" as opposed to "incremental")

Dec 8 2022, 11:36 AM
ardumont added inline comments to D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.
Dec 8 2022, 11:35 AM
vlorentz accepted D8942: utils: Raise ValueError when external definition could not be parsed.
Dec 8 2022, 11:33 AM
olasd added inline comments to D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.
Dec 8 2022, 11:31 AM
vlorentz added a comment to D8939: Rework the replaying exception handling.

Could you use a logger instance, and add if logger.isEnabledFor(logging.DEBUG): before logger.debug statements that use hash_to_hex?

Dec 8 2022, 11:31 AM
olasd added a comment to D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.

For --staging, we just want to create a single oneshot full listing task.

Why bother with a full listing?

Since we now got the means to limit the listing, i'd use it. That's a faster feedback
loop and without stressing twice the upstream forge (one round for staging and another
for production at a relatively small intervals of time).

Maybe I need to configure the default to a more sensible defaults though. Like 3 pages
with 10 results (so that we can see that the pagination works too). Currently it's a tad
small (1 page of 15 or something).

Dec 8 2022, 11:28 AM
douardda accepted D8935: Add dataset name to the export id.
Dec 8 2022, 11:26 AM
douardda added inline comments to D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.
Dec 8 2022, 11:22 AM
ardumont added a comment to D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.

So, I don't think we want to have to remember if there's full or incremental versions
of the lister.

Dec 8 2022, 11:18 AM
anlambert added inline comments to D8882: replay: Do not ignore externals in copyfrom operations.
Dec 8 2022, 11:15 AM
anlambert added inline comments to D8943: svn: Use urllib.parse.quote to percent encode svn URLs.
Dec 8 2022, 11:15 AM
swh-public-ci added a comment to D8911: Remove ambiguous item cursors.

Build is green

Dec 8 2022, 11:13 AM
jayeshv updated the diff for D8911: Remove ambiguous item cursors.

extra tests for pagination

Dec 8 2022, 11:10 AM
vlorentz accepted D8946: svn_retry: Reduce max number of retry attempts to 3.
Dec 8 2022, 2:29 AM
vlorentz accepted D8945: api, browse: Ensure to sanitize filename passed to django FileResponse.
Dec 8 2022, 2:29 AM
vlorentz requested changes to D8943: svn: Use urllib.parse.quote to percent encode svn URLs.
Dec 8 2022, 2:27 AM

Dec 7 2022

olasd added a comment to D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.

So, I don't think we want to have to remember if there's full or incremental versions of the lister.

Dec 7 2022, 6:10 PM
anlambert requested review of D8946: svn_retry: Reduce max number of retry attempts to 3.
Dec 7 2022, 6:06 PM
anlambert requested review of D8945: api, browse: Ensure to sanitize filename passed to django FileResponse.
Dec 7 2022, 6:06 PM
anlambert raised the priority of T4673: AssertionError: 7911 != 7912 in subvertpy while parsing txdeltas from Low to Normal.
Dec 7 2022, 5:46 PM · SVN Loader
anlambert requested review of D8944: replay: Copy dir states and external paths in copy_from operations.
Dec 7 2022, 5:43 PM
anlambert requested review of D8943: svn: Use urllib.parse.quote to percent encode svn URLs.
Dec 7 2022, 5:42 PM
anlambert requested review of D8942: utils: Raise ValueError when external definition could not be parsed.
Dec 7 2022, 5:41 PM
anlambert requested review of D8941: replay: Simplify FileEditor implementation.
Dec 7 2022, 5:36 PM
anlambert added a revision to T4673: AssertionError: 7911 != 7912 in subvertpy while parsing txdeltas: D8941: replay: Simplify FileEditor implementation.
Dec 7 2022, 5:33 PM · SVN Loader
swh-public-ci added a comment to D8882: replay: Do not ignore externals in copyfrom operations.

Build is green

Dec 7 2022, 5:27 PM
Harbormaster failed to build B33170: rDGRPHcd69e48b5acc: Replace RunAll with RunExportCompressUpload for rDGRPHcd69e48b5acc: Replace RunAll with RunExportCompressUpload!
Dec 7 2022, 5:26 PM
Harbormaster failed to build B33168: rDGRPH042af3adf5b6: Fix crash when the sensitive dataset directory does not exist for rDGRPH042af3adf5b6: Fix crash when the sensitive dataset directory does not exist!
Dec 7 2022, 5:26 PM
Harbormaster failed to build B33169: rDGRPH233b0508395a: Prevent incorrect warning from being printed to output files for rDGRPH233b0508395a: Prevent incorrect warning from being printed to output files!
Dec 7 2022, 5:25 PM
anlambert updated the diff for D8882: replay: Do not ignore externals in copyfrom operations.

Rebase

Dec 7 2022, 5:23 PM
anlambert closed D8913: loader: Compute contents diff when detecting trees divergence.
Dec 7 2022, 5:19 PM
anlambert committed rDLDSVN2b80484b8b70: loader: Compute contents diff when detecting trees divergence (authored by anlambert).
loader: Compute contents diff when detecting trees divergence
Dec 7 2022, 5:19 PM
vlorentz closed D8932: Replace RunAll with RunExportCompressUpload.
Dec 7 2022, 5:15 PM
vlorentz committed rDGRPHcd69e48b5acc: Replace RunAll with RunExportCompressUpload (authored by vlorentz).
Replace RunAll with RunExportCompressUpload
Dec 7 2022, 5:15 PM
vlorentz closed D8931: Prevent incorrect warning from being printed to output files.
Dec 7 2022, 5:15 PM
vlorentz committed rDGRPH233b0508395a: Prevent incorrect warning from being printed to output files (authored by vlorentz).
Prevent incorrect warning from being printed to output files
Dec 7 2022, 5:15 PM
vlorentz committed rDGRPH042af3adf5b6: Fix crash when the sensitive dataset directory does not exist (authored by vlorentz).
Fix crash when the sensitive dataset directory does not exist
Dec 7 2022, 5:15 PM
moranegg added a comment to T4264: Add photos to https://mybox.inria.fr/.

@rdicosmo: Can you share access to the myBox so I can add what I have from events?

Dec 7 2022, 5:08 PM · Unknown Object (Project)
ardumont requested review of D8940: cli.add_forge_now: Open `register-lister` with sensible defaults.
Dec 7 2022, 5:05 PM
vlorentz requested review of D8935: Add dataset name to the export id.
Dec 7 2022, 4:57 PM
ardumont closed D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.
Dec 7 2022, 4:54 PM
ardumont committed rDSCH1c34e9837faa: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults (authored by ardumont).
cli.add_forge_now: Open `schedule-first-visits` with sensible defaults
Dec 7 2022, 4:54 PM
douardda updated the summary of D8939: Rework the replaying exception handling.
Dec 7 2022, 4:51 PM
douardda updated the summary of D8939: Rework the replaying exception handling.
Dec 7 2022, 4:51 PM
swh-public-ci added a comment to D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.

Build is green

Dec 7 2022, 4:49 PM
ardumont updated the diff for D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.

Adapt according to suggestion

Dec 7 2022, 4:44 PM
ardumont added a comment to D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.

d'oh at your suggestions, you're totally right!

Dec 7 2022, 4:43 PM
douardda accepted D8877: Fix incorrect error messages when failing to connect.

(no test? worth it?)

Dec 7 2022, 4:43 PM
olasd accepted D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.

A couple of suggestions inline, but lgtm otherwise, thanks!

Dec 7 2022, 4:38 PM
vlorentz closed D8934: Remove tool ids from Kafka messages.
Dec 7 2022, 4:34 PM
vlorentz committed rDCIDXe8549400bc54: Remove tool ids from Kafka messages (authored by vlorentz).
Remove tool ids from Kafka messages
Dec 7 2022, 4:34 PM
douardda requested review of D8939: Rework the replaying exception handling.
Dec 7 2022, 4:33 PM
franckbret added a comment to D8539: Add random directory sampling policy.
In D8539#222941, @olasd wrote:

Fine, i've one comment i'd like others to have a look at though [1] regarding where
that new discovery (interface) code should go. It feels currently a bit off to me that this code
is in loader-core. Loaders are not the sole archive consumers (scanner, webapp, cli, indexer, cooker, ...).

[1] https://forge.softwareheritage.org/D8539?id=30916#inline-60774

@vlorentz @douardda @olasd @anlambert ^

If it's to be used generically, this discovery code should pretty definitely not be in swh.loader.core.

  • The generic discovery algorithm, and base abstract classes/protocols, should probably be in swh.model, as they're tied to that structure;
  • The swh.storage-based discovery mechanism could live in swh.storage.algorithms, and be used by swh.loader.core;
  • The REST API-based discovery mechanism could live in swh.web.client, or stay in swh.scanner.
Dec 7 2022, 4:28 PM
swh-public-ci added a comment to D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.

Build is green

Dec 7 2022, 4:21 PM
ardumont accepted D8934: Remove tool ids from Kafka messages.

I was sold on the description!

Dec 7 2022, 4:20 PM
ardumont retitled D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults from cli.add_forge_now: Open `schedule-first-visits` with sensible defaultsg to cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.
Dec 7 2022, 4:16 PM
ardumont updated the summary of D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.
Dec 7 2022, 4:15 PM
ardumont updated the diff for D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.

Adapt according to irc sysadm discussion

Dec 7 2022, 4:15 PM
ardumont retitled D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults from cli.origin: Open `swh scheduler add-forge-now` with sensible defaults to cli.add_forge_now: Open `schedule-first-visits` with sensible defaultsg.
Dec 7 2022, 4:15 PM
zack committed rMSLD092885119d81: check in slides for today's talk at ENS Rennes (authored by zack).
check in slides for today's talk at ENS Rennes
Dec 7 2022, 3:42 PM
zack committed rMSLD4dc4ba49fd61: swh-fuse: add ID references to all tutorial slides (authored by zack).
swh-fuse: add ID references to all tutorial slides
Dec 7 2022, 3:42 PM
zack committed rMSLDa226eb6e7bc1: status module: update size/nodes/edges counts (authored by zack).
status module: update size/nodes/edges counts
Dec 7 2022, 3:42 PM
zack committed rMSLDc87694b9ff77: ESE research module: adding missing subtitle in a paper reference (authored by zack).
ESE research module: adding missing subtitle in a paper reference
Dec 7 2022, 3:42 PM
swh-public-ci added a comment to D8932: Replace RunAll with RunExportCompressUpload.

Build is green

Dec 7 2022, 3:36 PM
Harbormaster failed remote builds in B33157: Diff 32203 for D8931: Prevent incorrect warning from being printed to output files!
Dec 7 2022, 3:35 PM
swh-public-ci added a comment to D8931: Prevent incorrect warning from being printed to output files.

Build has FAILED

Dec 7 2022, 3:35 PM
swh-public-ci added a comment to D8932: Replace RunAll with RunExportCompressUpload.

Build is green

Dec 7 2022, 3:29 PM
Harbormaster failed remote builds in B33153: Diff 32199 for D8931: Prevent incorrect warning from being printed to output files!
Dec 7 2022, 3:28 PM
swh-public-ci added a comment to D8931: Prevent incorrect warning from being printed to output files.

Build has FAILED

Dec 7 2022, 3:28 PM
ardumont requested review of D8936: cli.add_forge_now: Open `schedule-first-visits` with sensible defaults.
Dec 7 2022, 3:26 PM
vlorentz updated the diff for D8932: Replace RunAll with RunExportCompressUpload.

rebase

Dec 7 2022, 3:24 PM
vlorentz updated the diff for D8931: Prevent incorrect warning from being printed to output files.

I'm tired

Dec 7 2022, 3:24 PM
swh-public-ci added a comment to D8932: Replace RunAll with RunExportCompressUpload.

Build is green

Dec 7 2022, 3:20 PM
Harbormaster failed remote builds in B33150: Diff 32196 for D8931: Prevent incorrect warning from being printed to output files!
Dec 7 2022, 3:19 PM
swh-public-ci added a comment to D8931: Prevent incorrect warning from being printed to output files.

Build has FAILED

Dec 7 2022, 3:19 PM
vlorentz updated the diff for D8932: Replace RunAll with RunExportCompressUpload.

rebase

Dec 7 2022, 3:18 PM
vlorentz updated the diff for D8931: Prevent incorrect warning from being printed to output files.

remove useless function

Dec 7 2022, 3:18 PM
vlorentz updated the diff for D8932: Replace RunAll with RunExportCompressUpload.

rebase

Dec 7 2022, 3:10 PM
vlorentz updated the diff for D8931: Prevent incorrect warning from being printed to output files.

less awful fix

Dec 7 2022, 3:09 PM
vlorentz planned changes to D8931: Prevent incorrect warning from being printed to output files.
Dec 7 2022, 2:44 PM
vlorentz added a comment to D8931: Prevent incorrect warning from being printed to output files.
In D8931#232231, @olasd wrote:

Why not just touch all the files?

Dec 7 2022, 2:43 PM
vlorentz created P1541 (An Untitled Masterwork).
Dec 7 2022, 2:31 PM
Harbormaster failed to build B33148: rDGRPH100ccd1d1e47: setup.py: Ensure testing requirements include luigi for rDGRPH100ccd1d1e47: setup.py: Ensure testing requirements include luigi!
Dec 7 2022, 2:27 PM
Harbormaster failed to build B33147: rDGRPHe00ddf515557: luigi/origin_contributors: Fix mypy error with Python 3.7 for rDGRPHe00ddf515557: luigi/origin_contributors: Fix mypy error with Python 3.7!
Dec 7 2022, 2:27 PM
vlorentz requested review of D8934: Remove tool ids from Kafka messages.
Dec 7 2022, 2:20 PM