
Feb 11 2021

douardda accepted D5035: package: Mark visit as not_found when relevant.

lgtm but I would wait for a green stamp from @vlorentz as well

Feb 11 2021, 10:10 AM
douardda accepted D5026: core: Mark visit status as failed when relevant.

lgtm

Feb 11 2021, 10:04 AM

Feb 10 2021

douardda added a revision to T3034: generalize usage of SWHID for referencing SWH archive objects: D4985: [WIP] Add support for SWHID on Origin.
Feb 10 2021, 9:57 AM · Data Model
douardda added a task to D4985: [WIP] Add support for SWHID on Origin: T3034: generalize usage of SWHID for referencing SWH archive objects.
Feb 10 2021, 9:57 AM
douardda added a revision to T3034: generalize usage of SWHID for referencing SWH archive objects: D4986: [WIP] Add a composite swhid type in postgresql.
Feb 10 2021, 9:56 AM · Data Model

Feb 9 2021

douardda triaged T3034: generalize usage of SWHID for referencing SWH archive objects as High priority.
Feb 9 2021, 3:37 PM · Data Model
douardda added inline comments to D5029: Add raw_extrinsic_metadata.id column in postgresql..
Feb 9 2021, 10:16 AM
douardda added a comment to D5008: identifiers: Properly define the behavior of raw_extrinsic_metadata on negative timestamps..

It would be nice to have an explanation in the commit message of why this is actually needed. Explain why the original implementation is incorrect, and how this change fixes the old incorrect behavior.

Feb 9 2021, 10:14 AM

Feb 5 2021

douardda added a comment to T2912: Next generation archive counters.

@vsellier nice. Note that if we draw these with a y-axis starting from 0, the step shape will be really negligible, so IMHO it's really not a problem.

Feb 5 2021, 10:59 AM · Roadmap 2021, System administration, Monitoring, Web app
douardda triaged T3029: Add validator and tests for the escaped nature of the origin URI of the SWHID context qualifier as Normal priority.
Feb 5 2021, 9:49 AM · Data Model

Feb 4 2021

douardda requested review of D4985: [WIP] Add support for SWHID on Origin.
Feb 4 2021, 3:38 PM
douardda closed D5010: Fix swh.deposit.urls typing.
Feb 4 2021, 3:08 PM
douardda committed rDDEP6275caba9314: Fix swh.deposit.urls typing (authored by douardda).
Feb 4 2021, 3:08 PM
douardda updated the diff for D5010: Fix swh.deposit.urls typing.

put the deps on djangorestframework-stubs >= 1.4 in requirement-tests.txt

Feb 4 2021, 2:22 PM
douardda requested review of D5010: Fix swh.deposit.urls typing.
Feb 4 2021, 2:13 PM

Feb 2 2021

douardda accepted D4997: Remove no longer used models field in dict returned by register.
Feb 2 2021, 4:46 PM
douardda created P936 Command-Line Input.
Feb 2 2021, 3:59 PM
douardda accepted D4975: Change origin_visit_status_get_random interface to return visit_status.
Feb 2 2021, 10:53 AM
douardda accepted D4976: storage: Make origin_get_latest_visit_status return OriginVisitStatus.
Feb 2 2021, 10:52 AM
douardda accepted D4977: Update persistent identifiers doc with pip install info.

ok but see my comment

Feb 2 2021, 10:50 AM
douardda accepted D4931: Add mapping of definitions and harvests.
Feb 2 2021, 10:44 AM

Feb 1 2021

douardda added a comment to T2799: Add support for SWHID as source of repository for jupyterhub.

Note that the PR for repo2docker has been merged, and another PR for binderhub is currently in progress.

Feb 1 2021, 4:20 PM
douardda closed D4923: Simulation: allow to export results in a csv file.
Feb 1 2021, 3:57 PM
douardda committed rDSCHaaffff2631a7: Simulator: allow to export results in a csv file (authored by douardda).
Feb 1 2021, 3:57 PM
douardda closed D4984: Add minimal tests for the SimulationReport.format() method.
Feb 1 2021, 3:56 PM
douardda committed rDSCH9fce3f6f2c73: Add minimal tests for the SimulationReport.format() method (authored by douardda).
Feb 1 2021, 3:56 PM
douardda requested review of D4984: Add minimal tests for the SimulationReport.format() method.
Feb 1 2021, 3:40 PM
douardda updated the diff for D4923: Simulation: allow to export results in a csv file.

add minimal test

Feb 1 2021, 3:39 PM
douardda closed D4921: Make plotting optional in simulator cli command.
Feb 1 2021, 3:11 PM
douardda committed rDSCHaaf7dd6f1d82: Make plottings optional in simulator cli output (authored by douardda).
Feb 1 2021, 3:11 PM

Jan 29 2021

douardda added a comment to D4923: Simulation: allow to export results in a csv file.

(requesting changes to get it out of my review queue)

That's not a valid reason! A valid reason is "I agree with olasd's comments, fix them (plz)"...

Jan 29 2021, 5:45 PM
douardda updated the diff for D4923: Simulation: allow to export results in a csv file.

typo

Jan 29 2021, 5:37 PM
douardda added inline comments to D4923: Simulation: allow to export results in a csv file.
Jan 29 2021, 5:34 PM
douardda added a comment to D4923: Simulation: allow to export results in a csv file.

(requesting changes to get it out of my review queue)

Jan 29 2021, 5:29 PM
douardda updated the diff for D4923: Simulation: allow to export results in a csv file.

rebase

Jan 29 2021, 5:23 PM
douardda updated the diff for D4921: Make plotting optional in simulator cli command.

rebase

Jan 29 2021, 5:23 PM

Jan 27 2021

douardda requested changes to D4931: Add mapping of definitions and harvests.

a few requests, please:

  • add tests with this commit; every introduced function should have at least one test,
  • add docstrings to your new functions,
  • improve the commit message (see https://chris.beams.io/posts/git-commit/ ); with the current one, I have no idea what exactly is done in this commit and, more importantly, why it is needed.
Jan 27 2021, 3:54 PM
douardda accepted D4914: simulator: stop using the database as a cache for origin data.
Jan 27 2021, 3:48 PM
douardda closed T2970: Make swh-journal tests not depend on swh-model any more as Resolved.

Let's consider it as done.

Jan 27 2021, 3:47 PM · Journal
douardda closed D4951: Remove tests' journal_data.py in favor of the version in swh-model.
Jan 27 2021, 3:38 PM
douardda committed rDJNL9703864ef366: Remove tests' journal_data.py in favor of the version in swh-model (authored by douardda).
Jan 27 2021, 3:38 PM
douardda updated the diff for D4951: Remove tests' journal_data.py in favor of the version in swh-model.

set the DeprecationWarning category in journal_data
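
For illustration, deprecating a moved module while keeping it importable could look like the following minimal sketch. The helper `warn_moved` and the module names `old_pkg.journal_data` and `new_pkg.journal_data` are hypothetical placeholders, not taken from the actual diff; only `warnings.warn` with an explicit `category` comes from the standard library.

```python
import warnings


def warn_moved(old_module: str, new_module: str) -> None:
    """Emit a DeprecationWarning pointing users at the new module location."""
    warnings.warn(
        f"{old_module} is deprecated, use {new_module} instead",
        category=DeprecationWarning,  # explicit category, instead of the default UserWarning
        stacklevel=2,  # attribute the warning to the caller, not to this helper
    )


# Record warnings so the category can be checked.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    warn_moved("old_pkg.journal_data", "new_pkg.journal_data")

assert caught[0].category is DeprecationWarning
```

Calling such a helper at import time of the old module would let dependent packages keep working while signalling that the data set now lives elsewhere.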

Jan 27 2021, 10:32 AM
douardda updated the diff for D4951: Remove tests' journal_data.py in favor of the version in swh-model.

remove stuff added mistakenly, and properly deprecate journal_data instead of breaking swh-storage

Jan 27 2021, 10:27 AM
douardda added inline comments to D4951: Remove tests' journal_data.py in favor of the version in swh-model.
Jan 27 2021, 10:00 AM
douardda added inline comments to D4951: Remove tests' journal_data.py in favor of the version in swh-model.
Jan 27 2021, 9:57 AM
douardda added inline comments to D4951: Remove tests' journal_data.py in favor of the version in swh-model.
Jan 27 2021, 9:54 AM

Jan 26 2021

douardda requested review of D4951: Remove tests' journal_data.py in favor of the version in swh-model.
Jan 26 2021, 5:32 PM
douardda closed D4950: Add swh-journal's model-related test data set in swh-model.
Jan 26 2021, 5:21 PM
douardda committed rDMODcad940dc8c07: Add swh-journal's model-related test data set in swh-model (authored by douardda).
Jan 26 2021, 5:21 PM
douardda added a revision to T2970: Make swh-journal tests not depend on swh-model any more: D4951: Remove tests' journal_data.py in favor of the version in swh-model.
Jan 26 2021, 5:09 PM · Journal
douardda requested review of D4950: Add swh-journal's model-related test data set in swh-model.
Jan 26 2021, 4:47 PM
douardda added a revision to T2970: Make swh-journal tests not depend on swh-model any more: D4950: Add swh-journal's model-related test data set in swh-model.
Jan 26 2021, 4:45 PM · Journal
douardda added a comment to T2970: Make swh-journal tests not depend on swh-model any more.

Back on this, the plan is now to make swh-journal not depend on the actual model definition. That dependency is currently mostly due to the presence of journal_data.py in swh-journal, so the plan is to move this file into swh-model, where it is kept up to date with swh-model even if it's mostly used for testing other packages (like swh-journal).

Jan 26 2021, 4:41 PM · Journal
douardda added a comment to D4914: simulator: stop using the database as a cache for origin data.

And once again, this "cache" behavior makes the simulator unable to run "forever" (it will eat RAM). Maybe it's an assumed design choice, but please document it somewhere.

Jan 26 2021, 9:50 AM
douardda added a comment to D4914: simulator: stop using the database as a cache for origin data.

Something I don't understand: why do you need to keep both _visit_times and latest_snapshots in "caches" when a snapshot is derived from this visit time (and visit type and origin)?

Jan 26 2021, 9:48 AM
douardda added a comment to D4909: simulator: add lister simulation.

Isn't there some inherent limitation with this lister_process (gradually eating RAM) that should be documented (maybe)?

Jan 26 2021, 9:33 AM
douardda added a comment to D4909: simulator: add lister simulation.

Note that I still think there should be something in docs/simulator.rst also...

Jan 26 2021, 9:29 AM
douardda accepted D4909: simulator: add lister simulation.

We're not claiming this is a realistic model. We only tried to do something that isn't completely naive, and exercises simple edge cases. Making it realistic is hard, and will probably be most of @olasd's work this week.

Jan 26 2021, 9:27 AM

Jan 25 2021

douardda added a comment to D4909: simulator: add lister simulation.

Yes, but that's not inconsistent as we can discover origins that we didn't know about.

Jan 25 2021, 12:23 PM
douardda added a comment to D4909: simulator: add lister simulation.

I'm really not sure I understand what the simulated model looks like in the end. Do I get it right that, including this diff:

Jan 25 2021, 12:04 PM

Jan 22 2021

douardda updated the diff for D4923: Simulation: allow to export results in a csv file.

rebased

Jan 22 2021, 4:20 PM
douardda updated the diff for D4921: Make plotting optional in simulator cli command.

s/-H/-P/

Jan 22 2021, 4:20 PM
douardda added inline comments to D4921: Make plotting optional in simulator cli command.
Jan 22 2021, 4:19 PM
douardda added a comment to D4927: lister.docs: add a lister template for the new API.

Thanks. I think, however, that given its purpose, this example code should be heavily commented: each constant (e.g. MyPageType) and each method should be commented (not docstrings, but comments explaining what the method/variable is used for).

Jan 22 2021, 4:15 PM · Sprint 2021 01, Lister
douardda accepted D4912: grab_next_visits: don't re-schedule visits too fast.

Not very fond of this "one week => dead" embedded in there, but meh.

Jan 22 2021, 3:37 PM
douardda accepted D4916: Run simulator tests on all known scheduling policies.
Jan 22 2021, 3:33 PM
douardda accepted D4915: simulator: record visit metrics alongside scheduler metrics.
Jan 22 2021, 3:33 PM
douardda accepted D4910: Construct grab_next_visits query arguments incrementally.

ok, but it would have been nice to have an explanation of why this is necessary in the commit message.

Jan 22 2021, 3:29 PM
douardda accepted D4911: Allow overriding the timestamp of grab_next_visits.
Jan 22 2021, 3:27 PM
douardda added a comment to D4920: Randomize last_update in generated ListedOrigins in fill_test_data.

why not (cli option), but why (keep it deterministic)?

  1. reproducibility, so we can run the simulator twice with different code, and be sure that differences in behavior are not caused by randomness
Jan 22 2021, 2:27 PM
douardda closed D4919: Add a --num-origins option to the fill-test-data cli command.
Jan 22 2021, 2:12 PM
douardda committed rDSCH86b255544c5d: Add a --num-origins option to the fill-test-data cli command (authored by douardda).
Jan 22 2021, 2:12 PM
douardda updated the diff for D4919: Add a --num-origins option to the fill-test-data cli command.

rebased

Jan 22 2021, 2:12 PM
douardda closed D4922: Simulation: log at info level recorded metrics.
Jan 22 2021, 2:10 PM
douardda committed rDSCHabb513ca7d09: Simulation: log at info level recorded metrics (authored by douardda).
Jan 22 2021, 2:10 PM
douardda updated the diff for D4922: Simulation: log at info level recorded metrics.

rebased

Jan 22 2021, 2:10 PM
douardda added a comment to D4920: Randomize last_update in generated ListedOrigins in fill_test_data.

I'd like to keep the simulator deterministic. What about adding a CLI option with a seed?

why not (cli option), but why (keep it deterministic)?

Also, a given seed will not be enough here: there is also the maxts = int(utcnow().timestamp()) that will kill the deterministic property...
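
One way to address both points, as a minimal sketch only: pass the seed and the reference time in explicitly instead of reading the clock inside the generator. The function name `make_last_updates`, its parameters and the one-year window are assumptions made for illustration and are not taken from the actual fill_test_data code.

```python
import random
from datetime import datetime, timedelta, timezone
from typing import List


def make_last_updates(count: int, seed: int, now: datetime) -> List[datetime]:
    """Return `count` pseudo-random timestamps in the year preceding `now`."""
    rng = random.Random(seed)  # private generator: no dependency on global random state
    maxts = int(now.timestamp())  # upper bound comes from the caller, not from the clock
    mints = int((now - timedelta(days=365)).timestamp())
    return [
        datetime.fromtimestamp(rng.randint(mints, maxts), tz=timezone.utc)
        for _ in range(count)
    ]


# Same seed and same reference time give identical data on every run.
fixed_now = datetime(2021, 1, 22, tzinfo=timezone.utc)
first = make_last_updates(5, seed=42, now=fixed_now)
second = make_last_updates(5, seed=42, now=fixed_now)
assert first == second
```

With this shape, a CLI could expose the seed as an option and still default the reference time to the current time when reproducibility is not required.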

Jan 22 2021, 12:28 PM
douardda added a comment to D4920: Randomize last_update in generated ListedOrigins in fill_test_data.

I'd like to keep the simulator deterministic. What about adding a CLI option with a seed?

why not (cli option), but why (keep it deterministic)?

Jan 22 2021, 12:27 PM
douardda added a comment to D4920: Randomize last_update in generated ListedOrigins in fill_test_data.

I'd like to keep the simulator deterministic. What about adding a CLI option with a seed?

Jan 22 2021, 12:25 PM
douardda updated the diff for D4923: Simulation: allow to export results in a csv file.

rebased

Jan 22 2021, 12:23 PM
douardda updated the summary of D4923: Simulation: allow to export results in a csv file.
Jan 22 2021, 12:23 PM
douardda updated the summary of D4923: Simulation: allow to export results in a csv file.
Jan 22 2021, 12:22 PM
douardda retitled D4921: Make plotting optional in simulator cli command from Make plotting histograms optional in simulator cli command to Make plotting optional in simulator cli command.
Jan 22 2021, 12:21 PM
douardda updated the diff for D4921: Make plotting optional in simulator cli command.

rebase on D4916

Jan 22 2021, 12:21 PM
douardda requested review of D4920: Randomize last_update in generated ListedOrigins in fill_test_data.
Jan 22 2021, 11:40 AM
douardda updated the diff for D4923: Simulation: allow to export results in a csv file.

rebased

Jan 22 2021, 11:36 AM
douardda updated the diff for D4921: Make plotting optional in simulator cli command.

kill unneeded dependency on D4920

Jan 22 2021, 11:27 AM
douardda updated the summary of D4921: Make plotting optional in simulator cli command.
Jan 22 2021, 11:26 AM
douardda updated the diff for D4922: Simulation: log at info level recorded metrics.

with the commit...

Jan 22 2021, 11:22 AM
douardda updated the diff for D4922: Simulation: log at info level recorded metrics.

typo + vlorentz's comment

Jan 22 2021, 11:21 AM
douardda updated the summary of D4922: Simulation: log at info level recorded metrics.
Jan 22 2021, 11:19 AM
douardda accepted D4877: npm: Reimplement lister using new Lister API.
Jan 22 2021, 11:08 AM
douardda added inline comments to D4909: simulator: add lister simulation.
Jan 22 2021, 11:06 AM
douardda added inline comments to D4909: simulator: add lister simulation.
Jan 22 2021, 11:05 AM
douardda requested review of D4923: Simulation: allow to export results in a csv file.
Jan 22 2021, 11:01 AM
douardda requested review of D4922: Simulation: log at info level recorded metrics.
Jan 22 2021, 10:59 AM
douardda accepted D4899: Add scheduling policy for already visited origins with known last update.

lgtm

Jan 22 2021, 10:58 AM
douardda requested review of D4921: Make plotting optional in simulator cli command.
Jan 22 2021, 10:57 AM
douardda requested review of D4919: Add a --num-origins option to the fill-test-data cli command.
Jan 22 2021, 10:52 AM

Jan 21 2021

douardda added inline comments to D4895: Add a successive_visits counter to OriginVisitStats.
Jan 21 2021, 9:57 AM