Dec 8 2022

anlambert added inline comments to D8882: replay: Do not ignore externals in copyfrom operations.
Dec 8 2022, 11:15 AM
anlambert added inline comments to D8943: svn: Use urllib.parse.quote to percent encode svn URLs.
Dec 8 2022, 11:15 AM

Dec 7 2022

anlambert requested review of D8946: svn_retry: Reduce max number of retry attempts to 3.
Dec 7 2022, 6:06 PM
anlambert requested review of D8945: api, browse: Ensure to sanitize filename passed to django FileResponse.
Dec 7 2022, 6:06 PM
anlambert raised the priority of T4673: AssertionError: 7911 != 7912 in subvertpy while parsing txdeltas from Low to Normal.
Dec 7 2022, 5:46 PM · SVN Loader
anlambert requested review of D8944: replay: Copy dir states and external paths in copy_from operations.
Dec 7 2022, 5:43 PM
anlambert requested review of D8943: svn: Use urllib.parse.quote to percent encode svn URLs.
Dec 7 2022, 5:42 PM
anlambert requested review of D8942: utils: Raise ValueError when external definition could not be parsed.
Dec 7 2022, 5:41 PM
anlambert requested review of D8941: replay: Simplify FileEditor implementation.
Dec 7 2022, 5:36 PM
anlambert added a revision to T4673: AssertionError: 7911 != 7912 in subvertpy while parsing txdeltas: D8941: replay: Simplify FileEditor implementation.
Dec 7 2022, 5:33 PM · SVN Loader
anlambert updated the diff for D8882: replay: Do not ignore externals in copyfrom operations.

Rebase

Dec 7 2022, 5:23 PM
anlambert closed D8913: loader: Compute contents diff when detecting trees divergence.
Dec 7 2022, 5:19 PM
anlambert committed rDLDSVN2b80484b8b70: loader: Compute contents diff when detecting trees divergence (authored by anlambert).
loader: Compute contents diff when detecting trees divergence
Dec 7 2022, 5:19 PM
anlambert committed rDGRPH100ccd1d1e47: setup.py: Ensure testing requirements include luigi (authored by anlambert).
setup.py: Ensure testing requirements include luigi
Dec 7 2022, 2:15 PM
anlambert closed D8927: setup.py: Ensure testing requirements include luigi.
Dec 7 2022, 2:15 PM
anlambert committed rDGRPHe00ddf515557: luigi/origin_contributors: Fix mypy error with Python 3.7 (authored by anlambert).
luigi/origin_contributors: Fix mypy error with Python 3.7
Dec 7 2022, 2:15 PM
anlambert updated the diff for D8927: setup.py: Ensure testing requirements include luigi.

Rebase

Dec 7 2022, 2:07 PM
anlambert updated the diff for D8927: setup.py: Ensure testing requirements include luigi.

Add commit fixing mypy errors with Python 3.7

Dec 7 2022, 2:02 PM
anlambert closed T2719: Add entry of the FAIRsFAIR report in `publications` as Resolved.
Dec 7 2022, 1:54 PM · Website
anlambert added a comment to T4718: Fix search of URL - return of multiple results when none is correct.

The origin does not exist, so Elasticsearch returns URLs similar to the search query, as any search engine would.

Dec 7 2022, 1:41 PM · Web app
anlambert updated the diff for D8927: setup.py: Ensure testing requirements include luigi.

Remove last commit as it did not work

Dec 7 2022, 12:06 PM
anlambert updated the diff for D8927: setup.py: Ensure testing requirements include luigi.

Add commit to fix mypy issue

Dec 7 2022, 12:03 PM
anlambert updated the diff for D8927: setup.py: Ensure testing requirements include luigi.

Rebase

Dec 7 2022, 10:59 AM

Dec 6 2022

anlambert requested review of D8927: setup.py: Ensure testing requirements include luigi.
Dec 6 2022, 4:29 PM
anlambert accepted D8910: Regenerate the test dataset to include a release with no author.
Dec 6 2022, 4:06 PM
anlambert accepted D8908: Add ListOriginContributors.

LGTM, added a couple of nitpicks as inline comments.

Dec 6 2022, 4:04 PM
anlambert accepted D8883: Add a script to generate a topological sort.

LGTM. It has been a while since I read Java code; it is so verbose (especially for iterations).

Dec 6 2022, 3:17 PM
anlambert requested changes to D8919: Add CLI script to generate Luigi config and call it.

Could you add a test checking that luigi parameters are correctly passed to the subprocess.run instruction?

Dec 6 2022, 2:54 PM
anlambert accepted D8926: luigi.RunExportAll: Default to exporting all formats.
Dec 6 2022, 2:47 PM
anlambert accepted D8917: Split swh/graph/luigi.py into modules.
Dec 6 2022, 2:46 PM
anlambert accepted D8925: luigi.CreateAthena: Fix validation of DB name.
Dec 6 2022, 2:43 PM
anlambert accepted D8924: exporters/orc: Fix crash on visit status with no type.
Dec 6 2022, 2:05 PM
anlambert closed D8920: from_disk.Content: Add missing path info for symlink.
Dec 6 2022, 1:54 PM
anlambert committed rDMOD818ad826a4f4: from_disk.Content: Add missing path info for symlink (authored by anlambert).
from_disk.Content: Add missing path info for symlink
Dec 6 2022, 1:54 PM
anlambert closed D8923: archive_coverage: Add link to Archive Changelog in coverage widget.
Dec 6 2022, 1:53 PM
anlambert committed rDWAPPS36ce2b462f5d: archive_coverage: Add link to Archive Changelog in coverage widget (authored by anlambert).
archive_coverage: Add link to Archive Changelog in coverage widget
Dec 6 2022, 1:53 PM
anlambert requested changes to D8909: Login: Add an option to choose an authentication method (by username/password or token).

@anlambert Shouldn't this be replaced by swh auth generate-token?

@anlambert @vlorentz seems legit that anything related to auth for a cli command should be centralized in swh auth.

I can adapt to:

  • make swh scanner depend on swh auth
  • alias swh scanner login to one of the swh auth commands
  • add a set token command to swh auth

What do you think?

Dec 6 2022, 1:52 PM
anlambert requested review of D8923: archive_coverage: Add link to Archive Changelog in coverage widget.
Dec 6 2022, 1:44 PM
anlambert added a comment to D8920: from_disk.Content: Add missing path info for symlink.

Does it mean we were silently dropping data until this? Which loaders use this?

Dec 6 2022, 1:14 PM

Dec 5 2022

anlambert closed T4715: Update publications page as Resolved.
Dec 5 2022, 5:16 PM · Website
anlambert accepted D8921: Hook up recently introduced options to all listers.
Dec 5 2022, 5:05 PM
anlambert updated the task description for T4715: Update publications page.
Dec 5 2022, 4:40 PM · Website
anlambert requested review of D8920: from_disk.Content: Add missing path info for symlink.
Dec 5 2022, 4:21 PM
anlambert added a comment to D8914: Ignore psqlrc when loading the rubygems database dump.
In D8914#231759, @olasd wrote:

Thanks, I will try to update the swh/lister/rubygems/data/small_rubygems_dump.sh script to generate a postgres dump that can be loaded when ON_ERROR_STOP is set.

There's not much point in doing that; the actual dump is generated with ownership instructions as well. I've wasted a good chunk of time trying to work around that, but filtering their silly double-wrapped plain text SQL export is a PITA.

Dec 5 2022, 3:23 PM
anlambert added inline comments to D8913: loader: Compute contents diff when detecting trees divergence.
Dec 5 2022, 2:42 PM
anlambert accepted D8914: Ignore psqlrc when loading the rubygems database dump.

Thanks, I will try to update the swh/lister/rubygems/data/small_rubygems_dump.sh script to generate a postgres dump that can be loaded when ON_ERROR_STOP is set.

Dec 5 2022, 2:08 PM
anlambert closed T4711: publications page: add 2022 scientific papers as Resolved.
Dec 5 2022, 1:55 PM · Website
anlambert requested review of D8913: loader: Compute contents diff when detecting trees divergence.
Dec 5 2022, 1:36 PM
anlambert closed D8886: replay: Add more debug logs.
Dec 5 2022, 1:26 PM
anlambert committed rDLDSVNe35f800a423a: replay: Add more debug logs (authored by anlambert).
replay: Add more debug logs
Dec 5 2022, 1:26 PM
anlambert created P1535 subversion loader based on incremental checkout operations.
Dec 5 2022, 10:47 AM

Nov 30 2022

anlambert closed D8902: save_code_now: Indicate if request was created from webhook in Web UI.
Nov 30 2022, 4:22 PM
anlambert committed rDWAPPSdd3936616b66: package.json: Upgrade dependencies (authored by anlambert).
package.json: Upgrade dependencies
Nov 30 2022, 4:22 PM
anlambert committed rDWAPPSf392e87cf449: save_code_now: Indicate if request was created from webhook in Web UI (authored by anlambert).
save_code_now: Indicate if request was created from webhook in Web UI
Nov 30 2022, 4:22 PM
anlambert closed D8890: metrics: Add gauge counting save origin requests created from webhooks.
Nov 30 2022, 4:22 PM
anlambert committed rDWAPPSce8da30b9842: metrics/tests/test_app: Fix test name (authored by anlambert).
metrics/tests/test_app: Fix test name
Nov 30 2022, 4:22 PM
anlambert committed rDWAPPSb4fc49b3fe92: metrics: Add gauge counting save origin requests created from webhooks (authored by anlambert).
metrics: Add gauge counting save origin requests created from webhooks
Nov 30 2022, 4:22 PM
anlambert closed D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.
Nov 30 2022, 4:22 PM
anlambert committed rDWAPPS12a291957fa0: save_code_now: Add webhook info fields to SaveOriginRequest model (authored by anlambert).
save_code_now: Add webhook info fields to SaveOriginRequest model
Nov 30 2022, 4:22 PM
anlambert updated the diff for D8902: save_code_now: Indicate if request was created from webhook in Web UI.

Rebase

Nov 30 2022, 3:40 PM
anlambert updated the diff for D8890: metrics: Add gauge counting save origin requests created from webhooks.

Rebase

Nov 30 2022, 3:40 PM
anlambert updated the diff for D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.

Fix docstring and doc build.

Nov 30 2022, 3:40 PM
anlambert committed rDDOC8f0abba63b56: Makefile.sphinx: Restore custom sphinx options coming from tox files (authored by anlambert).
Makefile.sphinx: Restore custom sphinx options coming from tox files
Nov 30 2022, 3:37 PM

Nov 29 2022

anlambert requested review of D8902: save_code_now: Indicate if request was created from webhook in Web UI.
Nov 29 2022, 5:43 PM
anlambert added a revision to T4548: Add a public API endpoint and documentation to trigger Save Code Now from webhook: D8902: save_code_now: Indicate if request was created from webhook in Web UI.
Nov 29 2022, 5:00 PM · Web app
anlambert updated the diff for D8890: metrics: Add gauge counting save origin requests created from webhooks.

Rebase

Nov 29 2022, 4:31 PM
anlambert updated the diff for D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.

Update apidoc for /origin/save/ endpoint with new webhook related fields

Nov 29 2022, 4:31 PM
anlambert updated the diff for D8890: metrics: Add gauge counting save origin requests created from webhooks.

Drop visit_type label in prometheus gauge.

Nov 29 2022, 3:31 PM
anlambert updated the diff for D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.

Fix docstring

Nov 29 2022, 3:30 PM
anlambert added inline comments to D8890: metrics: Add gauge counting save origin requests created from webhooks.
Nov 29 2022, 3:24 PM
anlambert updated the diff for D8890: metrics: Add gauge counting save origin requests created from webhooks.

Update: Add label for webhook origin in prometheus gauge

Nov 29 2022, 3:04 PM
anlambert updated the diff for D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.

Update: also add column in save_origin_requests table indicating which forge type sent the webhook

Nov 29 2022, 3:03 PM
anlambert added a comment to D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.

And would it make sense to store which webhook was used?

It can be easily guessed from the origin URL so I do not think we should add that info.

How? That works for Bitbucket and GitHub, but you can't tell whether "git.example.org" is GitLab or Gitea.

Nov 29 2022, 2:02 PM
anlambert added a comment to D8886: replay: Add more debug logs.

@anlambert Looks like you should use a separate logger, so it integrates with the standard logging config, e.g.:

swh -l DEBUG -l swh.loader.svn.whatever:INFO loader run svn <repo_url>

to have all debug logs except from the swh.loader.svn.whatever logger
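
For illustration, the per-logger level selection described here can be sketched with Python's standard logging module. This is a minimal sketch, not the loader's actual code: the logger name swh.loader.svn.whatever is the placeholder from the comment above, and swh.loader.svn.other is a hypothetical sibling used only for comparison.

```python
import logging

# Root logger at DEBUG: every logger without its own level inherits this.
logging.getLogger().setLevel(logging.DEBUG)

# Placeholder name taken from the comment above; a dedicated logger can be
# capped at INFO on its own, so only its debug messages are dropped.
noisy_logger = logging.getLogger("swh.loader.svn.whatever")
noisy_logger.setLevel(logging.INFO)

# Hypothetical sibling logger with no explicit level: it inherits DEBUG.
other_logger = logging.getLogger("swh.loader.svn.other")

debug_from_noisy = noisy_logger.isEnabledFor(logging.DEBUG)
debug_from_other = other_logger.isEnabledFor(logging.DEBUG)
```

With this setup a separate flag is not needed: the verbosity of one module is controlled by its logger name alone.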

Nov 29 2022, 1:55 PM
anlambert added a comment to D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.

What is it going to be used for?

To compute some stats about usage of that feature.

And would it make sense to store which webhook was used?

It can be easily guessed from the origin URL so I do not think we should add that info.

Nov 29 2022, 1:21 PM
anlambert added a comment to D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.

What is it going to be used for?

Nov 29 2022, 12:26 PM
anlambert added a comment to D8886: replay: Add more debug logs.
In D8886#231281, @lunar wrote:

Could you explain why using loglevels would not work? It feels to me they are meant to solve the problem of selecting the appropriate amount of debug messages… Having a separate flag introduces quite some extra noise in the process.

Nov 29 2022, 12:23 PM

Nov 28 2022

anlambert committed rCJSWH4fd2920268ae: templates/stage-cypress-tests: Use a local temporary cypress cache (authored by anlambert).
templates/stage-cypress-tests: Use a local temporary cypress cache
Nov 28 2022, 3:42 PM
anlambert requested review of D8890: metrics: Add gauge counting save origin requests created from webhooks.
Nov 28 2022, 3:28 PM
anlambert added a revision to T4548: Add a public API endpoint and documentation to trigger Save Code Now from webhook: D8890: metrics: Add gauge counting save origin requests created from webhooks.
Nov 28 2022, 3:17 PM · Web app
anlambert requested review of D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.
Nov 28 2022, 3:15 PM
anlambert added a revision to T4548: Add a public API endpoint and documentation to trigger Save Code Now from webhook: D8889: save_code_now: Add from_webhook field to SaveOriginRequest model.
Nov 28 2022, 2:09 PM · Web app
anlambert closed T4698: Error when requesting vault cooking for a directory in pending state as Resolved.

Fixed and deployed.

Nov 28 2022, 10:55 AM · Web app

Nov 25 2022

anlambert requested review of D8886: replay: Add more debug logs.
Nov 25 2022, 6:37 PM
anlambert closed D8885: package.json: Upgrade dependencies.
Nov 25 2022, 4:24 PM
anlambert committed rDWAPPSd9ffdd1759a2: package.json: Upgrade dependencies (authored by anlambert).
package.json: Upgrade dependencies
Nov 25 2022, 4:24 PM
anlambert requested review of D8885: package.json: Upgrade dependencies.
Nov 25 2022, 4:21 PM

Nov 24 2022

anlambert closed D8878: vault/assets: Fix cooking task creation when there is a pending one.
Nov 24 2022, 4:48 PM
anlambert committed rDWAPPSc32edb667d1b: vault/assets: Fix cooking task creation when there is a pending one (authored by anlambert).
vault/assets: Fix cooking task creation when there is a pending one
Nov 24 2022, 4:48 PM
anlambert updated the diff for D8878: vault/assets: Fix cooking task creation when there is a pending one.

Fix cypress test

Nov 24 2022, 3:34 PM
anlambert added inline comments to D8878: vault/assets: Fix cooking task creation when there is a pending one.
Nov 24 2022, 3:33 PM
anlambert updated the diff for D8878: vault/assets: Fix cooking task creation when there is a pending one.

Update

Nov 24 2022, 3:30 PM
anlambert added inline comments to D8878: vault/assets: Fix cooking task creation when there is a pending one.
Nov 24 2022, 2:48 PM
anlambert updated the diff for D8878: vault/assets: Fix cooking task creation when there is a pending one.

Handle all possible cooking statuses.

Nov 24 2022, 2:44 PM
anlambert retitled D8878: vault/assets: Fix cooking task creation when there is a pending one from vault/assets: Report pending cooking task when submitting the same one to vault/assets: Fix cooking task creation when there is a pending one.
Nov 24 2022, 2:30 PM
anlambert updated the diff for D8878: vault/assets: Fix cooking task creation when there is a pending one.

Rework fix to ensure cooking task will be displayed in vault UI.

Nov 24 2022, 2:29 PM
anlambert closed D8875: cypress/origin-save: Fix flaky test.
Nov 24 2022, 2:29 PM
anlambert committed rDWAPPScd820470fa59: cypress/origin-save: Fix flaky test (authored by anlambert).
cypress/origin-save: Fix flaky test
Nov 24 2022, 2:29 PM
anlambert added a comment to D8878: vault/assets: Fix cooking task creation when there is a pending one.

thanks!

Is it visible on the Downloads page even if someone else requested the cooking?

Ah right, need to check.

Nov 24 2022, 1:58 PM
anlambert accepted D8881: Document the need for higher vm.max_map_count.
Nov 24 2022, 1:45 PM