Page MenuHomeSoftware Heritage
Feed Advanced Search

Apr 14 2022

olasd requested review of D7570: Add support for disabling logging integration in sentry.
Apr 14 2022, 11:46 AM
olasd updated the diff for D7567: gunicorn: disable sentry logging event reporting by default.

Rebase on D7570

Apr 14 2022, 11:43 AM
olasd updated the diff for D7570: Add support for disabling logging integration in sentry.

Improve docs, rename variable to disable_logging_events

Apr 14 2022, 11:31 AM
olasd commandeered D7570: Add support for disabling logging integration in sentry.
Apr 14 2022, 11:30 AM
olasd updated the diff for D7570: Add support for disabling logging integration in sentry.

Rebase

Apr 14 2022, 11:04 AM
olasd closed D7566: RPC server: explicitly handle sentry exception capture.
Apr 14 2022, 10:31 AM
olasd committed rDCORE5723f00a2af4: RPC server: explicitly handle sentry exception capture (authored by olasd).
RPC server: explicitly handle sentry exception capture
Apr 14 2022, 10:31 AM
olasd updated the diff for D7566: RPC server: explicitly handle sentry exception capture.

Rebase on top of D7569

Apr 14 2022, 10:27 AM
olasd closed D7569: statsd: add an error_type tag to @timed error counters.
Apr 14 2022, 10:26 AM
olasd committed rDCOREc7f1b6880b96: statsd: add an error_type tag to @timed error counters (authored by olasd).
statsd: add an error_type tag to @timed error counters
Apr 14 2022, 10:25 AM
olasd updated the diff for D7569: statsd: add an error_type tag to @timed error counters.

Rebase below D7566/D7567/D7570

Apr 14 2022, 10:22 AM
olasd closed D7568: retry: re-raise original exception instead of a RetryError.
Apr 14 2022, 10:15 AM
olasd committed rDSTObbb4fc1cb472: retry: re-raise original exception instead of a RetryError (authored by olasd).
retry: re-raise original exception instead of a RetryError
Apr 14 2022, 10:15 AM

Apr 13 2022

olasd requested review of D7568: retry: re-raise original exception instead of a RetryError.
Apr 13 2022, 4:13 PM
olasd requested review of D7569: statsd: add an error_type tag to @timed error counters.
Apr 13 2022, 4:01 PM
olasd requested review of D7567: gunicorn: disable sentry logging event reporting by default.
Apr 13 2022, 3:12 PM
olasd requested review of D7566: RPC server: explicitly handle sentry exception capture.
Apr 13 2022, 3:11 PM

Apr 12 2022

olasd closed D7554: postgresql: ensure origin_visit(_status) queries use index.
Apr 12 2022, 2:32 PM
olasd committed rDSTO75aa07325d6c: postgresql: ensure origin_visit(_status) queries use index (authored by olasd).
postgresql: ensure origin_visit(_status) queries use index
Apr 12 2022, 2:32 PM
olasd updated the diff for D7554: postgresql: ensure origin_visit(_status) queries use index.

Rebase

Apr 12 2022, 2:22 PM
olasd requested review of D7554: postgresql: ensure origin_visit(_status) queries use index.
Apr 12 2022, 2:09 PM

Apr 11 2022

olasd added a comment to T3656: Survey revisions/releases with partially loaded history.

Looks like the number of affected revisions is fluctuating a bit:

Apr 11 2022, 4:10 PM · Archive content
olasd added a comment to T1739: Define an architecture to fetch extrinsic metadata outside listers and loaders.

The original idea for this was to have separate tasks to fetch metadata, so that loaders did not have forge-specific code to fetch metadata.

However, the idea of loading metadata from loader is more appealing the more I think about it:

  1. Metadata are fetched at about the same time as we snapshot code; which would allow showing more consistent states of repositories
  2. Active repositories automatically have their metadata fetched more often than inactive ones
  3. We don't have one more moving part to monitor and schedule
  4. This allows the Git loader to know a new repo is a "forge fork" of another one before it starts loading, so it can do an incremental load
Apr 11 2022, 2:36 PM · Extrinsic metadata

Apr 8 2022

olasd accepted D7518: Add support for recursive multipart messages.

Thanks!

Apr 8 2022, 11:46 AM

Apr 7 2022

olasd added a comment to D7507: Do not keep more than 6 months of build history for python builds.

I tried to be more selective as you proposed but I'm not sure if it's possible. It seems the
html reports are not managed as build artifacts but as an output of the html publisher plugin.
The only way I've found is to change the keep-all[1] property to false but in this case, only one
report will be kept, which is even worse.

Apr 7 2022, 4:02 PM

Apr 6 2022

olasd closed D7503: inbound_email: add function to extract the plaintext from a mail.
Apr 6 2022, 6:37 PM
olasd committed rDWAPPS17cd03607c4e: inbound_email: add function to extract the plaintext from a mail (authored by olasd).
inbound_email: add function to extract the plaintext from a mail
Apr 6 2022, 6:37 PM
olasd closed D7499: inbound_email: add support for signed email addresses.
Apr 6 2022, 6:37 PM
olasd committed rDWAPPS841919a3c8a6: inbound_email: add support for signed email addresses (authored by olasd).
inbound_email: add support for signed email addresses
Apr 6 2022, 6:37 PM
olasd updated the diff for D7503: inbound_email: add function to extract the plaintext from a mail.

Rebase, again, on top of D7499.

Apr 6 2022, 6:03 PM
olasd updated the diff for D7499: inbound_email: add support for signed email addresses.
  • Rebase
  • Remove spurious print
Apr 6 2022, 6:02 PM
olasd added a comment to D7499: inbound_email: add support for signed email addresses.

It makes me a little uncomfortable that the validator needs access to the same secret key that is used for generating signatures; because the validator runs in a frontend app, and a leak of that secret key allows someone to spoof any address.

Apr 6 2022, 5:17 PM
olasd added a comment to D7507: Do not keep more than 6 months of build history for python builds.

(specific targets if we're looking at less indiscriminate removals: docs diffs html output, coverage reports, ccm reports, mochawesome reports, duplicated junit-formatted test reports)

Apr 6 2022, 5:08 PM
olasd added a comment to D7507: Do not keep more than 6 months of build history for python builds.

I don't like the idea of removing all old build logs indiscriminately, but it seems that noone else cares enough to do a more targeted analysis of what should be removed or not, so *shrug*.

Apr 6 2022, 5:06 PM
olasd updated the diff for D7503: inbound_email: add function to extract the plaintext from a mail.
  • Rebase
  • Add coverage for the html-only case
  • Add coverage for the "multiple ambiguous parts length" behavior
Apr 6 2022, 4:50 PM
olasd committed rDSCRUB00cf0f93f62d: pristine-tar data for swh-scrubber_0.0.1.orig.tar.gz (authored by olasd).
pristine-tar data for swh-scrubber_0.0.1.orig.tar.gz
Apr 6 2022, 3:49 PM
olasd committed rDSCRUB744c872362dd: Import upstream version 0.0.1 (authored by olasd).
Import upstream version 0.0.1
Apr 6 2022, 3:49 PM
olasd accepted D7039: Update the debian local package building section.
Apr 6 2022, 2:21 PM

Apr 5 2022

olasd added a comment to D7502: pre-commit, tox: Bump black from 19.10b0 to 22.3.0.

I thought we could introduce flake8-bugbear for line length limits, as suggested by black's docs: https://black.readthedocs.io/en/stable/the_black_code_style/current_style.html#line-length, but I can't seem to find a way to *only* enable B950 and not the other bugbear warnings. sigh.

I forgot to run make check again, good catch ! Indeed using flake8-bugbear seems the best way to proceed regarding line length warnings.
I managed to activate only that warning

Apr 5 2022, 5:25 PM
olasd updated the diff for D7499: inbound_email: add support for signed email addresses.

rebase and apply @ardumont's comment

Apr 5 2022, 4:21 PM
olasd closed D7498: inbound_email: split recipient matching logic out.
Apr 5 2022, 4:21 PM
olasd committed rDWAPPS10f8ba14679d: inbound_email: split recipient matching logic out (authored by olasd).
inbound_email: split recipient matching logic out
Apr 5 2022, 4:21 PM
olasd updated the diff for D7498: inbound_email: split recipient matching logic out.

rebase

Apr 5 2022, 4:20 PM
olasd closed D7497: Restrict pytest-postgresql to < 4.0.0.
Apr 5 2022, 4:20 PM
olasd committed rDWAPPS2ad94111d6de: Restrict pytest-postgresql to < 4.0.0 (authored by olasd).
Restrict pytest-postgresql to < 4.0.0
Apr 5 2022, 4:20 PM
olasd updated the diff for D7497: Restrict pytest-postgresql to < 4.0.0.

rebase

Apr 5 2022, 4:20 PM
olasd added a comment to D7502: pre-commit, tox: Bump black from 19.10b0 to 22.3.0.
In D7502#196088, @olasd wrote:
Apr 5 2022, 3:57 PM
olasd added a comment to D7502: pre-commit, tox: Bump black from 19.10b0 to 22.3.0.

Thanks for looking into this!

Apr 5 2022, 3:55 PM
olasd added a comment to D7502: pre-commit, tox: Bump black from 19.10b0 to 22.3.0.

(sorry, typing... please hold)

Apr 5 2022, 2:31 PM
olasd requested changes to D7502: pre-commit, tox: Bump black from 19.10b0 to 22.3.0.
Apr 5 2022, 2:31 PM
olasd requested review of D7499: inbound_email: add support for signed email addresses.
Apr 5 2022, 2:04 PM
olasd requested review of D7503: inbound_email: add function to extract the plaintext from a mail.
Apr 5 2022, 1:45 PM
olasd added a revision to T3999: [Add forge now] Design a feedback tracking solution: D7503: inbound_email: add function to extract the plaintext from a mail.
Apr 5 2022, 1:17 PM · Add Forge Now
olasd updated the diff for D7498: inbound_email: split recipient matching logic out.

rebase

Apr 5 2022, 1:15 PM
olasd updated the diff for D7497: Restrict pytest-postgresql to < 4.0.0.

rebase

Apr 5 2022, 1:15 PM

Apr 4 2022

olasd requested review of D7498: inbound_email: split recipient matching logic out.
Apr 4 2022, 5:27 PM
olasd requested review of D7497: Restrict pytest-postgresql to < 4.0.0.
Apr 4 2022, 5:27 PM
olasd added a revision to T3999: [Add forge now] Design a feedback tracking solution: D7499: inbound_email: add support for signed email addresses.
Apr 4 2022, 5:05 PM · Add Forge Now
olasd added inline comments to D7473: Make postgresql's Storage client options configurable from config.
Apr 4 2022, 4:24 PM
olasd accepted D7472: Make db_transaction's client_options configurable at run time.

The duplication between db_transaction and db_transaction_generator looks a bit silly, but I'm not sure there's much to do about it, so this lgtm, thanks.

Apr 4 2022, 3:36 PM

Apr 1 2022

olasd closed D7484: Reorganize toctree to make previous roadmaps less preeminent.
Apr 1 2022, 2:35 PM
olasd committed rDDOCcfe2da4e77c7: Reorganize toctree to make previous roadmaps less preeminent (authored by olasd).
Reorganize toctree to make previous roadmaps less preeminent
Apr 1 2022, 2:35 PM
olasd requested review of D7484: Reorganize toctree to make previous roadmaps less preeminent.
Apr 1 2022, 1:22 PM
olasd committed rDWAPPSad5add7d36e6: Ensure that tests run with the C.UTF-8 locale (authored by olasd).
Ensure that tests run with the C.UTF-8 locale
Apr 1 2022, 12:14 PM
olasd closed D7483: Ensure that tests run with the C.UTF-8 locale.
Apr 1 2022, 12:14 PM
olasd committed rDWAPPSb7d6c93148e9: Move add_forge_now migration tests with other add_forge_now tests (authored by olasd).
Move add_forge_now migration tests with other add_forge_now tests
Apr 1 2022, 12:14 PM

Mar 30 2022

olasd added a comment to D7469: npm: Include package version id in ExtID manifest.

So, from what I understand, we're currently generating releases combining intrinsic data (the target directory hash) and extrinsic data provided by the package index (release name and message).

Mar 30 2022, 5:34 PM
olasd added inline comments to D7442: interface: Add new method origin_visit_get_with_statuses.
Mar 30 2022, 3:47 PM
olasd accepted D7470: Modify default timeouts.

Checking the last duration column here https://jenkins.softwareheritage.org/view/swh%20master/ I think we're good to go. Thank you!

Mar 30 2022, 12:37 PM
olasd added a comment to D7470: Modify default timeouts.

I think 30 minutes is a lot of time to waste on an actual hang, but it's clear that this set of jobs is already on the edge.

Mar 30 2022, 11:57 AM
olasd added a comment to T4117: Storage metrics not refreshed.

swh.storage 1.2.0 increased a bunch of timeouts and made some queries smarter so it's entirely plausible that the number of errors has dropped drastically.

Mar 30 2022, 10:09 AM · Storage manager, System administration
olasd added a comment to T4117: Storage metrics not refreshed.

What exact metric are you saying isn't updating? In your diff, the swh_storage_request_duration_seconds_count{endpoint="index"} metric seems to be increasing normally

Mar 30 2022, 10:07 AM · Storage manager, System administration
olasd accepted D7450: Make the timeout declaratively parametric.

Thank you!

Mar 30 2022, 9:59 AM

Mar 29 2022

olasd accepted D7446: Install app_label on add_forge_now model classes.

Thanks!

Mar 29 2022, 6:22 PM
olasd accepted D7455: Fix support of Werkzeug 2.1.0.

jfc.

Mar 29 2022, 2:36 PM
olasd added a comment to T4090: Add method to efficiently retrieve latest statuses of origin visits .

SGTM, thanks!

Mar 29 2022, 2:34 PM · Storage manager
olasd requested changes to D7450: Make the timeout declaratively parametric.

That will apply to all jobs, which I don't think we want to do.

Mar 29 2022, 11:34 AM

Mar 28 2022

olasd added a comment to T4090: Add method to efficiently retrieve latest statuses of origin visits .

I have the feeling that, in terms of API extensibility, we'll want to be returning both the OriginVisit and its latest OriginVisitStatus.

Mar 28 2022, 2:47 PM · Storage manager

Mar 23 2022

olasd committed rDSTO835feb684295: Fix tenacious storage tests for swh.model 6 (authored by olasd).
Fix tenacious storage tests for swh.model 6
Mar 23 2022, 4:48 PM
olasd accepted D7399: Add support for author=None and committer=None.

So, I assume this will bump the swh.model dependency to 6.0 before it lands?

Mar 23 2022, 10:48 AM

Mar 22 2022

olasd accepted D7400: Add support for None as author or committer of a Revision.

LGTM, thanks

Mar 22 2022, 2:29 PM

Mar 18 2022

olasd accepted D7370: Add feature flag to toggle on/off add-forge-now feature.

Awesome, thanks!

Mar 18 2022, 11:08 AM
olasd added a comment to D7370: Add feature flag to toggle on/off add-forge-now feature.

(as a side note, I don't really understand why these views are split in two modules)

It's currently consistently separated in "public" views and "admin" views.

Mar 18 2022, 10:15 AM

Mar 17 2022

olasd added a comment to D7370: Add feature flag to toggle on/off add-forge-now feature.

I think we should really block the views as well as the links when the feature is not enabled.

Mar 17 2022, 6:06 PM
olasd accepted D7166: Improve `origin_add` logic.

Looks fine, thanks!

Mar 17 2022, 1:47 PM

Mar 16 2022

olasd committed rDWAPPS80e55b0a12eb: Merge branch 'master' into sprint-add-forge-now (authored by olasd).
Merge branch 'master' into sprint-add-forge-now
Mar 16 2022, 4:21 PM
olasd added a comment to T3311: Use .gitmodules to discover origins.
In T3311#80997, @olasd wrote:

I'm not comfortable always creating high priority tasks in this context either, as I'm not sure what the throttling implications are when we inevitably end up on a repository that references a commit in a submodule that doesn't exist.

Mar 16 2022, 4:19 PM · Archive coverage, Git loader
olasd added a comment to D7332: loader: Add support for submodules discovering.

FYI, @douardda added a comment on T3311 (I think it's the best place to discuss the design)

Mar 16 2022, 4:15 PM
olasd added a comment to T3311: Use .gitmodules to discover origins.

I think the approach in D7332 is interesting, but it feels a bit expensive to be doing it for every instance of a .gitmodules file found in any new directory for all git repos that are being loaded, as well as doing it again for the top level of any known branch in the git snapshot being loaded currently.

Mar 16 2022, 4:15 PM · Archive coverage, Git loader
olasd accepted D7354: backfill: Make integer_ranges() work on str args + add typing to RANGE_GENERATORS.

Cool, thanks.

Mar 16 2022, 3:55 PM
olasd committed rDWAPPSc24e6822a156: Merge branch 'master' into sprint-add-forge-now (authored by olasd).
Merge branch 'master' into sprint-add-forge-now
Mar 16 2022, 2:28 PM
olasd added a comment to T3958: Improve developer-facing automation.

Cypress parallel testing references:

Mar 16 2022, 2:19 PM · Continuous Integration, System administration
olasd committed rDWAPPSbf099ce9654d: Bootstrap a common app for management of inbound emails (authored by olasd).
Bootstrap a common app for management of inbound emails
Mar 16 2022, 2:18 PM
olasd closed D7325: Bootstrap a common app for management of inbound emails.
Mar 16 2022, 2:18 PM
olasd committed rDWAPPSca39b06c46ba: inbound_email: add utilities to extract and filter email recipients (authored by olasd).
inbound_email: add utilities to extract and filter email recipients
Mar 16 2022, 2:18 PM
olasd updated the diff for D7325: Bootstrap a common app for management of inbound emails.

One last rebase for the road?

Mar 16 2022, 1:57 PM

Mar 14 2022

olasd added a comment to T3841: regularly scrub all the data stores of swh.
In T3841#80779, @olasd wrote:

I think it's fine to remove the entries when we don't need them anymore (i.e. the object has been restored). Worst case, it'll be re-added at the next iteration of the script :-)

Mar 14 2022, 3:23 PM · Datastore Scrubber, meta-task, Roadmap 2022, Storage manager
olasd added a comment to T3841: regularly scrub all the data stores of swh.

You'll need a column for which datastore has the corrupted object.

Mar 14 2022, 3:19 PM · Datastore Scrubber, meta-task, Roadmap 2022, Storage manager
olasd updated the diff for D7325: Bootstrap a common app for management of inbound emails.

Use str.partition instead of str.split

Mar 14 2022, 1:35 PM
olasd added inline comments to D7325: Bootstrap a common app for management of inbound emails.
Mar 14 2022, 1:27 PM