Page MenuHomeSoftware Heritage
Feed Advanced Search

Feb 2 2022

douardda updated the diff for D7064: Add a 'swh db upgrade' command.

rebase

Feb 2 2022, 3:31 PM
douardda updated the diff for D7063: Make 'swh db init' fill the dbversion table.

rebase

Feb 2 2022, 3:31 PM
douardda updated the diff for D7062: Add support for dbversion and dbmodule handling in swh db init.

rebase

Feb 2 2022, 3:30 PM
douardda updated the diff for D7061: Make `swh db init` retrieve the db cnx uri from the config file.

rebase

Feb 2 2022, 3:30 PM
douardda updated the diff for D7060: preparation refactoring for following diffs.

move (forgotten) import stmts top level (thx vlorentz)

Feb 2 2022, 3:30 PM
douardda added inline comments to D7060: preparation refactoring for following diffs.
Feb 2 2022, 3:25 PM
douardda updated the diff for D7064: Add a 'swh db upgrade' command.

rebase

Feb 2 2022, 3:24 PM
douardda updated the diff for D7063: Make 'swh db init' fill the dbversion table.

small cleanup in cli_new/50-data.sql

Feb 2 2022, 3:24 PM
douardda requested review of D7064: Add a 'swh db upgrade' command.
Feb 2 2022, 3:22 PM
douardda requested review of D7063: Make 'swh db init' fill the dbversion table.
Feb 2 2022, 3:21 PM
douardda requested review of D7062: Add support for dbversion and dbmodule handling in swh db init.
Feb 2 2022, 2:46 PM
douardda requested review of D7061: Make `swh db init` retrieve the db cnx uri from the config file.
Feb 2 2022, 2:36 PM
douardda requested review of D7060: preparation refactoring for following diffs.
Feb 2 2022, 2:35 PM
douardda added a revision to T3894: Add a 'swh db upgrade' cli: D7064: Add a 'swh db upgrade' command.
Feb 2 2022, 1:53 PM · Core & foundations
douardda added a revision to T3894: Add a 'swh db upgrade' cli: D7063: Make 'swh db init' fill the dbversion table.
Feb 2 2022, 1:52 PM · Core & foundations
douardda added a revision to T3894: Add a 'swh db upgrade' cli: D7062: Add support for dbversion and dbmodule handling in swh db init.
Feb 2 2022, 1:51 PM · Core & foundations
douardda added a revision to T3894: Add a 'swh db upgrade' cli: D7061: Make `swh db init` retrieve the db cnx uri from the config file.
Feb 2 2022, 1:49 PM · Core & foundations
douardda added a revision to T3894: Add a 'swh db upgrade' cli: D7060: preparation refactoring for following diffs.
Feb 2 2022, 1:49 PM · Core & foundations

Jan 28 2022

douardda closed T3693: Provide a mecanism to report (with persistence) objects that fails to get replayed (mirror) as Resolved.
Jan 28 2022, 9:38 AM · Storage manager
douardda added a comment to T3894: Add a 'swh db upgrade' cli.

The problem with step 2. is that we do not have an API to declare the latest version, something that should come with swh.core.db is suppose.

Jan 28 2022, 9:38 AM · Core & foundations
douardda updated the task description for T3894: Add a 'swh db upgrade' cli.
Jan 28 2022, 9:36 AM · Core & foundations

Jan 27 2022

douardda updated the task description for T3894: Add a 'swh db upgrade' cli.
Jan 27 2022, 3:10 PM · Core & foundations
douardda triaged T3894: Add a 'swh db upgrade' cli as High priority.
Jan 27 2022, 3:07 PM · Core & foundations
douardda closed T424: swh-journal: persistent journal infrastructure to record additions to the swh-storage as Resolved.

yeah!

Jan 27 2022, 2:21 PM · Journal
douardda closed T1279: swh-journal: The schema migration problem, a subtask of T424: swh-journal: persistent journal infrastructure to record additions to the swh-storage, as Wontfix.
Jan 27 2022, 2:21 PM · Journal
douardda closed T1279: swh-journal: The schema migration problem as Wontfix.

See T3893 instead.

Jan 27 2022, 2:21 PM · Journal
douardda renamed T3893: Version swh schema from Make the swh schema versioned to Version swh schema .
Jan 27 2022, 2:18 PM · Data Model
douardda updated the task description for T3893: Version swh schema .
Jan 27 2022, 2:16 PM · Data Model
douardda added a comment to T3893: Version swh schema .

From IRC:

Jan 27 2022, 2:13 PM · Data Model
douardda triaged T3893: Version swh schema as High priority.
Jan 27 2022, 2:09 PM · Data Model
douardda created T3893: Version swh schema .
Jan 27 2022, 2:09 PM · Data Model
douardda closed T1481: add metric to monitor "save code now" efficiency, a subtask of T1408: More/better Metrics, as Resolved.
Jan 27 2022, 1:45 PM · Metrics/monitoring, Sprint 2018 12
douardda closed T1481: add metric to monitor "save code now" efficiency as Resolved.

we can always improve it, but now we have a decent dashboard, so let's consider this done.

Jan 27 2022, 1:45 PM · Save Code Now, System administration, Metrics/monitoring
douardda closed T1481: add metric to monitor "save code now" efficiency, a subtask of T3082: Improve Save Code Now handling, as Resolved.
Jan 27 2022, 1:45 PM · Save Code Now, meta-task, Roadmap 2021, Web app
douardda closed T3218: The graph replayer generates REQTMOUT Timeout errors, a subtask of T3197: Mirror: fix common issues of a replayer session, as Resolved.
Jan 27 2022, 1:43 PM · Mirror
douardda closed T3218: The graph replayer generates REQTMOUT Timeout errors as Resolved.

Seems moslty ok, even if these errors still pop now and then in the logs.

Jan 27 2022, 1:43 PM · Mirror
douardda triaged T3892: Add a dashboard/page with a summary of which version of swh components are running on which service/machine as Normal priority.
Jan 27 2022, 11:04 AM · System administration

Jan 26 2022

douardda added inline comments to D7003: journal: Document the new format for gitdate..
Jan 26 2022, 3:50 PM
douardda added a comment to D7039: Update the debian local package building section.
In D7039#183023, @olasd wrote:

Thanks.

--build-dep-resolver=aptitude should only be used when building with an extra-repository which has a non-default priority, that is only when using a -backports suite (so only for the bullseye and buster instructions). It should probably be documented in the list of "useful options" rather than as the default.

Jan 26 2022, 2:26 PM
douardda requested changes to D7003: journal: Document the new format for gitdate..
Jan 26 2022, 2:24 PM
douardda added inline comments to D7003: journal: Document the new format for gitdate..
Jan 26 2022, 2:23 PM
douardda added inline comments to D7003: journal: Document the new format for gitdate..
Jan 26 2022, 2:18 PM
douardda requested review of D7039: Update the debian local package building section.
Jan 26 2022, 12:33 PM

Jan 25 2022

douardda committed rDDOC7ff0c981c3e9: Update displayed copyright to 2022 (authored by douardda).
Update displayed copyright to 2022
Jan 25 2022, 9:55 AM

Jan 24 2022

douardda triaged T3883: Handle updated kafka messages for the objstorage replayer as High priority.
Jan 24 2022, 5:25 PM · Mirror
douardda triaged T3882: Handle updated kafka messages for the storage replayer as High priority.
Jan 24 2022, 5:22 PM · Storage manager, Mirror
douardda triaged T3881: Mirror - handle handling of multiple kafka messages for the same object as High priority.
Jan 24 2022, 5:22 PM · Mirror
douardda added a comment to T3877: Automate the weekly-planning script.

It might need a dedicated bot user to be created on hedgedoc also.

Jan 24 2022, 5:17 PM · System administration
douardda added a comment to T3877: Automate the weekly-planning script.

Things to fix in the script:

Jan 24 2022, 5:16 PM · System administration
douardda committed rDDOC52c6a83ea2cd: Fix rst syntax in mirror-operations/docker.rst (authored by douardda).
Fix rst syntax in mirror-operations/docker.rst
Jan 24 2022, 5:03 PM
douardda closed D7004: Update the docker mirror doc.
Jan 24 2022, 5:03 PM
douardda committed rDDOC7396e4263eec: Update the 'updating a configuration' section of the swarm-based mirror (authored by douardda).
Update the 'updating a configuration' section of the swarm-based mirror
Jan 24 2022, 5:03 PM
douardda updated the diff for D7004: Update the docker mirror doc.

forgot one...

Jan 24 2022, 4:57 PM
douardda updated the diff for D7004: Update the docker mirror doc.

Use roles as suggested by vlorentz

Jan 24 2022, 4:47 PM
douardda triaged T3877: Automate the weekly-planning script as High priority.
Jan 24 2022, 10:46 AM · System administration

Jan 21 2022

douardda requested review of D7004: Update the docker mirror doc.
Jan 21 2022, 12:54 PM
douardda closed D6994: Add a quickstart section in the doc.
Jan 21 2022, 12:46 PM
douardda committed rDOBJSRPLeca8d7714b00: Add a quickstart section in the doc (authored by douardda).
Add a quickstart section in the doc
Jan 21 2022, 12:46 PM
douardda updated the diff for D6994: Add a quickstart section in the doc.

yat (yet-another-typo)

Jan 21 2022, 11:39 AM
douardda closed D6944: Add a few statsd metrics in the kafka journal client.
Jan 21 2022, 11:30 AM
douardda committed rDJNL5a26dae22928: Add a few statsd metrics in the kafka journal client (authored by douardda).
Add a few statsd metrics in the kafka journal client
Jan 21 2022, 11:30 AM
douardda updated the diff for D6994: Add a quickstart section in the doc.

more typos

Jan 21 2022, 11:06 AM
douardda updated the diff for D6994: Add a quickstart section in the doc.

and fix the spurious () (thx ardumont)

Jan 21 2022, 11:04 AM
douardda updated the diff for D6994: Add a quickstart section in the doc.

fix rst syntax (thx D6995 review)

Jan 21 2022, 11:03 AM
douardda accepted D6995: Fix ReST syntax.

thx

Jan 21 2022, 11:02 AM
douardda requested review of D6994: Add a quickstart section in the doc.
Jan 21 2022, 10:37 AM
douardda closed D6884: Add support for the rdkafka 'stats_cb' config option in JournalClient.
Jan 21 2022, 9:52 AM
douardda committed rDJNL5b17c50d3280: Add support for the rdkafka 'stats_cb' config option in get_journal_client (authored by douardda).
Add support for the rdkafka 'stats_cb' config option in get_journal_client
Jan 21 2022, 9:52 AM
douardda closed D6980: Add an "Hosting a mirror" page.
Jan 21 2022, 9:49 AM
douardda committed rDDOC2fc9925ba5a3: Add an "Hosting a mirror" page (authored by douardda).
Add an "Hosting a mirror" page
Jan 21 2022, 9:49 AM
douardda closed D6945: Make the copy process of blob objects run with thread concurrency.
Jan 21 2022, 9:46 AM
douardda committed rDOBJSRPL0dffebc423a1: Make the copy process of blob objects run with thread concurrency (authored by douardda).
Make the copy process of blob objects run with thread concurrency
Jan 21 2022, 9:46 AM

Jan 20 2022

douardda added a comment to T3127: Compute and display distribution of origins by forge.

Is there a reason not to close this task?

Jan 20 2022, 6:35 PM · Metrics/monitoring, Web app, Roadmap 2021, meta-task
douardda updated the diff for D6944: Add a few statsd metrics in the kafka journal client.

update using new statsd.status_gauge() context manager (in swh.core 1.1)

Jan 20 2022, 4:22 PM
douardda closed D6989: Add a Statsd.status_gauge() context manager.
Jan 20 2022, 3:32 PM
douardda committed rDCOREaba5c80765ad: Add a Statsd.status_gauge() context manager (authored by douardda).
Add a Statsd.status_gauge() context manager
Jan 20 2022, 3:32 PM
douardda requested review of D6989: Add a Statsd.status_gauge() context manager.
Jan 20 2022, 2:27 PM
douardda created P1260 (An Untitled Masterwork).
Jan 20 2022, 1:46 PM
douardda added a comment to D6944: Add a few statsd metrics in the kafka journal client.
In D6944#181139, @olasd wrote:

The code for the gauges feels like something that would be usefully handled with a context manager.

Something like (untested)

class StatsdStatusGauges:
    def __init__(self, metric_name: str, statuses: Collection[str], common_tags: Optional[Dict[str, str]] = None):
        self.metric_name = metric_name
        self.statuses = set(statuses)
        self.common_metrics = common_tags or {}
        self.current_status: Optional[str] = None

    def reset_gauges(self):
        self.current_status = None
        for status in self.statuses:
            statsd.gauge(self.metric_name, 0, {**self.common_tags, "status": status})

    def send_current_gauge(self, value: int):
        if self.current_status is not None:
            statsd.gauge(self.metric_name, value, {**self.common_tags, "status": self.current_status})

    def set(self, new_status: str):
        if new_status not in self.statuses:
            raise ValueError(f'{new_status} not in {self.statuses}')

        # May not be needed; May even be counter-productive if we want to send the gauges to keep them around in the statsd exporter
        if new_status == self.current_status:
            return

        self.send_current_gauge(0)
        self.current_status = new_status
        self.send_current_gauge(1)

    def __enter__(self):
        self.reset_gauges()
        return self

    def __exit__(self, *exc):
        self.reset_gauges()
        return False

Which would be used like:

with StatsdStatusGauges(JOURNAL_STATUS_METRIC, {"processing", "waiting"}) as status_gauge:
    [...]
    status_gauge.set("waiting")
    [...]
    status_gauge.set("processing")
Jan 20 2022, 1:31 PM
douardda added a comment to D6884: Add support for the rdkafka 'stats_cb' config option in JournalClient.
In D6884#181115, @olasd wrote:

I'm kinda wondering if this import stuff should move to a common module - I think we do kind of the same thing with entrypoints?

Jan 20 2022, 1:29 PM
douardda updated the diff for D6980: Add an "Hosting a mirror" page.

Rebase and update according suggestions

Jan 20 2022, 12:41 PM

Jan 19 2022

douardda requested review of D6980: Add an "Hosting a mirror" page.
Jan 19 2022, 5:37 PM

Jan 14 2022

douardda committed rMSLDbee9e94855e1: Tech talk at #swh5years (authored by douardda).
Tech talk at #swh5years
Jan 14 2022, 5:14 PM
douardda added inline comments to D6945: Make the copy process of blob objects run with thread concurrency.
Jan 14 2022, 5:04 PM
douardda closed D6943: Add support for env var substitution in statsd tags from STATSD_TAGS.
Jan 14 2022, 11:39 AM
douardda committed rDCOREde9b0c9fb441: Add support for env var substitution in statsd tags from STATSD_TAGS (authored by douardda).
Add support for env var substitution in statsd tags from STATSD_TAGS
Jan 14 2022, 11:39 AM
douardda retitled D6884: Add support for the rdkafka 'stats_cb' config option in JournalClient from [WIP] Add support for the rdkafka 'stats_cb' config option in JournalClient to Add support for the rdkafka 'stats_cb' config option in JournalClient.
Jan 14 2022, 10:52 AM
douardda requested review of D6944: Add a few statsd metrics in the kafka journal client.
Jan 14 2022, 10:50 AM
douardda abandoned D6875: Add statsd metrics in JournalClient.process.
Jan 14 2022, 10:40 AM
douardda updated the diff for D6943: Add support for env var substitution in statsd tags from STATSD_TAGS.

improve comment as suggested by ardumont

Jan 14 2022, 10:29 AM

Jan 13 2022

douardda updated the diff for D6945: Make the copy process of blob objects run with thread concurrency.

Add the cli option to configure this concurrency value

Jan 13 2022, 4:23 PM
douardda requested review of D6945: Make the copy process of blob objects run with thread concurrency.
Jan 13 2022, 4:08 PM
douardda requested review of D6943: Add support for env var substitution in statsd tags from STATSD_TAGS.
Jan 13 2022, 3:49 PM

Jan 12 2022

douardda accepted D6889: cassandra: Make content_missing run in linear time instead of quadratic.
Jan 12 2022, 11:32 AM
douardda accepted D6888: cassandra: Rewrite content_missing to run queries concurrently..

fine for me (but plz give a bit more insight)

Jan 12 2022, 11:28 AM

Jan 11 2022

douardda renamed T3841: regularly scrub all the data stores of swh from regularly scrub all the data sources of swh to regularly scrub all the data stores of swh.
Jan 11 2022, 12:32 PM · Datastore Scrubber, meta-task, Roadmap 2022, Storage manager
douardda removed a project from T3841: regularly scrub all the data stores of swh: Roadmap 2021.
Jan 11 2022, 12:31 PM · Datastore Scrubber, meta-task, Roadmap 2022, Storage manager
douardda triaged T3841: regularly scrub all the data stores of swh as Normal priority.
Jan 11 2022, 12:31 PM · Datastore Scrubber, meta-task, Roadmap 2022, Storage manager
douardda added a comment to T3544: Deal with GitHub removing support for git:// URLs.

I guess this is then related to T3653 somehow

Jan 11 2022, 10:55 AM · Origin-GitHub, Git loader

Jan 6 2022

douardda closed D6882: Remove 'process_timeout' from JournalClient's arguments.
Jan 6 2022, 2:25 PM