In D7039#183023, @olasd wrote:Thanks.
--build-dep-resolver=aptitude should only be used when building with an extra-repository which has a non-default priority, that is only when using a -backports suite (so only for the bullseye and buster instructions). It should probably be documented in the list of "useful options" rather than as the default.
- Queries
- All Stories
- Search
- Advanced Search
- Transactions
- Transaction Logs
Feed Advanced Search
Advanced Search
Advanced Search
Jan 27 2022
Jan 27 2022
Jan 26 2022
Jan 26 2022
Jan 25 2022
Jan 25 2022
Update displayed copyright to 2022
Jan 24 2022
Jan 24 2022
douardda triaged T3881: Mirror - handle handling of multiple kafka messages for the same object as High priority.
It might need a dedicated bot user to be created on hedgedoc also.
Things to fix in the script:
douardda committed rDDOC52c6a83ea2cd: Fix rst syntax in mirror-operations/docker.rst (authored by douardda).
Fix rst syntax in mirror-operations/docker.rst
douardda committed rDDOC7396e4263eec: Update the 'updating a configuration' section of the swarm-based mirror (authored by douardda).
Update the 'updating a configuration' section of the swarm-based mirror
Use roles as suggested by vlorentz
Jan 21 2022
Jan 21 2022
douardda committed rDOBJSRPLeca8d7714b00: Add a quickstart section in the doc (authored by douardda).
Add a quickstart section in the doc
yat (yet-another-typo)
douardda committed rDJNL5a26dae22928: Add a few statsd metrics in the kafka journal client (authored by douardda).
Add a few statsd metrics in the kafka journal client
more typos
and fix the spurious () (thx ardumont)
fix rst syntax (thx D6995 review)
douardda committed rDJNL5b17c50d3280: Add support for the rdkafka 'stats_cb' config option in get_journal_client (authored by douardda).
Add support for the rdkafka 'stats_cb' config option in get_journal_client
Add an "Hosting a mirror" page
douardda committed rDOBJSRPL0dffebc423a1: Make the copy process of blob objects run with thread concurrency (authored by douardda).
Make the copy process of blob objects run with thread concurrency
Jan 20 2022
Jan 20 2022
Is there a reason not to close this task?
update using new statsd.status_gauge() context manager (in swh.core 1.1)
douardda committed rDCOREaba5c80765ad: Add a Statsd.status_gauge() context manager (authored by douardda).
Add a Statsd.status_gauge() context manager
In D6944#181139, @olasd wrote:The code for the gauges feels like something that would be usefully handled with a context manager.
Something like (untested)
class StatsdStatusGauges: def __init__(self, metric_name: str, statuses: Collection[str], common_tags: Optional[Dict[str, str]] = None): self.metric_name = metric_name self.statuses = set(statuses) self.common_metrics = common_tags or {} self.current_status: Optional[str] = None def reset_gauges(self): self.current_status = None for status in self.statuses: statsd.gauge(self.metric_name, 0, {**self.common_tags, "status": status}) def send_current_gauge(self, value: int): if self.current_status is not None: statsd.gauge(self.metric_name, value, {**self.common_tags, "status": self.current_status}) def set(self, new_status: str): if new_status not in self.statuses: raise ValueError(f'{new_status} not in {self.statuses}') # May not be needed; May even be counter-productive if we want to send the gauges to keep them around in the statsd exporter if new_status == self.current_status: return self.send_current_gauge(0) self.current_status = new_status self.send_current_gauge(1) def __enter__(self): self.reset_gauges() return self def __exit__(self, *exc): self.reset_gauges() return FalseWhich would be used like:
with StatsdStatusGauges(JOURNAL_STATUS_METRIC, {"processing", "waiting"}) as status_gauge: [...] status_gauge.set("waiting") [...] status_gauge.set("processing")
douardda added a comment to D6884: Add support for the rdkafka 'stats_cb' config option in JournalClient.
In D6884#181115, @olasd wrote:I'm kinda wondering if this import stuff should move to a common module - I think we do kind of the same thing with entrypoints?
Rebase and update according suggestions
Jan 19 2022
Jan 19 2022
Jan 14 2022
Jan 14 2022
Tech talk at #swh5years
douardda added inline comments to D6945: Make the copy process of blob objects run with thread concurrency.
douardda committed rDCOREde9b0c9fb441: Add support for env var substitution in statsd tags from STATSD_TAGS (authored by douardda).
Add support for env var substitution in statsd tags from STATSD_TAGS
douardda retitled D6884: Add support for the rdkafka 'stats_cb' config option in JournalClient from [WIP] Add support for the rdkafka 'stats_cb' config option in JournalClient to Add support for the rdkafka 'stats_cb' config option in JournalClient.
douardda updated the diff for D6943: Add support for env var substitution in statsd tags from STATSD_TAGS.
improve comment as suggested by ardumont
Jan 13 2022
Jan 13 2022
douardda updated the diff for D6945: Make the copy process of blob objects run with thread concurrency.
Add the cli option to configure this concurrency value
douardda requested review of D6945: Make the copy process of blob objects run with thread concurrency.
douardda requested review of D6943: Add support for env var substitution in statsd tags from STATSD_TAGS.
Jan 12 2022
Jan 12 2022
fine for me (but plz give a bit more insight)
Jan 11 2022
Jan 11 2022
douardda renamed T3841: regularly scrub all the data stores of swh from regularly scrub all the data sources of swh to regularly scrub all the data stores of swh.
I guess this is then related to T3653 somehow
Jan 6 2022
Jan 6 2022
douardda committed rDJNL0d115993e0f1: Remove 'process_timeout' from JournalClient's arguments (authored by douardda).
Remove 'process_timeout' from JournalClient's arguments
douardda updated the diff for D6884: Add support for the rdkafka 'stats_cb' config option in JournalClient.
move the code in get_journal_client
make the warning an exception, as suggested by vlorentz
douardda requested review of D6884: Add support for the rdkafka 'stats_cb' config option in JournalClient.
Jan 4 2022
Jan 4 2022
In D6875#178755, @olasd wrote:statsd.timing/statsd.timed do full histograms. Do we really want to keep bucketed counts for all of these values, or just a running total?
Gawd this is horrible! (not your fault!)
I'd feel more comfortable also if we have a good (aka documented and understood) reason for doing this.
douardda committed rDOBJSRPLa2d1aa994400: Move the 'error_reporter' config entry in a dedicated 'replayer' section (authored by douardda).
Move the 'error_reporter' config entry in a dedicated 'replayer' section
douardda added inline comments to D6818: Move the 'error_reporter' config entry in a dedicated 'replayer' section.
douardda updated the diff for D6818: Move the 'error_reporter' config entry in a dedicated 'replayer' section.
vlorentz' comment + update copyright timestamps
douardda added inline comments to D6818: Move the 'error_reporter' config entry in a dedicated 'replayer' section.
closed by 259bf6fe1e3bacbcd2e91f8f3d55d49f5219892c
douardda committed rDSTO259bf6fe1e3b: Improve documentation of the replay command (authored by douardda).
Improve documentation of the replay command
douardda committed rDSTO1071781d8483: Move the 'error_reporter' config entry in a dedicated 'replayer' section (authored by douardda).
Move the 'error_reporter' config entry in a dedicated 'replayer' section
Jan 3 2022
Jan 3 2022
In T3134#76032, @vlorentz wrote:wishlist: it would be nice ot be able to check the whole hash of a revision/release even when the author name/email are replaced by a hash. (eg. by making SWHIDv2 a tree hash)
Dec 16 2021
Dec 16 2021
I guess https://grafana.softwareheritage.org/d/d3l2oqXWz/s3-object-copy?orgId=1 is almost an answer to this task
Dec 13 2021
Dec 13 2021
LGTM (but see my comment)
Dec 10 2021
Dec 10 2021
Same comment as in D6712
douardda added inline comments to D6714: Add support to flatten directories in the isochrone frontiers separately.
douardda added inline comments to D6712: Add explicit flag for flattenned directories to `ProvenanceStorageInterface`.
douardda added inline comments to D6712: Add explicit flag for flattenned directories to `ProvenanceStorageInterface`.
douardda updated subscribers of D6714: Add support to flatten directories in the isochrone frontiers separately.
douardda requested changes to D6712: Add explicit flag for flattenned directories to `ProvenanceStorageInterface`.
It would really nice to have a better explanation of what this flag is added for and why. Why do we add complexity in the code for this? I know there are good reasons for that, but I cannot see them just reading the code or the commit message.
There is a small typo in the commit message (adn instead of and)
Dec 9 2021
Dec 9 2021
douardda requested review of D6817: Move the 'error_reporter' config entry in a dedicated 'replayer' section.
douardda requested review of D6818: Move the 'error_reporter' config entry in a dedicated 'replayer' section.
Dec 6 2021
Dec 6 2021
Dec 3 2021
Dec 3 2021
douardda committed rDOBJSRPL5da286e8d1db: Updated debian changelog for version 0.3.1-2 (authored by douardda).
Updated debian changelog for version 0.3.1-2
douardda committed rDOBJSRPL093687737c11: Add forgotten build-dependency on redis-server and python3-pytest-redis (authored by douardda).
Add forgotten build-dependency on redis-server and python3-pytest-redis