general system administration tasks, not specific to any product
Details
Details
Description
Fri, May 20
Fri, May 20
ardumont added a comment to T4064: Test GitLab migration scripts.
Clean up gitlab instance
ardumont added a comment to T4064: Test GitLab migration scripts.
Gist of the actions are currently:
- Mirror the forgerie repository to add some docker commands to allow sandboxed execution (see diffs ^)
- Adaptations in the forgerie code source to allow migration runs with our current gitlab/phabricator instances (see diffs ^)
- Current runs are working a bit but do not succeed entirely
ardumont updated the task description for T4064: Test GitLab migration scripts.
ardumont added a revision to T4064: Test GitLab migration scripts: D7865: Allow remote mysql connection.
ardumont added a revision to T4064: Test GitLab migration scripts: D7858: Allow main script executions within container.
ardumont added a revision to T4064: Test GitLab migration scripts: D7856: docs: Fix typos and clean up whitespace.
Thu, May 19
Thu, May 19
Wed, May 18
Wed, May 18
vsellier added projects to T4258: [add forge now] email inbound not catched by django: Add Forge Now , System administration.
Tue, May 17
Tue, May 17
vsellier reopened T4251: [swh-search] Investigate long search queries response time as "Work in Progress".
vsellier edited projects for T4251: [swh-search] Investigate long search queries response time, added: System administration; removed System administrators.
Mon, May 16
Mon, May 16
ardumont updated the task description for T4144: Elastic worker infrastructure.
vsellier renamed T4247: journalbeat failed to start after reboot from journalbeat fails to start after reboot to journalbeat failed to start after reboot.
vsellier renamed T4247: journalbeat failed to start after reboot from journalbeat fail to start after reboot to journalbeat fails to start after reboot.
vsellier added a comment to T4247: journalbeat failed to start after reboot.
the file /var/lib/journalbeat/registry looks corrupted:
on worker10.euwest:
root@worker10:/var/lib/journalbeat# cat registry <?xml version="1.0" encoding="utf-8"?> <GoalState xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="goalstate10.xsd"> <Version>2012-11-30</Version> <Incarnation>1</Incarnation> <Machine> <ExpectedState>Started</ExpectedState> <StopRolesDeadlineHint>3
on worker09.euwest:
root@worker09:/var/lib/journalbeat# cat registry update_time: 2022-05-16T07:11:29.680690647Z journal_entries: - path: LOCAL_SYSTEM_JOURNAL cursor: s=1b5676c17e22450b80579b9caf065703;i=659f65c;b=97b0842367c749299a4a12ec839f1c3b;m=5b66c4ba4c0;t=5df1bbb86b72f;x=8e43c09dfc1a706e realtime_timestamp: 1652685086832431 monotonic_timestamp: 6281059083456
vsellier changed the status of T4247: journalbeat failed to start after reboot from Open to Work in Progress.
Fri, May 13
Fri, May 13
ardumont added a parent task for T4242: Deployed loader.git v1.8: T4219: Investigate why GitHub fork detection did not bring a speed-up.
ardumont added a comment to T4242: Deployed loader.git v1.8.
Dashboard to check for improvments [1]
ardumont moved T4242: Deployed loader.git v1.8 from in-progress to deployed/landed/monitoring on the System administration board.
ardumont renamed T4243: Deploy loader.metadata credentials for high and oneshot loaders from Deploy loader.metadata credentials for high and oneshot loader to Deploy loader.metadata credentials for high and oneshot loaders.
ardumont updated the task description for T4243: Deploy loader.metadata credentials for high and oneshot loaders.
ardumont changed the status of T4243: Deploy loader.metadata credentials for high and oneshot loaders from Open to Work in Progress.
ardumont triaged T4243: Deploy loader.metadata credentials for high and oneshot loaders as Normal priority.
ardumont updated the task description for T4242: Deployed loader.git v1.8.
anlambert added projects to T3746: staging: Deploy maven indexer/lister/loader: Maven lister, Maven loader.
vlorentz added a comment to T4225: Deploy a more recent version of prometheus-statsd-exporter on all nodes.
thanks, btw! :)
olasd added a comment to T4225: Deploy a more recent version of prometheus-statsd-exporter on all nodes.
That's deployed on all nodes and validated as working now.
olasd closed T4225: Deploy a more recent version of prometheus-statsd-exporter on all nodes as Resolved.
ardumont moved T4238: Deploy latest loaders version (> v3.4) from in-progress to deployed/landed/monitoring on the System administration board.
ardumont updated the task description for T4238: Deploy latest loaders version (> v3.4).
ardumont updated the task description for T4238: Deploy latest loaders version (> v3.4).
ardumont updated the task description for T4238: Deploy latest loaders version (> v3.4).
ardumont moved T4238: Deploy latest loaders version (> v3.4) from Weekly backlog to in-progress on the System administration board.
olasd added a comment to T4225: Deploy a more recent version of prometheus-statsd-exporter on all nodes.
Puppet ran manually on staging workers now, and the new statsd exporter has been deployed. The metrics properly show up in prometheus.
ardumont updated the task description for T4238: Deploy latest loaders version (> v3.4).
ardumont updated the task description for T4238: Deploy latest loaders version (> v3.4).