The deployment of the replayers is implemented.
The monitoring can also be easily deployed in a Kubernetes cluster that has the Prometheus operator installed [1].
The Grafana dashboards are also available as templates [1].

Aug 24 2022, 6:10 PM · System administration

vsellier committed rSKCONF1fef5d96e81b: Add grafana dashboards dedicated to cassandra and replayers statistics (authored by vsellier).

Add grafana dashboards dedicated to cassandra and replayers statistics

Aug 24 2022, 5:41 PM

vsellier accepted D8301: staging: Unify vlan declaration.

Aug 24 2022, 10:42 AM

vsellier committed R260:b2b7600f7a1c: fix statsd_exporter image version (authored by vsellier).

fix statsd_exporter image version

Aug 24 2022, 10:12 AM

vsellier committed R260:8d11190a0259: Deploy a prometheus statsd exporter and declare the monitoring (authored by vsellier).

Deploy a prometheus statsd exporter and declare the monitoring

Aug 24 2022, 10:09 AM

vsellier committed R260:767f83f70b95: Add more cassandra servers as seed to dispatch the connections accross all the… (authored by vsellier).

Add more cassandra servers as seed to dispatch the connections accross all the…

Aug 24 2022, 9:40 AM

Aug 23 2022

vsellier committed rSKCONF99b55cad41a6: Declare the monitoring of external cassandra nodes (authored by vsellier).

Declare the monitoring of external cassandra nodes

Aug 23 2022, 5:19 PM

Aug 22 2022

vsellier triaged T4455: Upgrade elk stack to a more recent version as Normal priority.

Aug 22 2022, 3:58 PM · System administration

vsellier triaged T4454: Create a dashboard for the swh-scrubber metrics as High priority.

Aug 22 2022, 3:54 PM · Datastore Scrubber, System administrators

vsellier committed R260:647876a4824a: remove useless name overriding (authored by vsellier).

remove useless name overriding

Aug 22 2022, 3:25 PM

vsellier committed R260:97d8e298ba2f: fix affinity criteria quoting (authored by vsellier).

fix affinity criteria quoting

Aug 22 2022, 2:27 PM

vsellier committed R260:60afde8112bc: affine the resources requests (authored by vsellier).

affine the resources requests

Aug 22 2022, 2:25 PM

vsellier committed R260:b95e7a1f37f3: Allow to specify the replayers's pods affinity (authored by vsellier).

Allow to specify the replayers's pods affinity

Aug 22 2022, 2:25 PM

vsellier triaged T4449: Azure Billing site not refreshed as Normal priority.

Aug 22 2022, 10:06 AM · System administration

Aug 20 2022

vsellier committed R260:dc40c24b4bfc: reduce the deployment history (authored by vsellier).

reduce the deployment history

Aug 20 2022, 11:58 PM

vsellier committed R260:a400f447256d: Add snapshots and increase the directory parallelism (authored by vsellier).

Add snapshots and increase the directory parallelism

Aug 20 2022, 11:52 PM

vsellier committed R260:b0983dbf143c: Adjust memory limits according the real values (authored by vsellier).

Adjust memory limits according the real values

Aug 20 2022, 11:52 PM

vsellier committed R260:06d3ec8f15ea: fix memory spec (authored by vsellier).

fix memory spec

Aug 20 2022, 12:02 PM

vsellier committed R260:a4ee511e00cc: ingest more object type (authored by vsellier).

ingest more object type

Aug 20 2022, 11:55 AM

vsellier committed R260:96f4eeefd246: Fix syntax issues with topics containing a _ (authored by vsellier).

Fix syntax issues with topics containing a _

Aug 20 2022, 11:33 AM

vsellier committed R260:ebed5df4cf6f: Configure autoscaling and add other origin_* topics (authored by vsellier).

Configure autoscaling and add other origin_* topics

Aug 20 2022, 11:19 AM

Aug 19 2022

vsellier committed R260:8b0d38664259: fix cpu reservation unit (authored by vsellier).

fix cpu reservation unit

Aug 19 2022, 8:00 PM

vsellier committed R260:d4a5f53e3035: Declare a cpu and memory request (authored by vsellier).

Declare a cpu and memory request

Aug 19 2022, 7:53 PM

vsellier committed R260:94747dd38910: Use internal kafka addresses (authored by vsellier).

Use internal kafka addresses

Aug 19 2022, 7:32 PM

vsellier committed rSKCONFe19517bb86ea: Depploy keda on cassandra-production cluster (authored by vsellier).

Depploy keda on cassandra-production cluster

Aug 19 2022, 6:09 PM

vsellier committed rSKCONF4434c969017a: declare the cassandra production replaying stack (authored by vsellier).

declare the cassandra production replaying stack

Aug 19 2022, 5:38 PM

vsellier committed R260:d9552225c7c6: Remove debug logs (authored by vsellier).

Remove debug logs

Aug 19 2022, 5:36 PM

vsellier committed rSPSITE42405baf257b: fix kafka credential creation script (authored by vsellier).

fix kafka credential creation script

Aug 19 2022, 5:25 PM

vsellier committed R260:a432372b70d2: bump swh-storage-replayer image versions (authored by vsellier).

bump swh-storage-replayer image versions

Aug 19 2022, 4:42 PM

vsellier committed R259:a3571bce9a00: use apt clean instead of removing all the dpkg content (authored by vsellier).

use apt clean instead of removing all the dpkg content

Aug 19 2022, 4:39 PM

vsellier committed R260:fb1c89846639: fix the way to quote the server in the init command (authored by vsellier).

fix the way to quote the server in the init command

Aug 19 2022, 3:47 PM

vsellier committed R260:346b62b533a7: fix init database command (authored by vsellier).

fix init database command

Aug 19 2022, 3:44 PM

vsellier committed R260:ec11b483cc36: Add an init cassandra option (authored by vsellier).

Add an init cassandra option

Aug 19 2022, 3:38 PM

vsellier committed R260:e6c297f6dc0c: remove port from cassandra host (authored by vsellier).

remove port from cassandra host

Aug 19 2022, 3:13 PM

vsellier committed R260:ea308371d426: Improve indentation v4 (authored by vsellier).

Improve indentation v4

Aug 19 2022, 3:06 PM

vsellier committed R260:223af97f5202: Improve indentation v3 (authored by vsellier).

Improve indentation v3

Aug 19 2022, 3:03 PM

vsellier committed R260:9a9b91c2f364: Improve indentation (authored by vsellier).

Improve indentation

Aug 19 2022, 2:48 PM

vsellier committed R260:cd6349bca7e9: improve indentation (authored by vsellier).

improve indentation

Aug 19 2022, 2:42 PM

vsellier committed R260:31ce35142c3a: fix the configmap name (authored by vsellier).

fix the configmap name

Aug 19 2022, 12:32 PM

vsellier committed R260:eb40b1cbbc6c: add more default values for cassandra replayer (authored by vsellier).

add more default values for cassandra replayer

Aug 19 2022, 12:17 PM

vsellier committed rSKCONF6a8eab9de494: Configure the cassandra-production cluster (authored by vsellier).

Configure the cassandra-production cluster

Aug 19 2022, 12:05 PM

vsellier committed rSKCONF438c33c127f1: Declare the swh namespace for the production-cassandra cluster (authored by vsellier).

Declare the swh namespace for the production-cassandra cluster

Aug 19 2022, 11:55 AM

vsellier committed R260:5fae66df0c4f: specify the journal client user (authored by vsellier).

specify the journal client user

Aug 19 2022, 11:41 AM

vsellier committed R260:6b45ca0686bd: prepare production deployment (authored by vsellier).

prepare production deployment

Aug 19 2022, 11:24 AM

vsellier committed rSPSITE5d845aa0805a: cassandra: extend directory permission to allow monitoring to check the disks (authored by vsellier).

cassandra: extend directory permission to allow monitoring to check the disks

Aug 19 2022, 10:05 AM

Aug 18 2022

vsellier closed T4446: Prepare the disks and configure zfs as Resolved.

All servers are reconfigured and Cassandra is started on them:

/opt/cassandra/bin/nodetool status
Datacenter: sesi_rocquencourt
=============================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address          Load       Tokens  Owns (effective)  Host ID                               Rack 
UN  192.168.100.184  88.65 KiB  16      34.3%             e0c24d24-6f68-4a26-8561-94e67b58211a  rack1
UN  192.168.100.181  84.71 KiB  16      31.3%             1d9b9e7d-b376-4afe-8f67-482e8412f21b  rack1
UN  192.168.100.186  69.07 KiB  16      34.2%             0dd3426d-9159-47bd-9b4e-065ff0fbb889  rack1
UN  192.168.100.183  69.08 KiB  16      37.1%             78281a92-7fa0-43bd-bc33-c5b419ee8715  rack1
UN  192.168.100.185  69.07 KiB  16      32.2%             abf9b69e-3cec-4ac3-a195-a54481e4d9da  rack1
UN  192.168.100.182  74.05 KiB  16      30.9%             eca5ea5d-8bd5-4301-9a5e-ffa01aa1b7e5  rack1
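
A quick way to check such output is to count the nodes reported as UN (Up/Normal). The following is a minimal sketch that runs on the rows pasted above; the helper name count_up_normal is hypothetical.

```shell
# Rows copied from the nodetool status output above.
status_output='UN  192.168.100.184  88.65 KiB  16      34.3%             e0c24d24-6f68-4a26-8561-94e67b58211a  rack1
UN  192.168.100.181  84.71 KiB  16      31.3%             1d9b9e7d-b376-4afe-8f67-482e8412f21b  rack1
UN  192.168.100.186  69.07 KiB  16      34.2%             0dd3426d-9159-47bd-9b4e-065ff0fbb889  rack1
UN  192.168.100.183  69.08 KiB  16      37.1%             78281a92-7fa0-43bd-bc33-c5b419ee8715  rack1
UN  192.168.100.185  69.07 KiB  16      32.2%             abf9b69e-3cec-4ac3-a195-a54481e4d9da  rack1
UN  192.168.100.182  74.05 KiB  16      30.9%             eca5ea5d-8bd5-4301-9a5e-ffa01aa1b7e5  rack1'

# Hypothetical helper: count the lines whose first field is "UN".
count_up_normal() {
  awk '$1 == "UN" { n++ } END { print n + 0 }'
}

up_nodes=$(printf '%s\n' "$status_output" | count_up_normal)
echo "nodes Up/Normal: $up_nodes"   # prints "nodes Up/Normal: 6"
```

On a live node, the same helper could be fed directly: /opt/cassandra/bin/nodetool status | count_up_normal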

Aug 18 2022, 7:00 PM · Storage manager, System administration

vsellier closed T4446: Prepare the disks and configure zfs, a subtask of T4373: [cassandra] Test the new hardware, as Resolved.

Aug 18 2022, 7:00 PM · Storage manager, System administration

vsellier added a comment to T4446: Prepare the disks and configure zfs.

Recreating the zpool correctly:

# mixeduse
ls /dev/disk/by-id/nvme-MO003200KXAVU* | grep -v part | xargs -t zpool create -o ashift=12 -O mountpoint=none mixeduse
zfs create -o mountpoint=/srv/cassandra/instance1/data mixeduse/cassandra-instance1-data

Aug 18 2022, 6:16 PM · Storage manager, System administration

vsellier added a comment to T4446: Prepare the disks and configure zfs.

Testing the performance of the different configurations (on a zfs pool with only one disk):

disk block: 512 B / zpool ashift: 9

zpool create -o ashift=9 -O mountpoint=none mixeduse /dev/disk/by-id/nvme-MO003200KXAVU_SJA4N7938I0405A0U
zfs create -o mountpoint=/srv/cassandra/instance1/data -o atime=off -o relatime=on mixeduse/cassandra-data
cd /srv/cassandra/instance1/data
bonnie++ -d . -m cassandra04 -u nobody
Using uid:65534, gid:65534.
Writing a byte at a time...done
Writing intelligently...done
Rewriting...done
Reading a byte at a time...done
Reading intelligently...done
start 'em...done...done...done...done...done...
Create files in sequential order...done.
Stat files in sequential order...done.
Delete files in sequential order...done.
Create files in random order...done.
Stat files in random order...done.
Delete files in random order...done.
Version  2.00       ------Sequential Output------ --Sequential Input- --Random-
                    -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Name:Size etc        /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
cassandra04 515496M  293k  99  1.0g  99  703m  99  661k  99  1.4g  91 13717 463
Latency             48216us    7316us    8224us   23303us    7928us    1606us
Version  2.00       ------Sequential Create------ --------Random Create--------
cassandra04         -Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
              files  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
                 16 16384  98 +++++ +++ 16384   8 +++++ +++ +++++ +++ 16384  99
Latency              2679us    1207us    4851ms    2850us     138us     301us
1.98,2.00,cassandra04,1,1659338044,515496M,,8192,5,293,99,1080974,99,720299,99,661,99,1488832,91,13717,463,16,,,,,28232,98,+++++,+++,2018,8,+++++,+++,+++++,+++,24821,99,48216us,7316us,8224us,23303us,7928us,1606us,2679us,1207us,4851ms,2850us,138us,301us
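
For reference, the ashift pool property is the base-2 exponent of the sector size ZFS assumes, so the ashift=9 and ashift=12 values used in these comments map to 512-byte and 4096-byte sectors. A minimal sketch of the conversion (the helper name ashift_to_bytes is hypothetical):

```shell
# Hypothetical helper: convert a ZFS ashift value to a sector size in bytes.
ashift_to_bytes() {
  echo $(( 1 << $1 ))
}

for ashift in 9 12; do
  echo "ashift=$ashift -> $(ashift_to_bytes "$ashift") bytes"
done
# prints:
# ashift=9 -> 512 bytes
# ashift=12 -> 4096 bytes
```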

Aug 18 2022, 5:49 PM · Storage manager, System administration

vsellier added a comment to T4446: Prepare the disks and configure zfs.

The nvme format command didn't succeed on the write-intensive disk. It never exits, and the disk becomes unresponsive afterwards.

Aug 18 2022, 5:15 PM · Storage manager, System administration

vsellier updated the task description for T4446: Prepare the disks and configure zfs.

Aug 18 2022, 4:47 PM · Storage manager, System administration

vsellier changed the status of T4446: Prepare the disks and configure zfs from Open to Work in Progress.

Aug 18 2022, 4:46 PM · Storage manager, System administration

vsellier committed rSPSITE115a30d19218: cassandra: Add missing dependencies (authored by vsellier).

cassandra: Add missing dependencies

Aug 18 2022, 2:26 PM

vsellier committed rSPSITEa6e31c14466c: cassandra: Remove the no longer used cassandra.yaml template (authored by vsellier).

cassandra: Remove the no longer used cassandra.yaml template

Aug 18 2022, 11:52 AM

vsellier committed rSPRE8791a2ec762d: cassandra: Cleanup extra mounts of the cassandra directories (authored by vsellier).

cassandra: Cleanup extra mounts of the cassandra directories

Aug 18 2022, 10:58 AM

vsellier closed D8236: cassandra: Refactor the former installation scripts.

Aug 18 2022, 10:48 AM

vsellier committed rSPSITE57d978d9e3de: cassandra: Refactor the former installation scripts (authored by vsellier).

cassandra: Refactor the former installation scripts

Aug 18 2022, 10:48 AM

Aug 17 2022

vsellier committed R260:8f12e7df4312: Bootstrap a swh chart only with the cassandra replayer to starting (authored by vsellier).

Bootstrap a swh chart only with the cassandra replayer to starting

Aug 17 2022, 7:20 PM

vsellier committed R259:c9e97cbea950: add swh-storage-replayer application (authored by vsellier).

add swh-storage-replayer application

Aug 17 2022, 3:37 PM

vsellier updated the diff for D8236: cassandra: Refactor the former installation scripts.

use inline_yaml instead of to_yaml
thanks for the hint, I completely forgot about it

Aug 17 2022, 2:27 PM

Aug 16 2022

vsellier committed rSKCONF0e4bc93720bd: fix the private data repository url (authored by vsellier).

fix the private data repository url

Aug 16 2022, 11:17 PM

vsellier committed rSKCONFa07bd71569b1: fix indentation (authored by vsellier).

fix indentation

Aug 16 2022, 11:14 PM

vsellier committed rSKCONF377f015e0d20: fix indentation (authored by vsellier).

fix indentation

Aug 16 2022, 11:13 PM

vsellier committed rSKCONF6f65eb6f12f0: Configure the argocd applications configuration (authored by vsellier).

Configure the argocd applications configuration

Aug 16 2022, 10:59 PM

vsellier committed rSKCONFff61fc33d2f4: Activate applications autosync (authored by vsellier).

Activate applications autosync

Aug 16 2022, 8:18 PM

vsellier committed rSKCONF59bd7205efc8: declare argocd own applications and prepare the cassandra cluster ones (authored by vsellier).

declare argocd own applications and prepare the cassandra cluster ones

Aug 16 2022, 8:15 PM

vsellier committed R261:15987cb50831: Declare the private repository that will contain the k8s clusters secrets (authored by vsellier).

Declare the private repository that will contain the k8s clusters secrets

Aug 16 2022, 4:53 PM

vsellier updated the diff for D8236: cassandra: Refactor the former installation scripts.

Rebase
Override the jmx port value via the JVM_EXTRA_OPTS environment
Inline the cassandra.yaml properties in hiera

Aug 16 2022, 4:03 PM

vsellier committed rSENVe2be4700fac9: Increase cassandra nodes memory (authored by vsellier).

Increase cassandra nodes memory

Aug 16 2022, 2:07 PM

Aug 12 2022

vsellier added a comment to D8236: cassandra: Refactor the former installation scripts.

Setting -Dcassandra.jmx.local.port / -Dcassandra.jmx.remote.port / -Dcom.sun.management.jmxremote.rmi.port in the $JVM_EXTRA_OPTS env var should override what this file configures.
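
As a rough illustration of that suggestion, the variable could be built as below. This is only a sketch: the port value 7200 is hypothetical, and it assumes the remote JMX properties are the ones that apply (for a local-only JMX setup, -Dcassandra.jmx.local.port would be used instead).

```shell
# Hypothetical port, chosen only for illustration.
jmx_port=7200

# Assumption: remote JMX is in use, so the remote port and the RMI port are set.
JVM_EXTRA_OPTS="-Dcassandra.jmx.remote.port=${jmx_port}"
JVM_EXTRA_OPTS="${JVM_EXTRA_OPTS} -Dcom.sun.management.jmxremote.rmi.port=${jmx_port}"
export JVM_EXTRA_OPTS

echo "$JVM_EXTRA_OPTS"
```

With one such variable per instance, each Cassandra instance on a host could be given its own JMX port without editing the shared environment file.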

Aug 12 2022, 4:10 PM

vsellier added a comment to D8236: cassandra: Refactor the former installation scripts.

I specified why on the first line of the file, but I realize it is not very visible.

Aug 12 2022, 3:46 PM

vsellier requested review of D8242: sysadmin: bootstrap the cassandra documentation.

Aug 12 2022, 3:45 PM

vsellier added a revision to T4373: [cassandra] Test the new hardware: D8242: sysadmin: bootstrap the cassandra documentation.

Aug 12 2022, 3:34 PM · Storage manager, System administration

vsellier renamed T4435: scrubber log verbosity puts a risk on ELK from scrubber log verbosity put a risk on ELK to scrubber log verbosity puts a risk on ELK.

Aug 12 2022, 12:53 PM · Datastore Scrubber, System administrators

vsellier triaged T4435: scrubber log verbosity puts a risk on ELK as High priority.

Aug 12 2022, 12:52 PM · Datastore Scrubber, System administrators

vsellier changed the status of T4391: [cassandra] deploy the replaying stack, a subtask of T4373: [cassandra] Test the new hardware, from Open to Work in Progress.

Aug 12 2022, 11:10 AM · Storage manager, System administration

vsellier changed the status of T4391: [cassandra] deploy the replaying stack from Open to Work in Progress.

Aug 12 2022, 11:10 AM · System administration

vsellier added a comment to T4373: [cassandra] Test the new hardware.

The Puppet code is ready for review. It was updated to support multi-instance deployments in anticipation of T4375.

Aug 12 2022, 11:09 AM · Storage manager, System administration

vsellier requested review of D8236: cassandra: Refactor the former installation scripts.

Aug 12 2022, 11:01 AM

vsellier added a revision to T4373: [cassandra] Test the new hardware: D8236: cassandra: Refactor the former installation scripts.

Aug 12 2022, 11:01 AM · Storage manager, System administration

Aug 11 2022

vsellier committed rSPSITEfbbc0eb13e49: vagrant: fix the internal_network declaration (authored by vsellier).

vagrant: fix the internal_network declaration

Aug 11 2022, 2:19 PM

vsellier committed rSENVaef11d699700: vagrant: Fix the initial environment synchronization (authored by vsellier).

vagrant: Fix the initial environment synchronization

Aug 11 2022, 12:11 PM

Aug 10 2022

vsellier committed rSPRE5fc8dc9b2626: cassandra: expose the log directory to the pods (authored by vsellier).

cassandra: expose the log directory to the pods

Aug 10 2022, 12:53 PM

vsellier added a comment to T4373: [cassandra] Test the new hardware.

For the record, the issues related to the commitlog_directory configuration:

Aug 10 2022, 11:07 AM · Storage manager, System administration

vsellier added a comment to T4373: [cassandra] Test the new hardware.

After spending some time successfully starting a two-node Cassandra cluster with a declarative configuration, these are the observations:

A service can't be used to expose the Cassandra ports to the cluster; the pod address must be used. This is because Cassandra uses the DNS name provided as the listen address.
It should work by setting the listen address to 0.0.0.0, but the documentation strongly recommends against it:

Setting listen_address to 0.0.0.0 is always wrong.

Using the internal pod addresses will prevent multi-DC deployments in the future.
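
To make the constraint concrete, here is a minimal sketch of a guard that a container entrypoint could use to emit the listen_address line from the pod address and refuse 0.0.0.0. The function name render_listen_address, the POD_IP variable (which could be populated from the Kubernetes downward API field status.podIP), and the fallback address are all assumptions, not part of the actual deployment.

```shell
# Hypothetical helper: print a cassandra.yaml listen_address line,
# rejecting an empty value and the 0.0.0.0 wildcard.
render_listen_address() {
  case "$1" in
    ""|0.0.0.0)
      echo "refusing listen_address '$1'" >&2
      return 1
      ;;
    *)
      printf 'listen_address: %s\n' "$1"
      ;;
  esac
}

# Assumption: POD_IP is injected into the container; the fallback is illustrative.
POD_IP="${POD_IP:-10.42.0.15}"
render_listen_address "$POD_IP"
```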

Aug 10 2022, 10:30 AM · Storage manager, System administration

vsellier committed rSKCONF8e37b0f5df0c: cassandra: poc working manual declarative configuration (authored by vsellier).

cassandra: poc working manual declarative configuration

Aug 10 2022, 9:58 AM

Aug 9 2022

vsellier committed rSPRE73d451f1b506: cassandra: Remove ballooning for the control-plane nodes (authored by vsellier).

cassandra: Remove ballooning for the control-plane nodes

Aug 9 2022, 5:07 PM

vsellier committed R260:6908e14f5cf3: workers: try to stop argocd to override the keda status (authored by vsellier).

workers: try to stop argocd to override the keda status

Aug 9 2022, 2:24 PM

vsellier accepted D8209: docker: update the list of created topics.

Aug 9 2022, 11:09 AM

Aug 8 2022

vsellier closed T4414: Outboard sysadm intern Amadou Thiam as Resolved.

Aug 8 2022, 3:11 PM · System administration

vsellier added a comment to T4414: Outboard sysadm intern Amadou Thiam.

VPN certificate revoked in OPNsense

Aug 8 2022, 3:11 PM · System administration

vsellier updated the task description for T4414: Outboard sysadm intern Amadou Thiam.

Aug 8 2022, 3:11 PM · System administration

vsellier changed the status of T4414: Outboard sysadm intern Amadou Thiam from Open to Work in Progress.

Aug 8 2022, 3:06 PM · System administration

vsellier accepted D8212: docker: add a generic 'swh' command in storage and worker entrypoint scripts.

looks good, thanks.

Aug 8 2022, 1:53 PM

vsellier accepted D8213: docker: move all 'static' kafka config entries in env/kafka.env.

I'm not sure whether it would be better to keep the KAFKA_BROKER_ID close to the instance declaration, as it needs to be unique in a cluster and can't be shared between the instances.
But otherwise, thanks, it's simpler like that.

Aug 8 2022, 1:45 PM
