Page MenuHomeSoftware Heritage
Feed Advanced Search

Aug 19 2022

vsellier committed R260:346b62b533a7: fix init database command (authored by vsellier).
fix init database command
Aug 19 2022, 3:44 PM
vsellier committed R260:ec11b483cc36: Add an init cassandra option (authored by vsellier).
Add an init cassandra option
Aug 19 2022, 3:38 PM
vsellier committed R260:e6c297f6dc0c: remove port from cassandra host (authored by vsellier).
remove port from cassandra host
Aug 19 2022, 3:13 PM
vsellier committed R260:ea308371d426: Improve indentation v4 (authored by vsellier).
Improve indentation v4
Aug 19 2022, 3:06 PM
vsellier committed R260:223af97f5202: Improve indentation v3 (authored by vsellier).
Improve indentation v3
Aug 19 2022, 3:03 PM
vsellier committed R260:9a9b91c2f364: Improve indentation (authored by vsellier).
Improve indentation
Aug 19 2022, 2:48 PM
vsellier committed R260:cd6349bca7e9: improve indentation (authored by vsellier).
improve indentation
Aug 19 2022, 2:42 PM
vsellier committed R260:31ce35142c3a: fix the configmap name (authored by vsellier).
fix the configmap name
Aug 19 2022, 12:32 PM
vsellier committed R260:eb40b1cbbc6c: add more default values for cassandra replayer (authored by vsellier).
add more default values for cassandra replayer
Aug 19 2022, 12:17 PM
vsellier committed rSKCONF6a8eab9de494: Configure the cassandra-production cluster (authored by vsellier).
Configure the cassandra-production cluster
Aug 19 2022, 12:05 PM
vsellier committed rSKCONF438c33c127f1: Declare the swh namespace for the production-cassandra cluster (authored by vsellier).
Declare the swh namespace for the production-cassandra cluster
Aug 19 2022, 11:55 AM
vsellier committed R260:5fae66df0c4f: specify the journal client user (authored by vsellier).
specify the journal client user
Aug 19 2022, 11:41 AM
vsellier committed R260:6b45ca0686bd: prepare production deployment (authored by vsellier).
prepare production deployment
Aug 19 2022, 11:24 AM
vsellier committed rSPSITE5d845aa0805a: cassandra: extend directory permission to allow monitoring to check the disks (authored by vsellier).
cassandra: extend directory permission to allow monitoring to check the disks
Aug 19 2022, 10:05 AM

Aug 18 2022

vsellier closed T4446: Prepare the disks and configure zfs as Resolved.

all server reconfigured and cassandra started on them:

/opt/cassandra/bin/nodetool status
Datacenter: sesi_rocquencourt
=============================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address          Load       Tokens  Owns (effective)  Host ID                               Rack 
UN  192.168.100.184  88.65 KiB  16      34.3%             e0c24d24-6f68-4a26-8561-94e67b58211a  rack1
UN  192.168.100.181  84.71 KiB  16      31.3%             1d9b9e7d-b376-4afe-8f67-482e8412f21b  rack1
UN  192.168.100.186  69.07 KiB  16      34.2%             0dd3426d-9159-47bd-9b4e-065ff0fbb889  rack1
UN  192.168.100.183  69.08 KiB  16      37.1%             78281a92-7fa0-43bd-bc33-c5b419ee8715  rack1
UN  192.168.100.185  69.07 KiB  16      32.2%             abf9b69e-3cec-4ac3-a195-a54481e4d9da  rack1
UN  192.168.100.182  74.05 KiB  16      30.9%             eca5ea5d-8bd5-4301-9a5e-ffa01aa1b7e5  rack1
Aug 18 2022, 7:00 PM · Storage manager, System administration
vsellier closed T4446: Prepare the disks and configure zfs, a subtask of T4373: [cassandra] Test the new hardware, as Resolved.
Aug 18 2022, 7:00 PM · Storage manager, System administration
vsellier added a comment to T4446: Prepare the disks and configure zfs.

Recreating the zpool correctly:

# mixedused
ls /dev/disk/by-id/nvme-MO003200KXAVU* | grep -v part | xargs -t zpool create -o ashift=12 -O mountpoint=none mixeduse
zfs create -o mountpoint=/srv/cassandra/instance1/data mixeduse/cassandra-instance1-data
Aug 18 2022, 6:16 PM · Storage manager, System administration
vsellier added a comment to T4446: Prepare the disks and configure zfs.

Testing the performances of the different configuration (on a zfs pool with only one disk):

  • disk block: 512k / zpool ashift:9
zpool create -o ashift=9 -O mountpoint=none mixeduse /dev/disk/by-id/nvme-MO003200KXAVU_SJA4N7938I0405A0U
zfs create -o mountpoint=/srv/cassandra/instance1/data -o atime=off -o relatime=on mixeduse/cassandra-data
cd /srv/cassandra/instance1/data
bonnie++ -d . -m cassandra04 -u nobody                                                                                                                                                                                                                                                 
Using uid:65534, gid:65534.
Writing a byte at a time...done
Writing intelligently...done
Rewriting...done
Reading a byte at a time...done
Reading intelligently...done
start 'em...done...done...done...done...done...
Create files in sequential order...done.
Stat files in sequential order...done.
Delete files in sequential order...done.
Create files in random order...done.
Stat files in random order...done.
Delete files in random order...done.
Version  2.00       ------Sequential Output------ --Sequential Input- --Random-
                    -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Name:Size etc        /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
cassandra04 515496M  293k  99  1.0g  99  703m  99  661k  99  1.4g  91 13717 463
Latency             48216us    7316us    8224us   23303us    7928us    1606us
Version  2.00       ------Sequential Create------ --------Random Create--------
cassandra04         -Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
              files  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
                 16 16384  98 +++++ +++ 16384   8 +++++ +++ +++++ +++ 16384  99
Latency              2679us    1207us    4851ms    2850us     138us     301us
1.98,2.00,cassandra04,1,1659338044,515496M,,8192,5,293,99,1080974,99,720299,99,661,99,1488832,91,13717,463,16,,,,,28232,98,+++++,+++,2018,8,+++++,+++,+++++,+++,24821,99,48216us,7316us,8224us,23303us,7928us,1606us,2679us,1207us,4851ms,2850us,138us,301us
Aug 18 2022, 5:49 PM · Storage manager, System administration
vsellier added a comment to T4446: Prepare the disks and configure zfs.

The nvme format command didn't succeed on the write intensive disk. It never exits and the disk become unresponsive after that.

Aug 18 2022, 5:15 PM · Storage manager, System administration
vsellier updated the task description for T4446: Prepare the disks and configure zfs.
Aug 18 2022, 4:47 PM · Storage manager, System administration
vsellier changed the status of T4446: Prepare the disks and configure zfs from Open to Work in Progress.
Aug 18 2022, 4:46 PM · Storage manager, System administration
vsellier committed rSPSITE115a30d19218: cassandra: Add missing dependencies (authored by vsellier).
cassandra: Add missing dependencies
Aug 18 2022, 2:26 PM
vsellier committed rSPSITEa6e31c14466c: cassandra: Remove the no longer used cassandra.yaml template (authored by vsellier).
cassandra: Remove the no longer used cassandra.yaml template
Aug 18 2022, 11:52 AM
vsellier committed rSPRE8791a2ec762d: cassandra: Cleanup extra mounts of the cassandra directories (authored by vsellier).
cassandra: Cleanup extra mounts of the cassandra directories
Aug 18 2022, 10:58 AM
vsellier closed D8236: cassandra: Refactor the former installation scripts.
Aug 18 2022, 10:48 AM
vsellier committed rSPSITE57d978d9e3de: cassandra: Refactor the former installation scripts (authored by vsellier).
cassandra: Refactor the former installation scripts
Aug 18 2022, 10:48 AM

Aug 17 2022

vsellier committed R260:8f12e7df4312: Bootstrap a swh chart only with the cassandra replayer to starting (authored by vsellier).
Bootstrap a swh chart only with the cassandra replayer to starting
Aug 17 2022, 7:20 PM
vsellier committed R259:c9e97cbea950: add swh-storage-replayer application (authored by vsellier).
add swh-storage-replayer application
Aug 17 2022, 3:37 PM
vsellier updated the diff for D8236: cassandra: Refactor the former installation scripts.

use inline_yaml instead of to_yaml
thanks for the hint, I completely forgot about it

Aug 17 2022, 2:27 PM

Aug 16 2022

vsellier committed rSKCONF0e4bc93720bd: fix the private data repository url (authored by vsellier).
fix the private data repository url
Aug 16 2022, 11:17 PM
vsellier committed rSKCONFa07bd71569b1: fix indentation (authored by vsellier).
fix indentation
Aug 16 2022, 11:14 PM
vsellier committed rSKCONF377f015e0d20: fix indentation (authored by vsellier).
fix indentation
Aug 16 2022, 11:13 PM
vsellier committed rSKCONF6f65eb6f12f0: Configure the argocd applications configuration (authored by vsellier).
Configure the argocd applications configuration
Aug 16 2022, 10:59 PM
vsellier committed rSKCONFff61fc33d2f4: Activate applications autosync (authored by vsellier).
Activate applications autosync
Aug 16 2022, 8:18 PM
vsellier committed rSKCONF59bd7205efc8: declare argocd own applications and prepare the cassandra cluster ones (authored by vsellier).
declare argocd own applications and prepare the cassandra cluster ones
Aug 16 2022, 8:15 PM
vsellier committed R261:15987cb50831: Declare the private repository that will contain the k8s clusters secrets (authored by vsellier).
Declare the private repository that will contain the k8s clusters secrets
Aug 16 2022, 4:53 PM
vsellier updated the diff for D8236: cassandra: Refactor the former installation scripts.
  • Rebase
  • Override the jmx port value via the JVM_EXTRA_OPTS environment
  • Inline the cassandra.yaml properties in hiera
Aug 16 2022, 4:03 PM
vsellier committed rSENVe2be4700fac9: Increase cassandra nodes memory (authored by vsellier).
Increase cassandra nodes memory
Aug 16 2022, 2:07 PM

Aug 12 2022

vsellier added a comment to D8236: cassandra: Refactor the former installation scripts.

Setting -Dcassandra.jmx.local.port / -Dcassandra.jmx.remote.port / -Dcom.sun.management.jmxremote.rmi.port in the $JVM_EXTRA_OPTS env var should override what this file configured

Aug 12 2022, 4:10 PM
vsellier added a comment to D8236: cassandra: Refactor the former installation scripts.

I specified why on the first line of the file, but I realize it not very visible.

Aug 12 2022, 3:46 PM
vsellier requested review of D8242: sysadmin: bootstrap the cassandra documentation.
Aug 12 2022, 3:45 PM
vsellier added a revision to T4373: [cassandra] Test the new hardware: D8242: sysadmin: bootstrap the cassandra documentation.
Aug 12 2022, 3:34 PM · Storage manager, System administration
vsellier renamed T4435: scrubber log verbosity puts a risk on ELK from scrubber log verbosity put a risk on ELK to scrubber log verbosity puts a risk on ELK .
Aug 12 2022, 12:53 PM · Datastore Scrubber, System administrators
vsellier triaged T4435: scrubber log verbosity puts a risk on ELK as High priority.
Aug 12 2022, 12:52 PM · Datastore Scrubber, System administrators
vsellier changed the status of T4391: [cassandra] deploy the replaying stack, a subtask of T4373: [cassandra] Test the new hardware, from Open to Work in Progress.
Aug 12 2022, 11:10 AM · Storage manager, System administration
vsellier changed the status of T4391: [cassandra] deploy the replaying stack from Open to Work in Progress.
Aug 12 2022, 11:10 AM · System administration
vsellier added a comment to T4373: [cassandra] Test the new hardware.

The puppet code is ready for review. It was updated to support multi instances deployment in anticipation of T4375.

Aug 12 2022, 11:09 AM · Storage manager, System administration
vsellier requested review of D8236: cassandra: Refactor the former installation scripts.
Aug 12 2022, 11:01 AM
vsellier added a revision to T4373: [cassandra] Test the new hardware: D8236: cassandra: Refactor the former installation scripts.
Aug 12 2022, 11:01 AM · Storage manager, System administration

Aug 11 2022

vsellier committed rSPSITEfbbc0eb13e49: vagrant: fix the internal_network declaration (authored by vsellier).
vagrant: fix the internal_network declaration
Aug 11 2022, 2:19 PM
vsellier committed rSENVaef11d699700: vagrant: Fix the initial environment synchronization (authored by vsellier).
vagrant: Fix the initial environment synchronization
Aug 11 2022, 12:11 PM

Aug 10 2022

vsellier committed rSPRE5fc8dc9b2626: cassandra: expose the log directory to the pods (authored by vsellier).
cassandra: expose the log directory to the pods
Aug 10 2022, 12:53 PM
vsellier added a comment to T4373: [cassandra] Test the new hardware.

For the record, the issues related to the commitlog_directory configuration:

Aug 10 2022, 11:07 AM · Storage manager, System administration
vsellier added a comment to T4373: [cassandra] Test the new hardware.

After spending some time to successfully start a cassandra cluster of 2 nodes with a declarative configuration), these are the observations:

  • A service can't be used to expose the cassandra ports to the clustrer, the pod address must be used. It's because cassandra use the dns name provided as listen address
  • It should work by setting the listen address to 0.0.0.0 but it's stongly recommanded to not use this in the documentation

Setting listen_address to 0.0.0.0 is always wrong.

  • Using internal pod address will avoid multi DC deployment for the future
Aug 10 2022, 10:30 AM · Storage manager, System administration
vsellier committed rSKCONF8e37b0f5df0c: cassandra: poc working manual declarative configuration (authored by vsellier).
cassandra: poc working manual declarative configuration
Aug 10 2022, 9:58 AM

Aug 9 2022

vsellier committed rSPRE73d451f1b506: cassandra: Remove ballooning for the control-plane nodes (authored by vsellier).
cassandra: Remove ballooning for the control-plane nodes
Aug 9 2022, 5:07 PM
vsellier committed R260:6908e14f5cf3: workers: try to stop argocd to override the keda status (authored by vsellier).
workers: try to stop argocd to override the keda status
Aug 9 2022, 2:24 PM
vsellier accepted D8209: docker: update the list of created topics.
Aug 9 2022, 11:09 AM

Aug 8 2022

vsellier closed T4414: Outboard sysadm intern Amadou Thiam as Resolved.
Aug 8 2022, 3:11 PM · System administration
vsellier added a comment to T4414: Outboard sysadm intern Amadou Thiam .

VPN certificate revoked in opensense

Aug 8 2022, 3:11 PM · System administration
vsellier updated the task description for T4414: Outboard sysadm intern Amadou Thiam .
Aug 8 2022, 3:11 PM · System administration
vsellier changed the status of T4414: Outboard sysadm intern Amadou Thiam from Open to Work in Progress.
Aug 8 2022, 3:06 PM · System administration
vsellier accepted D8212: docker: add a generic 'swh' command in storage and worker entrypoint scripts.

looks good, thanks.

Aug 8 2022, 1:53 PM
vsellier accepted D8213: docker: move all 'static' kafka config entries in env/kafka.env.

I'm not sure if it's not better to keep the KAFKA_BROKER_ID closed to the instance declaration as it needs to be unique in a cluster and can't be shared between the instances.
But otherwise, thank, it's simpler like that.

Aug 8 2022, 1:45 PM
vsellier added inline comments to D8209: docker: update the list of created topics.
Aug 8 2022, 1:34 PM
vsellier accepted D8210: docker: use named volumes for kafka, storage and objstorage.
Aug 8 2022, 1:27 PM
vsellier accepted D8211: docker: add iputils-ping package in the container image.
Aug 8 2022, 1:22 PM

Jul 13 2022

vsellier committed rSKCONFff5707a7c44b: Bootstrap manual cassandra configuration (authored by vsellier).
Bootstrap manual cassandra configuration
Jul 13 2022, 8:01 PM
vsellier committed rSKCONF1aeb6d54ed98: non working k8ssandra configuration (authored by vsellier).
non working k8ssandra configuration
Jul 13 2022, 8:01 PM
vsellier committed rSKCONF8a0f4d990de1: bootstrap the cassandra cluster (authored by vsellier).
bootstrap the cassandra cluster
Jul 13 2022, 8:01 PM
vsellier committed rSPRE944ad7507847: cassandra: update the cluster configuration to deploy cassadra manually (authored by vsellier).
cassandra: update the cluster configuration to deploy cassadra manually
Jul 13 2022, 11:12 AM
vsellier closed D8116: Deploy the cassandra operator on the production cassandra cluster.
Jul 13 2022, 11:12 AM
vsellier committed rSPRE16feb3dda32c: Deploy the cassandra operator on the production cassandra cluster (authored by vsellier).
Deploy the cassandra operator on the production cassandra cluster
Jul 13 2022, 11:12 AM
vsellier added a comment to T4373: [cassandra] Test the new hardware.

Unfortunately, the operator test is a failure due to the lack of configuration possibility

  • non blocker, the init containers are OOMkilled during the start, it can be solved by editing the cassandra statefulset created by the operator to extend the limits
  • blocker, it's not possible to configure the commitlog_directory explicitly. it's by default on /var/lib/cassandra/commitlog
    • it's not easy to propagate the host mounts to use 2 mountpoints /srv/cassandra and /srv/cassandra/commitlog without tweaking the kernel / rancher configuration
    • it's not possible to add a second volume on the pod description created by the operator
Jul 13 2022, 10:25 AM · Storage manager, System administration

Jul 12 2022

vsellier renamed T4391: [cassandra] deploy the replaying stack from [cassandra] deploy the replying stack to [cassandra] deploy the replaying stack.
Jul 12 2022, 3:29 PM · System administration
vsellier updated the diff for D8116: Deploy the cassandra operator on the production cassandra cluster.

declare the extra volume binding (no impact on the tfstate)

Jul 12 2022, 3:18 PM
vsellier added a revision to T4373: [cassandra] Test the new hardware: D8116: Deploy the cassandra operator on the production cassandra cluster.
Jul 12 2022, 3:01 PM · Storage manager, System administration
vsellier requested review of D8116: Deploy the cassandra operator on the production cassandra cluster.
Jul 12 2022, 3:01 PM
vsellier committed rSPREd081816a9ad3: cassandra - match node roles (authored by vsellier).
cassandra - match node roles
Jul 12 2022, 2:44 PM
vsellier committed rSPRE6381c215ea0c: cassandra - match kubernes version (authored by vsellier).
cassandra - match kubernes version
Jul 12 2022, 2:38 PM
vsellier committed rSKCONFf89d3a7a6bc4: Add the argo workflow poc configuration (authored by vsellier).
Add the argo workflow poc configuration
Jul 12 2022, 12:38 PM
vsellier triaged T4391: [cassandra] deploy the replaying stack as Normal priority.
Jul 12 2022, 12:13 PM · System administration
vsellier changed the status of T4374: [cassandra] Test basic topology, a subtask of T4379: [cassandra] create etcd / controlplane servers, from Open to Work in Progress.
Jul 12 2022, 12:10 PM · Storage manager, System administration
vsellier changed the status of T4374: [cassandra] Test basic topology from Open to Work in Progress.
Jul 12 2022, 12:10 PM · Storage manager, System administration
vsellier closed T4389: [cassandra] Configure the monitoring of the cluster as Resolved.

The mountpoint needs to be declare on the kubelet container to be reachable by the pods:

--- /tmp/cluster-orig.yaml	2022-07-12 11:27:27.169509573 +0200
+++ /tmp/cluster.yaml	2022-07-12 11:26:54.865395186 +0200
@@ -58,6 +58,8 @@
       service_node_port_range: 30000-32767
     kube-controller: {}
     kubelet:
+      extra_binds:
+        - '/srv/prometheus:/srv/prometheus'
       fail_swap_on: false
       generate_serving_certificate: false
     kubeproxy: {}
Jul 12 2022, 12:09 PM · System administration
vsellier closed T4389: [cassandra] Configure the monitoring of the cluster, a subtask of T4373: [cassandra] Test the new hardware, as Resolved.
Jul 12 2022, 12:09 PM · Storage manager, System administration
vsellier committed rSKCONF18f544855355: Bootstrap cassandra cluster configuration (authored by vsellier).
Bootstrap cassandra cluster configuration
Jul 12 2022, 11:56 AM
vsellier committed rSKCONF2d4826b67ca5: Specify the repository organization (authored by vsellier).
Specify the repository organization
Jul 12 2022, 11:47 AM
vsellier committed R261:8249ffa761b1: Add the kubernetes clusters configuration repository (authored by vsellier).
Add the kubernetes clusters configuration repository
Jul 12 2022, 11:43 AM

Jul 11 2022

vsellier closed D8094: Declare the kubernetes cluster and management nodes for cassandra.
Jul 11 2022, 10:02 PM
vsellier committed rSPREbd8e7bc4223d: Declare the kubernetes cluster and management nodes for cassandra (authored by vsellier).
Declare the kubernetes cluster and management nodes for cassandra
Jul 11 2022, 10:02 PM
vsellier updated the diff for D8094: Declare the kubernetes cluster and management nodes for cassandra.

add tfstate file

Jul 11 2022, 10:02 PM
vsellier added a comment to T4389: [cassandra] Configure the monitoring of the cluster.

Configure the data directory:

root@pergamon:~# clush -b -w @cassandra-mgmt hostname
---------------
rancher-node-cassandra1
---------------
rancher-node-cassandra1
---------------
rancher-node-cassandra2
---------------
rancher-node-cassandra2
---------------
rancher-node-cassandra3
---------------
rancher-node-cassandra3
Jul 11 2022, 5:01 PM · System administration
vsellier renamed T4389: [cassandra] Configure the monitoring of the cluster from Configure the monitoring of the cluster to [cassandra] Configure the monitoring of the cluster.
Jul 11 2022, 4:49 PM · System administration
vsellier moved T4389: [cassandra] Configure the monitoring of the cluster from Backlog to in-progress on the System administration board.
Jul 11 2022, 4:49 PM · System administration
vsellier changed the status of T4389: [cassandra] Configure the monitoring of the cluster from Open to Work in Progress.
Jul 11 2022, 4:48 PM · System administration
vsellier closed T4379: [cassandra] create etcd / controlplane servers as Resolved.

Finally, the cluster is up.
I'm not sure what unstuck the node registration, but I suspect a node with all the roles is needed to bootstrap the cluster.
I tried this initially, it didn't worked, but I'm not sure in which status the cluster was.

Jul 11 2022, 4:33 PM · Storage manager, System administration
vsellier closed T4379: [cassandra] create etcd / controlplane servers, a subtask of T4373: [cassandra] Test the new hardware, as Resolved.
Jul 11 2022, 4:33 PM · Storage manager, System administration
vsellier committed rSPSITE1ae1e5a7cf2f: Force to use pergamon as dns (authored by vsellier).
Force to use pergamon as dns
Jul 11 2022, 2:29 PM
vsellier committed rSENV85ef675e7a8e: update facts (authored by vsellier).
update facts
Jul 11 2022, 2:27 PM