Page MenuHomeSoftware Heritage
Feed Advanced Search

Aug 25 2022

vsellier closed D8311: Bootstrap a swh chart only with the cassandra replayer stack.
Aug 25 2022, 10:09 AM
vsellier committed R260:3b295de813f3: Bootstrap a swh chart only with the cassandra replayer stack (authored by vsellier).
Bootstrap a swh chart only with the cassandra replayer stack
Aug 25 2022, 10:09 AM
vsellier updated the diff for D8311: Bootstrap a swh chart only with the cassandra replayer stack.

Move the checksum computation inside the if

Aug 25 2022, 10:09 AM

Aug 24 2022

vsellier added a revision to T4391: [cassandra] deploy the replaying stack: D8311: Bootstrap a swh chart only with the cassandra replayer stack.
Aug 24 2022, 6:24 PM · System administration
vsellier requested review of D8311: Bootstrap a swh chart only with the cassandra replayer stack.
Aug 24 2022, 6:24 PM
vsellier added a comment to T4391: [cassandra] deploy the replaying stack.

Everything is almost ready to start the ingestion tests:

  • The deployment of the replayers is implemented
  • The monitoring can also be easily deployed in a kubernetes cluster having the prometheus operator installed [1]
  • the grafana dashboards are also available as templates [1]
Aug 24 2022, 6:10 PM · System administration
vsellier committed rSKCONF1fef5d96e81b: Add grafana dashboards dedicated to cassandra and replayers statistics (authored by vsellier).
Add grafana dashboards dedicated to cassandra and replayers statistics
Aug 24 2022, 5:41 PM
vsellier accepted D8301: staging: Unify vlan declaration.
Aug 24 2022, 10:42 AM
vsellier committed R260:b2b7600f7a1c: fix statsd_exporter image version (authored by vsellier).
fix statsd_exporter image version
Aug 24 2022, 10:12 AM
vsellier committed R260:8d11190a0259: Deploy a prometheus statsd exporter and declare the monitoring (authored by vsellier).
Deploy a prometheus statsd exporter and declare the monitoring
Aug 24 2022, 10:09 AM
vsellier committed R260:767f83f70b95: Add more cassandra servers as seed to dispatch the connections accross all the… (authored by vsellier).
Add more cassandra servers as seed to dispatch the connections accross all the…
Aug 24 2022, 9:40 AM

Aug 23 2022

vsellier committed rSKCONF99b55cad41a6: Declare the monitoring of external cassandra nodes (authored by vsellier).
Declare the monitoring of external cassandra nodes
Aug 23 2022, 5:19 PM

Aug 22 2022

vsellier triaged T4455: Upgrade elk stack to a more recent version as Normal priority.
Aug 22 2022, 3:58 PM · System administration
vsellier triaged T4454: Create a dashboard for the swh-scrubber metrics as High priority.
Aug 22 2022, 3:54 PM · Datastore Scrubber, System administrators
vsellier committed R260:647876a4824a: remove useless name overriding (authored by vsellier).
remove useless name overriding
Aug 22 2022, 3:25 PM
vsellier committed R260:97d8e298ba2f: fix affinity criteria quoting (authored by vsellier).
fix affinity criteria quoting
Aug 22 2022, 2:27 PM
vsellier committed R260:60afde8112bc: affine the resources requests (authored by vsellier).
affine the resources requests
Aug 22 2022, 2:25 PM
vsellier committed R260:b95e7a1f37f3: Allow to specify the replayers's pods affinity (authored by vsellier).
Allow to specify the replayers's pods affinity
Aug 22 2022, 2:25 PM
vsellier triaged T4449: Azure Billing site not refreshed as Normal priority.
Aug 22 2022, 10:06 AM · System administration

Aug 20 2022

vsellier committed R260:dc40c24b4bfc: reduce the deployment history (authored by vsellier).
reduce the deployment history
Aug 20 2022, 11:58 PM
vsellier committed R260:a400f447256d: Add snapshots and increase the directory parallelism (authored by vsellier).
Add snapshots and increase the directory parallelism
Aug 20 2022, 11:52 PM
vsellier committed R260:b0983dbf143c: Adjust memory limits according the real values (authored by vsellier).
Adjust memory limits according the real values
Aug 20 2022, 11:52 PM
vsellier committed R260:06d3ec8f15ea: fix memory spec (authored by vsellier).
fix memory spec
Aug 20 2022, 12:02 PM
vsellier committed R260:a4ee511e00cc: ingest more object type (authored by vsellier).
ingest more object type
Aug 20 2022, 11:55 AM
vsellier committed R260:96f4eeefd246: Fix syntax issues with topics containing a _ (authored by vsellier).
Fix syntax issues with topics containing a _
Aug 20 2022, 11:33 AM
vsellier committed R260:ebed5df4cf6f: Configure autoscaling and add other origin_* topics (authored by vsellier).
Configure autoscaling and add other origin_* topics
Aug 20 2022, 11:19 AM

Aug 19 2022

vsellier committed R260:8b0d38664259: fix cpu reservation unit (authored by vsellier).
fix cpu reservation unit
Aug 19 2022, 8:00 PM
vsellier committed R260:d4a5f53e3035: Declare a cpu and memory request (authored by vsellier).
Declare a cpu and memory request
Aug 19 2022, 7:53 PM
vsellier committed R260:94747dd38910: Use internal kafka addresses (authored by vsellier).
Use internal kafka addresses
Aug 19 2022, 7:32 PM
vsellier committed rSKCONFe19517bb86ea: Depploy keda on cassandra-production cluster (authored by vsellier).
Depploy keda on cassandra-production cluster
Aug 19 2022, 6:09 PM
vsellier committed rSKCONF4434c969017a: declare the cassandra production replaying stack (authored by vsellier).
declare the cassandra production replaying stack
Aug 19 2022, 5:38 PM
vsellier committed R260:d9552225c7c6: Remove debug logs (authored by vsellier).
Remove debug logs
Aug 19 2022, 5:36 PM
vsellier committed rSPSITE42405baf257b: fix kafka credential creation script (authored by vsellier).
fix kafka credential creation script
Aug 19 2022, 5:25 PM
vsellier committed R260:a432372b70d2: bump swh-storage-replayer image versions (authored by vsellier).
bump swh-storage-replayer image versions
Aug 19 2022, 4:42 PM
vsellier committed R259:a3571bce9a00: use apt clean instead of removing all the dpkg content (authored by vsellier).
use apt clean instead of removing all the dpkg content
Aug 19 2022, 4:39 PM
vsellier committed R260:fb1c89846639: fix the way to quote the server in the init command (authored by vsellier).
fix the way to quote the server in the init command
Aug 19 2022, 3:47 PM
vsellier committed R260:346b62b533a7: fix init database command (authored by vsellier).
fix init database command
Aug 19 2022, 3:44 PM
vsellier committed R260:ec11b483cc36: Add an init cassandra option (authored by vsellier).
Add an init cassandra option
Aug 19 2022, 3:38 PM
vsellier committed R260:e6c297f6dc0c: remove port from cassandra host (authored by vsellier).
remove port from cassandra host
Aug 19 2022, 3:13 PM
vsellier committed R260:ea308371d426: Improve indentation v4 (authored by vsellier).
Improve indentation v4
Aug 19 2022, 3:06 PM
vsellier committed R260:223af97f5202: Improve indentation v3 (authored by vsellier).
Improve indentation v3
Aug 19 2022, 3:03 PM
vsellier committed R260:9a9b91c2f364: Improve indentation (authored by vsellier).
Improve indentation
Aug 19 2022, 2:48 PM
vsellier committed R260:cd6349bca7e9: improve indentation (authored by vsellier).
improve indentation
Aug 19 2022, 2:42 PM
vsellier committed R260:31ce35142c3a: fix the configmap name (authored by vsellier).
fix the configmap name
Aug 19 2022, 12:32 PM
vsellier committed R260:eb40b1cbbc6c: add more default values for cassandra replayer (authored by vsellier).
add more default values for cassandra replayer
Aug 19 2022, 12:17 PM
vsellier committed rSKCONF6a8eab9de494: Configure the cassandra-production cluster (authored by vsellier).
Configure the cassandra-production cluster
Aug 19 2022, 12:05 PM
vsellier committed rSKCONF438c33c127f1: Declare the swh namespace for the production-cassandra cluster (authored by vsellier).
Declare the swh namespace for the production-cassandra cluster
Aug 19 2022, 11:55 AM
vsellier committed R260:5fae66df0c4f: specify the journal client user (authored by vsellier).
specify the journal client user
Aug 19 2022, 11:41 AM
vsellier committed R260:6b45ca0686bd: prepare production deployment (authored by vsellier).
prepare production deployment
Aug 19 2022, 11:24 AM
vsellier committed rSPSITE5d845aa0805a: cassandra: extend directory permission to allow monitoring to check the disks (authored by vsellier).
cassandra: extend directory permission to allow monitoring to check the disks
Aug 19 2022, 10:05 AM

Aug 18 2022

vsellier closed T4446: Prepare the disks and configure zfs as Resolved.

all server reconfigured and cassandra started on them:

/opt/cassandra/bin/nodetool status
Datacenter: sesi_rocquencourt
=============================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address          Load       Tokens  Owns (effective)  Host ID                               Rack 
UN  192.168.100.184  88.65 KiB  16      34.3%             e0c24d24-6f68-4a26-8561-94e67b58211a  rack1
UN  192.168.100.181  84.71 KiB  16      31.3%             1d9b9e7d-b376-4afe-8f67-482e8412f21b  rack1
UN  192.168.100.186  69.07 KiB  16      34.2%             0dd3426d-9159-47bd-9b4e-065ff0fbb889  rack1
UN  192.168.100.183  69.08 KiB  16      37.1%             78281a92-7fa0-43bd-bc33-c5b419ee8715  rack1
UN  192.168.100.185  69.07 KiB  16      32.2%             abf9b69e-3cec-4ac3-a195-a54481e4d9da  rack1
UN  192.168.100.182  74.05 KiB  16      30.9%             eca5ea5d-8bd5-4301-9a5e-ffa01aa1b7e5  rack1
Aug 18 2022, 7:00 PM · Storage manager, System administration
vsellier closed T4446: Prepare the disks and configure zfs, a subtask of T4373: [cassandra] Test the new hardware, as Resolved.
Aug 18 2022, 7:00 PM · Storage manager, System administration
vsellier added a comment to T4446: Prepare the disks and configure zfs.

Recreating the zpool correctly:

# mixedused
ls /dev/disk/by-id/nvme-MO003200KXAVU* | grep -v part | xargs -t zpool create -o ashift=12 -O mountpoint=none mixeduse
zfs create -o mountpoint=/srv/cassandra/instance1/data mixeduse/cassandra-instance1-data
Aug 18 2022, 6:16 PM · Storage manager, System administration
vsellier added a comment to T4446: Prepare the disks and configure zfs.

Testing the performances of the different configuration (on a zfs pool with only one disk):

  • disk block: 512k / zpool ashift:9
zpool create -o ashift=9 -O mountpoint=none mixeduse /dev/disk/by-id/nvme-MO003200KXAVU_SJA4N7938I0405A0U
zfs create -o mountpoint=/srv/cassandra/instance1/data -o atime=off -o relatime=on mixeduse/cassandra-data
cd /srv/cassandra/instance1/data
bonnie++ -d . -m cassandra04 -u nobody                                                                                                                                                                                                                                                 
Using uid:65534, gid:65534.
Writing a byte at a time...done
Writing intelligently...done
Rewriting...done
Reading a byte at a time...done
Reading intelligently...done
start 'em...done...done...done...done...done...
Create files in sequential order...done.
Stat files in sequential order...done.
Delete files in sequential order...done.
Create files in random order...done.
Stat files in random order...done.
Delete files in random order...done.
Version  2.00       ------Sequential Output------ --Sequential Input- --Random-
                    -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Name:Size etc        /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
cassandra04 515496M  293k  99  1.0g  99  703m  99  661k  99  1.4g  91 13717 463
Latency             48216us    7316us    8224us   23303us    7928us    1606us
Version  2.00       ------Sequential Create------ --------Random Create--------
cassandra04         -Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
              files  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
                 16 16384  98 +++++ +++ 16384   8 +++++ +++ +++++ +++ 16384  99
Latency              2679us    1207us    4851ms    2850us     138us     301us
1.98,2.00,cassandra04,1,1659338044,515496M,,8192,5,293,99,1080974,99,720299,99,661,99,1488832,91,13717,463,16,,,,,28232,98,+++++,+++,2018,8,+++++,+++,+++++,+++,24821,99,48216us,7316us,8224us,23303us,7928us,1606us,2679us,1207us,4851ms,2850us,138us,301us
Aug 18 2022, 5:49 PM · Storage manager, System administration
vsellier added a comment to T4446: Prepare the disks and configure zfs.

The nvme format command didn't succeed on the write intensive disk. It never exits and the disk become unresponsive after that.

Aug 18 2022, 5:15 PM · Storage manager, System administration
vsellier updated the task description for T4446: Prepare the disks and configure zfs.
Aug 18 2022, 4:47 PM · Storage manager, System administration
vsellier changed the status of T4446: Prepare the disks and configure zfs from Open to Work in Progress.
Aug 18 2022, 4:46 PM · Storage manager, System administration
vsellier committed rSPSITE115a30d19218: cassandra: Add missing dependencies (authored by vsellier).
cassandra: Add missing dependencies
Aug 18 2022, 2:26 PM
vsellier committed rSPSITEa6e31c14466c: cassandra: Remove the no longer used cassandra.yaml template (authored by vsellier).
cassandra: Remove the no longer used cassandra.yaml template
Aug 18 2022, 11:52 AM
vsellier committed rSPRE8791a2ec762d: cassandra: Cleanup extra mounts of the cassandra directories (authored by vsellier).
cassandra: Cleanup extra mounts of the cassandra directories
Aug 18 2022, 10:58 AM
vsellier closed D8236: cassandra: Refactor the former installation scripts.
Aug 18 2022, 10:48 AM
vsellier committed rSPSITE57d978d9e3de: cassandra: Refactor the former installation scripts (authored by vsellier).
cassandra: Refactor the former installation scripts
Aug 18 2022, 10:48 AM

Aug 17 2022

vsellier committed R260:8f12e7df4312: Bootstrap a swh chart only with the cassandra replayer to starting (authored by vsellier).
Bootstrap a swh chart only with the cassandra replayer to starting
Aug 17 2022, 7:20 PM
vsellier committed R259:c9e97cbea950: add swh-storage-replayer application (authored by vsellier).
add swh-storage-replayer application
Aug 17 2022, 3:37 PM
vsellier updated the diff for D8236: cassandra: Refactor the former installation scripts.

use inline_yaml instead of to_yaml
thanks for the hint, I completely forgot about it

Aug 17 2022, 2:27 PM

Aug 16 2022

vsellier committed rSKCONF0e4bc93720bd: fix the private data repository url (authored by vsellier).
fix the private data repository url
Aug 16 2022, 11:17 PM
vsellier committed rSKCONFa07bd71569b1: fix indentation (authored by vsellier).
fix indentation
Aug 16 2022, 11:14 PM
vsellier committed rSKCONF377f015e0d20: fix indentation (authored by vsellier).
fix indentation
Aug 16 2022, 11:13 PM
vsellier committed rSKCONF6f65eb6f12f0: Configure the argocd applications configuration (authored by vsellier).
Configure the argocd applications configuration
Aug 16 2022, 10:59 PM
vsellier committed rSKCONFff61fc33d2f4: Activate applications autosync (authored by vsellier).
Activate applications autosync
Aug 16 2022, 8:18 PM
vsellier committed rSKCONF59bd7205efc8: declare argocd own applications and prepare the cassandra cluster ones (authored by vsellier).
declare argocd own applications and prepare the cassandra cluster ones
Aug 16 2022, 8:15 PM
vsellier committed R261:15987cb50831: Declare the private repository that will contain the k8s clusters secrets (authored by vsellier).
Declare the private repository that will contain the k8s clusters secrets
Aug 16 2022, 4:53 PM
vsellier updated the diff for D8236: cassandra: Refactor the former installation scripts.
  • Rebase
  • Override the jmx port value via the JVM_EXTRA_OPTS environment
  • Inline the cassandra.yaml properties in hiera
Aug 16 2022, 4:03 PM
vsellier committed rSENVe2be4700fac9: Increase cassandra nodes memory (authored by vsellier).
Increase cassandra nodes memory
Aug 16 2022, 2:07 PM

Aug 12 2022

vsellier added a comment to D8236: cassandra: Refactor the former installation scripts.

Setting -Dcassandra.jmx.local.port / -Dcassandra.jmx.remote.port / -Dcom.sun.management.jmxremote.rmi.port in the $JVM_EXTRA_OPTS env var should override what this file configured

Aug 12 2022, 4:10 PM
vsellier added a comment to D8236: cassandra: Refactor the former installation scripts.

I specified why on the first line of the file, but I realize it not very visible.

Aug 12 2022, 3:46 PM
vsellier requested review of D8242: sysadmin: bootstrap the cassandra documentation.
Aug 12 2022, 3:45 PM
vsellier added a revision to T4373: [cassandra] Test the new hardware: D8242: sysadmin: bootstrap the cassandra documentation.
Aug 12 2022, 3:34 PM · Storage manager, System administration
vsellier renamed T4435: scrubber log verbosity puts a risk on ELK from scrubber log verbosity put a risk on ELK to scrubber log verbosity puts a risk on ELK .
Aug 12 2022, 12:53 PM · Datastore Scrubber, System administrators
vsellier triaged T4435: scrubber log verbosity puts a risk on ELK as High priority.
Aug 12 2022, 12:52 PM · Datastore Scrubber, System administrators
vsellier changed the status of T4391: [cassandra] deploy the replaying stack, a subtask of T4373: [cassandra] Test the new hardware, from Open to Work in Progress.
Aug 12 2022, 11:10 AM · Storage manager, System administration
vsellier changed the status of T4391: [cassandra] deploy the replaying stack from Open to Work in Progress.
Aug 12 2022, 11:10 AM · System administration
vsellier added a comment to T4373: [cassandra] Test the new hardware.

The puppet code is ready for review. It was updated to support multi instances deployment in anticipation of T4375.

Aug 12 2022, 11:09 AM · Storage manager, System administration
vsellier requested review of D8236: cassandra: Refactor the former installation scripts.
Aug 12 2022, 11:01 AM
vsellier added a revision to T4373: [cassandra] Test the new hardware: D8236: cassandra: Refactor the former installation scripts.
Aug 12 2022, 11:01 AM · Storage manager, System administration

Aug 11 2022

vsellier committed rSPSITEfbbc0eb13e49: vagrant: fix the internal_network declaration (authored by vsellier).
vagrant: fix the internal_network declaration
Aug 11 2022, 2:19 PM
vsellier committed rSENVaef11d699700: vagrant: Fix the initial environment synchronization (authored by vsellier).
vagrant: Fix the initial environment synchronization
Aug 11 2022, 12:11 PM

Aug 10 2022

vsellier committed rSPRE5fc8dc9b2626: cassandra: expose the log directory to the pods (authored by vsellier).
cassandra: expose the log directory to the pods
Aug 10 2022, 12:53 PM
vsellier added a comment to T4373: [cassandra] Test the new hardware.

For the record, the issues related to the commitlog_directory configuration:

Aug 10 2022, 11:07 AM · Storage manager, System administration
vsellier added a comment to T4373: [cassandra] Test the new hardware.

After spending some time to successfully start a cassandra cluster of 2 nodes with a declarative configuration), these are the observations:

  • A service can't be used to expose the cassandra ports to the clustrer, the pod address must be used. It's because cassandra use the dns name provided as listen address
  • It should work by setting the listen address to 0.0.0.0 but it's stongly recommanded to not use this in the documentation

Setting listen_address to 0.0.0.0 is always wrong.

  • Using internal pod address will avoid multi DC deployment for the future
Aug 10 2022, 10:30 AM · Storage manager, System administration
vsellier committed rSKCONF8e37b0f5df0c: cassandra: poc working manual declarative configuration (authored by vsellier).
cassandra: poc working manual declarative configuration
Aug 10 2022, 9:58 AM

Aug 9 2022

vsellier committed rSPRE73d451f1b506: cassandra: Remove ballooning for the control-plane nodes (authored by vsellier).
cassandra: Remove ballooning for the control-plane nodes
Aug 9 2022, 5:07 PM
vsellier committed R260:6908e14f5cf3: workers: try to stop argocd to override the keda status (authored by vsellier).
workers: try to stop argocd to override the keda status
Aug 9 2022, 2:24 PM
vsellier accepted D8209: docker: update the list of created topics.
Aug 9 2022, 11:09 AM

Aug 8 2022

vsellier closed T4414: Outboard sysadm intern Amadou Thiam as Resolved.
Aug 8 2022, 3:11 PM · System administration
vsellier added a comment to T4414: Outboard sysadm intern Amadou Thiam .

VPN certificate revoked in opensense

Aug 8 2022, 3:11 PM · System administration
vsellier updated the task description for T4414: Outboard sysadm intern Amadou Thiam .
Aug 8 2022, 3:11 PM · System administration
vsellier changed the status of T4414: Outboard sysadm intern Amadou Thiam from Open to Work in Progress.
Aug 8 2022, 3:06 PM · System administration
vsellier accepted D8212: docker: add a generic 'swh' command in storage and worker entrypoint scripts.

looks good, thanks.

Aug 8 2022, 1:53 PM
vsellier accepted D8213: docker: move all 'static' kafka config entries in env/kafka.env.

I'm not sure if it's not better to keep the KAFKA_BROKER_ID closed to the instance declaration as it needs to be unique in a cluster and can't be shared between the instances.
But otherwise, thank, it's simpler like that.

Aug 8 2022, 1:45 PM