
Nov 24 2020

vsellier committed rSPSITE473c2b76b3f1: Configure backports to be able to install zfs packages (authored by vsellier).
Configure backports to be able to install zfs packages
Nov 24 2020, 7:33 PM
vsellier closed D4580: Configure backports to be able to install zfs packages.
Nov 24 2020, 7:33 PM
vsellier added a comment to T2790: [staging] deploy the journal infrastructure.

The backfilling is done for several object types and is still in progress for revision, content and directory:

root@journal0:/opt/kafka/bin# for topic in $(./kafka-topics.sh --bootstrap-server $SERVER --list); do   echo -n "$topic : ";   ./kafka-run-class.sh kafka.tools.GetOffsetShell  --broker-list $SERVER --topic $topic | awk -F: '{s+=$3}END{print s}'; done
__consumer_offsets : 0
swh.journal.objects.content : 927440
swh.journal.objects.directory : 213279
swh.journal.objects.metadata_authority : 0
swh.journal.objects.metadata_fetcher : 0
swh.journal.objects.origin : 62892
swh.journal.objects.origin_visit : 68368
swh.journal.objects.origin_visit_status : 136721
swh.journal.objects.raw_extrinsic_metadata : 0
swh.journal.objects.release : 3101
swh.journal.objects.revision : 155746
swh.journal.objects.skipped_content : 189
swh.journal.objects.snapshot : 36046
swh.journal.objects_privileged.release : 0
swh.journal.objects_privileged.revision : 0
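
The awk one-liner above adds up the last field of every `topic:partition:offset` line. As a minimal sketch of the same aggregation in Python, assuming that three-field format (the per-partition sample values below are hypothetical; only the release total of 3101 comes from the output above):

```python
from collections import defaultdict


def sum_offsets(lines):
    """Sum the end offsets of every partition, grouped by topic.

    Each line is expected to look like ``topic:partition:offset``.
    """
    totals = defaultdict(int)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        # Split from the right so the topic name is kept in one piece.
        topic, _partition, offset = line.rsplit(":", 2)
        # An empty offset field is counted as 0, like awk would do.
        totals[topic] += int(offset or 0)
    return dict(totals)


# Hypothetical per-partition split, chosen to add up to the release total.
sample = [
    "swh.journal.objects.release:0:1500",
    "swh.journal.objects.release:1:1601",
    "swh.journal.objects.snapshot:0:36046",
]
print(sum_offsets(sample))
```

Grouping by topic in one pass avoids launching GetOffsetShell once per topic when the output of several topics is already at hand.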

I am not sure how to import the following object types, or whether they need to be imported at all:

  • swh.journal.objects.metadata_authority
  • swh.journal.objects.metadata_fetcher
  • swh.journal.objects.raw_extrinsic_metadata
  • swh.journal.objects_privileged.release
  • swh.journal.objects_privileged.revision
Nov 24 2020, 7:17 PM · System administration, Staging environment
vsellier added a comment to T2790: [staging] deploy the journal infrastructure.

The topics were created with 64 partitions and a replication factor of 1 :
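
The commands actually used are not reproduced in this comment. As a hedged sketch only, the following builds a `kafka-topics.sh --create` command line per topic with the partition count and replication factor mentioned above. The server address and the topic list are assumptions for illustration, and any extra topic settings from T2520#48682 are not covered:

```python
def topic_create_command(topic, server, partitions=64, replication_factor=1):
    """Return the argv list for creating one topic with kafka-topics.sh."""
    return [
        "./kafka-topics.sh",
        "--bootstrap-server", server,
        "--create",
        "--topic", topic,
        "--partitions", str(partitions),
        "--replication-factor", str(replication_factor),
    ]


# Assumed values: a local broker address and two of the journal topics.
server = "localhost:9092"
topics = ["swh.journal.objects.content", "swh.journal.objects.revision"]

commands = [topic_create_command(topic, server) for topic in topics]
for command in commands:
    print(" ".join(command))
```

The sketch only prints the command lines; passing each list to `subprocess.run` would execute them.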

Nov 24 2020, 4:51 PM · System administration, Staging environment
vsellier added a comment to T2790: [staging] deploy the journal infrastructure.
  • the VM memory was increased from 12G to 20G (a completely rule-of-thumb, "pifométrique" estimate)
  • a new 500 GB data disk is attached to the VM (there are currently 300G of objects on storage1.staging)
  • the kafka logdir was configured to be stored on a zfs volume composed of only the new data disk:
root@journal0:~# apt install zfs-dkms
## reboot
Nov 24 2020, 4:32 PM · System administration, Staging environment
vsellier added a revision to T2790: [staging] deploy the journal infrastructure: D4580: Configure backports to be able to install zfs packages.
Nov 24 2020, 4:27 PM · System administration, Staging environment
vsellier created D4580: Configure backports to be able to install zfs packages.
Nov 24 2020, 4:27 PM
vsellier added a comment to T2790: [staging] deploy the journal infrastructure.

Kafka is up and running on journal0.
The next steps are:

  • tune the server as there is not a lot of disk space (and memory but only if needed) :
root@journal0:~# df -h
Filesystem      Size  Used Avail Use% Mounted on
udev            5.9G     0  5.9G   0% /dev
tmpfs           1.2G  560K  1.2G   1% /run
/dev/vda1        32G  7.2G   24G  24% /
tmpfs           5.9G  8.0K  5.9G   1% /dev/shm
tmpfs           5.0M     0  5.0M   0% /run/lock
tmpfs           4.0M     0  4.0M   0% /sys/fs/cgroup
tmpfs           244M     0  244M   0% /run/user/1025
root@journal0:~# free -h
              total        used        free      shared  buff/cache   available
Mem:           11Gi       6.5Gi       354Mi        11Mi       4.8Gi       4.9Gi
Swap:            0B          0B          0B
  • Create the topics as explained in T2520#48682 (with a smaller number of partitions and a replication factor of 1, as we only have one staging server)
  • Launch the backfill to populate kafka with the current content of the staging archive
Nov 24 2020, 12:36 PM · System administration, Staging environment
vsellier committed rSPSITE13c87f140600: Declare the certificate for the journal0.staging's kafka (authored by vsellier).
Declare the certificate for the journal0.staging's kafka
Nov 24 2020, 12:25 PM
vsellier closed D4571: Declare the certificate for the journal0.staging's kafka.
Nov 24 2020, 12:25 PM
vsellier added a revision to T2790: [staging] deploy the journal infrastructure: D4571: Declare the certificate for the journal0.staging's kafka.
Nov 24 2020, 12:23 PM · System administration, Staging environment
vsellier created D4571: Declare the certificate for the journal0.staging's kafka.
Nov 24 2020, 12:23 PM
vsellier committed rSPSITE9074d87bd6dd: Configure the kafka server for staging (authored by vsellier).
Configure the kafka server for staging
Nov 24 2020, 12:07 PM
vsellier committed rSPSITEbf96179df42d: Ensure java is installed before trying to manipulate the java keystore (authored by vsellier).
Ensure java is installed before trying to manipulate the java keystore
Nov 24 2020, 12:07 PM
vsellier committed rSPSITEedf036f29835: Move pgbouncer password alias to the main configuration (authored by vsellier).
Move pgbouncer password alias to the main configuration
Nov 24 2020, 12:07 PM
vsellier closed D4560: Configure the kafka server for staging.
Nov 24 2020, 12:07 PM
vsellier closed D4565: Move pgbouncer password alias to the main configuration.
Nov 24 2020, 12:07 PM
vsellier committed rSPSITE7fa33c70aa99: Only store the sentry tokens in the credentials (authored by vsellier).
Only store the sentry tokens in the credentials
Nov 24 2020, 12:07 PM
vsellier closed D4563: Only store the sentry tokens in the credentials.
Nov 24 2020, 12:06 PM
vsellier committed rSPPRIVC7334928f0628: Add kafka::broker::password value (authored by vsellier).
Add kafka::broker::password value
Nov 24 2020, 11:57 AM
vsellier committed rSPPRIVC481fe7d73db8: Move pgbouncer alias to the main configuration (authored by vsellier).
Move pgbouncer alias to the main configuration
Nov 24 2020, 11:50 AM
vsellier closed D4564: Move pgbouncer alias to the main configuration.
Nov 24 2020, 11:50 AM
vsellier committed rSPPRIVC9cce52adf11e: Only store the sentry tokens in the credentials (authored by vsellier).
Only store the sentry tokens in the credentials
Nov 24 2020, 11:50 AM
vsellier closed D4562: Only store the sentry tokens in the credentials.
Nov 24 2020, 11:50 AM
vsellier added a comment to D4560: Configure the kafka server for staging.

Clarification: the tests were only done on the internal plaintext listener, without SASL authentication

Nov 24 2020, 11:47 AM
vsellier updated the diff for D4560: Configure the kafka server for staging.

Rename the zookeeper cluster to match the kafka cluster name

Nov 24 2020, 11:45 AM
vsellier committed rSENVf6eaeed656ef: vagrant: Increase memory to be able to run the services (authored by vsellier).
vagrant: Increase memory to be able to run the services
Nov 24 2020, 10:04 AM
vsellier added a comment to D4560: Configure the kafka server for staging.

The configuration seems to work pretty well in vagrant :

root@journal0:/opt/kafka/bin# ./kafka-topics.sh --bootstrap-server 10.168.130.70:9092 --list | xargs -n1 -i{} ./kafka-run-class.sh kafka.tools.GetOffsetShell  --broker-list 10.168.130.70:9092 --topic {} | grep -v consumer
swh.journal.objects.content:0:898
swh.journal.objects.directory:0:1172
swh.journal.objects.origin:0:26
swh.journal.objects.origin_visit:0:55
swh.journal.objects.origin_visit_status:0:2
swh.journal.objects.release:0:0
swh.journal.objects.revision:0:209
swh.journal.objects.snapshot:0:1

I let the worker create the topics so there is only one partition. It will need to be adapted for the staging environment

Nov 24 2020, 9:25 AM
vsellier updated the diff for D4560: Configure the kafka server for staging.

Fix wrong indentation on the storage service configuration

Nov 24 2020, 9:21 AM

Nov 23 2020

vsellier created D4565: Move pgbouncer password alias to the main configuration.
Nov 23 2020, 7:01 PM
vsellier created D4564: Move pgbouncer alias to the main configuration.
Nov 23 2020, 6:57 PM
vsellier updated the diff for D4563: Only store the sentry tokens in the credentials.

Move the sentry dsn declarations with their friends

Nov 23 2020, 6:49 PM
vsellier created D4563: Only store the sentry tokens in the credentials.
Nov 23 2020, 6:18 PM
vsellier created D4562: Only store the sentry tokens in the credentials.
Nov 23 2020, 6:12 PM
vsellier closed D4497: Manage the parent directories of the kafka logdirs.

closed by rSPSITEed253c86d4f6

Nov 23 2020, 5:30 PM
vsellier updated the diff for D4497: Manage the parent directories of the kafka logdirs.

rebase

Nov 23 2020, 5:29 PM
vsellier committed rSPSITEed253c86d4f6: Manage the parent directories of the kafka logdirs (authored by vsellier).
Manage the parent directories of the kafka logdirs
Nov 23 2020, 5:28 PM
vsellier updated the test plan for D4497: Manage the parent directories of the kafka logdirs.
Nov 23 2020, 5:27 PM
vsellier added a revision to T2790: [staging] deploy the journal infrastructure: D4560: Configure the kafka server for staging.
Nov 23 2020, 4:07 PM · System administration, Staging environment
vsellier planned changes to D4560: Configure the kafka server for staging.
Nov 23 2020, 4:07 PM
vsellier created D4560: Configure the kafka server for staging.
Nov 23 2020, 4:07 PM
vsellier committed rSPSITEc1ed9b262594: vagrant: fix es nodes fqdn (authored by vsellier).
vagrant: fix es nodes fqdn
Nov 23 2020, 3:57 PM
vsellier committed rSPSITEf58828fd5f86: vagrant: Fix network range allowed to connect to postgresql (authored by vsellier).
vagrant: Fix network range allowed to connect to postgresql
Nov 23 2020, 3:57 PM
vsellier requested review of D4497: Manage the parent directories of the kafka logdirs.
Nov 23 2020, 3:46 PM
vsellier updated the diff for D4497: Manage the parent directories of the kafka logdirs.

Avoid the declaration of the parent directories by using mkdir -p

Nov 23 2020, 3:46 PM
vsellier committed rSENV69bbcb118d85: Add staging-storage1 server (authored by vsellier).
Add staging-storage1 server
Nov 23 2020, 3:38 PM
vsellier committed rSENV6710a971ac87: Fix staging-worker0 hostname to match the declaration in site.pp (authored by vsellier).
Fix staging-worker0 hostname to match the declaration in site.pp
Nov 23 2020, 3:38 PM
vsellier committed rSENV0ef211987c01: Add journal0 certificates (used by kafka) (authored by vsellier).
Add journal0 certificates (used by kafka)
Nov 23 2020, 3:38 PM

Nov 20 2020

vsellier committed rSENVd68d7e7b80e7: vagrant: add staging-storage0 node (authored by vsellier).
vagrant: add staging-storage0 node
Nov 20 2020, 7:39 PM
vsellier committed rSPSITEa5716714a1c5: Kafka needs a jre to run (authored by vsellier).
Kafka needs a jre to run
Nov 20 2020, 5:10 PM
vsellier closed D4534: Kafka needs a jre to run.
Nov 20 2020, 5:10 PM
vsellier updated the diff for D4534: Kafka needs a jre to run.

rebase

Nov 20 2020, 5:10 PM
vsellier added a comment to T2796: 2020-11-18 Datacenter operations in Rocquencourt.

All the loaders were restarted on all the workers:

sudo clush -b -w @swh-workers 'apt-get update && apt-get -y upgrade -V'
sudo clush -b -w @swh-workers 'puppet agent --enable && puppet agent --test'
sudo clush -b -w @swh-workers 'systemctl default'
Nov 20 2020, 3:35 PM · System administration
vsellier added a comment to T2796: 2020-11-18 Datacenter operations in Rocquencourt.

Automatic tasks restarted on worker01, the logs are under watch.

Nov 20 2020, 2:39 PM · System administration
vsellier added a comment to T2796: 2020-11-18 Datacenter operations in Rocquencourt.

after upgrading the packages on worker01, the npm load was successful :

swhworker@worker01:~$ time SWH_CONFIG_FILENAME=/etc/softwareheritage/loader_npm.yml swh loader run npm https://www.npmjs.com/package/bootstrap-vue
{'status': 'eventful', 'snapshot_id': '30d32aff7fab1a2c364dc5c61503b0aec3f9fb11'}
Nov 20 2020, 1:05 PM · System administration
vsellier added a comment to T2796: 2020-11-18 Datacenter operations in Rocquencourt.

The problem is not reproduced in staging but the worker and storage have the same package versions:

vsellier@worker0 ~ % apt list --upgradable
Listing... Done
python3-swh.deposit.client/unknown 0.6.0-1~swh1~bpo10+1 all [upgradable from: 0.5.0-1~swh1~bpo10+1]
python3-swh.deposit.loader/unknown 0.6.0-1~swh1~bpo10+1 all [upgradable from: 0.5.0-1~swh1~bpo10+1]
python3-swh.deposit/unknown 0.6.0-1~swh1~bpo10+1 all [upgradable from: 0.5.0-1~swh1~bpo10+1]
python3-swh.indexer.storage/unknown 0.5.0-2~swh1~bpo10+1 all [upgradable from: 0.4.2-1~swh1~bpo10+1]
python3-swh.indexer/unknown 0.5.0-2~swh1~bpo10+1 all [upgradable from: 0.4.2-1~swh1~bpo10+1]
python3-swh.journal/unknown 0.5.1-1~swh1~bpo10+1 all [upgradable from: 0.5.0-1~swh1~bpo10+1]
python3-swh.loader.git/unknown 0.5.0-1~swh1~bpo10+1 all [upgradable from: 0.4.1-1~swh1~bpo10+1]
python3-swh.model/unknown 0.9.0-1~swh1~bpo10+1 all [upgradable from: 0.7.3-1~swh1~bpo10+1]
python3-swh.storage/unknown 0.17.2-1~swh1~bpo10+1 all [upgradable from: 0.17.0-1~swh1~bpo10+1]
python3-swh.vault/unknown 0.3.3-1~swh1~bpo10+1 all [upgradable from: 0.3.1-1~swh1~bpo10+1]
vsellier@storage1 ~ % apt list --upgradable
Listing... Done
libpq5/buster-pgdg 13.1-1.pgdg100+1 amd64 [upgradable from: 13.0-1.pgdg100+1]
postgresql-13/buster-pgdg 13.1-1.pgdg100+1 amd64 [upgradable from: 13.0-1.pgdg100+1]
postgresql-client-13/buster-pgdg 13.1-1.pgdg100+1 amd64 [upgradable from: 13.0-1.pgdg100+1]
postgresql-client-common/buster-pgdg 223.pgdg100+1 all [upgradable from: 220.pgdg100+1]
postgresql-client/buster-pgdg 13+223.pgdg100+1 all [upgradable from: 13+220.pgdg100+1]
postgresql-common/buster-pgdg 223.pgdg100+1 all [upgradable from: 220.pgdg100+1]
postgresql/buster-pgdg 13+223.pgdg100+1 all [upgradable from: 13+220.pgdg100+1]
python3-swh.indexer.storage/unknown 0.5.0-2~swh1~bpo10+1 all [upgradable from: 0.4.2-1~swh1~bpo10+1]
python3-swh.indexer/unknown 0.5.0-2~swh1~bpo10+1 all [upgradable from: 0.4.2-1~swh1~bpo10+1]
python3-swh.journal/unknown 0.5.1-1~swh1~bpo10+1 all [upgradable from: 0.5.0-1~swh1~bpo10+1]
python3-swh.model/unknown 0.9.0-1~swh1~bpo10+1 all [upgradable from: 0.7.3-1~swh1~bpo10+1]
python3-swh.storage/unknown 0.17.2-1~swh1~bpo10+1 all [upgradable from: 0.17.0-1~swh1~bpo10+1]
  • After upgrading storage1.staging, the exact same problem appears there too
  • After upgrading the worker, everything goes well.
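
To compare pending upgrades between two hosts, the `apt list --upgradable` output above can be parsed into a mapping. This is a minimal sketch assuming the line format shown in this comment; the function name is hypothetical and the sample is a shortened copy of the listings above:

```python
import re

# name/suite new_version arch [upgradable from: old_version]
LINE_RE = re.compile(
    r"^(?P<name>[^/\s]+)/\S+ (?P<new>\S+) \S+ \[upgradable from: (?P<old>\S+)\]$"
)


def parse_upgradable(output):
    """Map each package name to its (installed, candidate) versions."""
    pending = {}
    for line in output.splitlines():
        match = LINE_RE.match(line.strip())
        if match:  # header lines such as "Listing... Done" are skipped
            pending[match["name"]] = (match["old"], match["new"])
    return pending


worker = parse_upgradable("""Listing... Done
python3-swh.loader.git/unknown 0.5.0-1~swh1~bpo10+1 all [upgradable from: 0.4.1-1~swh1~bpo10+1]
python3-swh.model/unknown 0.9.0-1~swh1~bpo10+1 all [upgradable from: 0.7.3-1~swh1~bpo10+1]
""")
storage = parse_upgradable("""Listing... Done
libpq5/buster-pgdg 13.1-1.pgdg100+1 amd64 [upgradable from: 13.0-1.pgdg100+1]
python3-swh.model/unknown 0.9.0-1~swh1~bpo10+1 all [upgradable from: 0.7.3-1~swh1~bpo10+1]
""")

# Packages pending on both hosts, i.e. still at the same old version.
shared = sorted(set(worker) & set(storage))
print(shared)
```

A package that is pending on one host but absent from the other's list is a candidate for a version mismatch between the two.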
Nov 20 2020, 12:58 PM · System administration
vsellier added a comment to T2796: 2020-11-18 Datacenter operations in Rocquencourt.
  • puppet applied on worker01
  • task-by-task tests:
    • mercurial :
swhworker@worker01:~$ SWH_CONFIG_FILENAME=/etc/softwareheritage/loader_mercurial.yml swh loader run mercurial https://foss.heptapod.net/fluiddyn/fluidfft
INFO:swh.loader.mercurial.Bundle20Loader:Load origin 'https://foss.heptapod.net/fluiddyn/fluidfft' with type 'hg'
{'status': 'eventful'}
swhworker@worker01:~$ SWH_CONFIG_FILENAME=/etc/softwareheritage/loader_mercurial.yml swh loader run mercurial https://hg.mozilla.org/projects/nss
INFO:swh.loader.mercurial.Bundle20Loader:Load origin 'https://hg.mozilla.org/projects/nss' with type 'hg'
WARNING:swh.loader.mercurial.Bundle20Loader:No matching revision for tag NSS_3_15_5_BETA2 (hg changeset: e5d3ec1d9a35f7cac554543d52775092de9f6a01). Skipping
WARNING:swh.loader.mercurial.Bundle20Loader:No matching revision for tag NSS_3_15_5_BETA2 (hg changeset: 0000000000000000000000000000000000000000). Skipping
WARNING:swh.loader.mercurial.Bundle20Loader:No matching revision for tag NSS_3_18_RTM (hg changeset: 0000000000000000000000000000000000000000). Skipping
WARNING:swh.loader.mercurial.Bundle20Loader:No matching revision for tag NSS_3_18_RTM (hg changeset: 0000000000000000000000000000000000000000). Skipping
WARNING:swh.loader.mercurial.Bundle20Loader:No matching revision for tag NSS_3_24_BETA3 (hg changeset: 0000000000000000000000000000000000000000). Skipping
{'status': 'eventful'}
  • svn
root@worker01:~# SWH_CONFIG_FILENAME=/etc/softwareheritage/loader_svn.yml swh loader run svn svn://svn.appwork.org/utils
INFO:swh.loader.svn.SvnLoader:Load origin 'svn://svn.appwork.org/utils' with type 'svn'
INFO:swh.loader.svn.SvnLoader:Processing revisions [3428-3436] for {'swh-origin': 'svn://svn.appwork.org/utils', 'remote_url': 'svn://svn.appwork.org/utils', 'local_url': b'/tmp/swh.loader.svn.dojsubkd-890577/utils', 'uuid': b'21714237-3853-44ef-a1f0-ef8f03a7d1fe'}
{'status': 'eventful'}
  • npm:

KO (failed): https://sentry.softwareheritage.org/share/issue/363ef9d218ac4817a992b7dc9bf283a6/

root@worker01:~# SWH_CONFIG_FILENAME=/etc/softwareheritage/loader_npm.yml swh loader run npm https://www.npmjs.com/package/bootstrap-vue
WARNING:swh.storage.retry:Retry adding a batch
WARNING:swh.storage.retry:Retry adding a batch
WARNING:swh.storage.retry:Retry adding a batch
ERROR:swh.loader.package.loader:Failed loading branch releases/2.18.0 for https://www.npmjs.com/package/bootstrap-vue
Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/tenacity/__init__.py", line 333, in call
    result = fn(*args, **kwargs)
  File "/usr/lib/python3/dist-packages/swh/storage/retry.py", line 117, in raw_extrinsic_metadata_add
    return self.storage.raw_extrinsic_metadata_add(metadata)
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 181, in meth_
    return self.post(meth._endpoint_path, post_data)
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 278, in post
    return self._decode_response(response)
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 352, in _decode_response
    self.raise_for_status(response)
  File "/usr/lib/python3/dist-packages/swh/storage/api/client.py", line 29, in raise_for_status
    super().raise_for_status(response)
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 342, in raise_for_status
    raise exception from None
swh.core.api.RemoteException: <RemoteException 500 TypeError: ["__init__() got an unexpected keyword argument 'id'"]>
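
This kind of TypeError typically shows up when the two ends of an RPC call disagree on the fields of a model class, which would be consistent with the package version differences reported in this task. The sketch below only illustrates the mechanism; the class and field names are hypothetical and this is not the swh.model code:

```python
from dataclasses import dataclass


@dataclass
class ReceiverModel:
    """Hypothetical model as known by the receiving side (no ``id`` field)."""

    target: str
    discovery_date: str


def decode(payload):
    # The receiver rebuilds the object from the keyword arguments it was sent.
    return ReceiverModel(**payload)


# A sender running another version still serializes an ``id`` key.
payload = {
    "id": "example-id",
    "target": "example-target",
    "discovery_date": "2020-11-20",
}

try:
    decode(payload)
    error = None
except TypeError as exc:
    error = str(exc)

print(error)
```

Aligning the package versions on both sides removes the unknown keyword, which matches the observation that the load works once the worker is upgraded.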
Nov 20 2020, 12:39 PM · System administration
vsellier created P873 error during npm loader.
Nov 20 2020, 12:33 PM
vsellier added a comment to P870 loader npm.

root@worker01:~# SWH_CONFIG_FILENAME=/etc/softwareheritage/loader_npm.yml swh loader run npm https://www.npmjs.com/package/vue
ERROR:swh.loader.package.loader:Failed loading branch releases/0.8.6 for https://www.npmjs.com/package/vue
Traceback (most recent call last):

File "/usr/lib/python3/dist-packages/swh/loader/package/loader.py", line 424, in load
  res = self._load_revision(p_info, origin)
File "/usr/lib/python3/dist-packages/swh/loader/package/loader.py", line 539, in _load_revision
  dl_artifacts = self.download_package(p_info, tmpdir)
File "/usr/lib/python3/dist-packages/swh/loader/package/loader.py", line 277, in download_package
  return [download(p_info.url, dest=tmpdir, filename=p_info.filename)]
File "/usr/lib/python3/dist-packages/swh/loader/package/utils.py", line 80, in download
  raise ValueError("Fail to query '%s'. Reason: %s" % (url, response.status_code))

ValueError: Fail to query 'https://registry.npmjs.org/vue/-/vue-0.8.6.tgz'. Reason: 404

WARNING:swh.storage.retry:Retry adding a batch
WARNING:swh.storage.retry:Retry adding a batch
WARNING:swh.storage.retry:Retry adding a batch
ERROR:swh.loader.package.loader:Failed loading branch releases/3.0.1 for https://www.npmjs.com/package/vue
Traceback (most recent call last):
  File "/usr/lib/python3/dist-packages/tenacity/__init__.py", line 333, in call
    result = fn(*args, **kwargs)
  File "/usr/lib/python3/dist-packages/swh/storage/retry.py", line 117, in raw_extrinsic_metadata_add
    return self.storage.raw_extrinsic_metadata_add(metadata)
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 181, in meth_
    return self.post(meth._endpoint_path, post_data)
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 278, in post
    return self._decode_response(response)
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 352, in _decode_response
    self.raise_for_status(response)
  File "/usr/lib/python3/dist-packages/swh/storage/api/client.py", line 29, in raise_for_status
    super().raise_for_status(response)
  File "/usr/lib/python3/dist-packages/swh/core/api/__init__.py", line 342, in raise_for_status
    raise exception from None
swh.core.api.RemoteException: <RemoteException 500 TypeError: ["__init__() got an unexpected keyword argument 'id'"]>
Nov 20 2020, 12:19 PM
vsellier created P870 loader npm.
Nov 20 2020, 12:09 PM
vsellier added a comment to T2796: 2020-11-18 Datacenter operations in Rocquencourt.

after applying the D4359 change on saam, the load is ok :

root@worker01:/etc/softwareheritage# sudo -u swhworker SWH_CONFIG_FILENAME=/etc/softwareheritage/loader_git.yml swh loader run git https://github.com/SoftwareHeritage/puppet-swh-site
INFO:swh.loader.git.BulkLoader:Load origin 'https://github.com/SoftwareHeritage/puppet-swh-site' with type 'git'
Enumerating objects: 537, done.
Counting objects: 100% (537/537), done.
Compressing objects: 100% (326/326), done.
Total 19066 (delta 260), reused 445 (delta 194), pack-reused 18529
INFO:swh.loader.git.BulkLoader:Listed 3 refs for repo https://github.com/SoftwareHeritage/puppet-swh-site
{'status': 'eventful'}
Nov 20 2020, 10:47 AM · System administration
vsellier committed rSPSITE966b1d2eabd9: Add the mandatory cloud storage profile to bare metal storage servers (authored by vsellier).
Add the mandatory cloud storage profile to bare metal storage servers
Nov 20 2020, 10:30 AM
vsellier closed D4539: Add mandatory cloud storage configuration on bare metal storage servers.
Nov 20 2020, 10:30 AM
vsellier updated the diff for D4539: Add mandatory cloud storage configuration on bare metal storage servers.

update the commit message

Nov 20 2020, 10:28 AM
vsellier created D4539: Add mandatory cloud storage configuration on bare metal storage servers.
Nov 20 2020, 10:24 AM
vsellier added a revision to T2796: 2020-11-18 Datacenter operations in Rocquencourt: D4539: Add mandatory cloud storage configuration on bare metal storage servers.
Nov 20 2020, 10:24 AM · System administration
vsellier committed rSENV1e0ad13a1dba: Update octocatalog-diff facts (authored by vsellier).
Update octocatalog-diff facts
Nov 20 2020, 10:22 AM
vsellier committed rSENVd1aef9871d78: vagrant: Add staging-journal0 host (authored by vsellier).
vagrant: Add staging-journal0 host
Nov 20 2020, 10:22 AM
vsellier closed D4496: vagrant: Add staging-journal0 host.
Nov 20 2020, 10:22 AM
vsellier added a comment to T2796: 2020-11-18 Datacenter operations in Rocquencourt.
  • The configuration was applied on moma
  • a manual import was performed on worker01 :
    • the /etc/softwareheritage/loader_git.yml config was updated:
root@worker01:/etc/softwareheritage# diff -U3 /tmp/loader_git.yml loader_git.yml 
--- /tmp/loader_git.yml	2020-11-20 08:43:18.682462213 +0000
+++ loader_git.yml	2020-11-20 08:44:00.150375756 +0000
@@ -13,7 +13,7 @@
   - cls: filter
   - cls: remote
     args:
-      url: http://uffizi.internal.softwareheritage.org:5002/
+      url: http://saam.internal.softwareheritage.org:5002/
 max_content_size: 104857600
 save_data: false
 save_data_path: "/srv/storage/space/data/sharded_packfiles"
  • the import was run on the puppet-swh-site repository:
root@worker01:/etc/softwareheritage# sudo -u swhworker SWH_CONFIG_FILENAME=/etc/softwareheritage/loader_git.yml swh loader run git https://github.com/SoftwareHeritage/puppet-swh-site

The first attempt raised this exception:

swh.core.api.RemoteException: <RemoteException 500 ValueError: ["Storage class azure-prefixed is not available: No module named 'swh.objstorage.backends.azure'"]>
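
The message above reads like a backend class that is resolved by name and imported on demand, with the import failing on the storage server. Here is a minimal sketch of that pattern, assuming a name-to-module registry; the registry, module paths and function name are hypothetical and this is not the swh.objstorage implementation:

```python
import importlib

# Hypothetical registry: storage class name -> "module:attribute".
BACKENDS = {
    "memory": "collections:OrderedDict",
    "azure-prefixed": "hypothetical_backends.azure:PrefixedAzureStorage",
}


def load_backend(cls_name):
    """Import the module behind a storage class name and return the class."""
    module_path, attribute = BACKENDS[cls_name].split(":")
    try:
        module = importlib.import_module(module_path)
    except ImportError as exc:
        # Turn the import failure into a configuration error.
        raise ValueError(
            f"Storage class {cls_name} is not available: {exc}"
        ) from exc
    return getattr(module, attribute)


try:
    load_backend("azure-prefixed")
    message = None
except ValueError as exc:
    message = str(exc)

print(message)
```

With such lazy imports, a missing optional dependency only surfaces when a configuration first references the backend, which is why the error appears at load time rather than at service startup.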
Nov 20 2020, 9:48 AM · System administration
vsellier closed D4516: Move archive storage to a new server.
Nov 20 2020, 9:24 AM
vsellier committed rSPSITE2830795f6f7a: Move archive storage to a new server (authored by vsellier).
Move archive storage to a new server
Nov 20 2020, 9:24 AM
vsellier updated the diff for D4516: Move archive storage to a new server.

rebase

Nov 20 2020, 9:24 AM
vsellier updated the diff for D4516: Move archive storage to a new server.

rebase

Nov 20 2020, 9:21 AM
vsellier added a comment to D4534: Kafka needs a jre to run.
In D4534#113059, @olasd wrote:

We use the puppet java module in a bunch of other places, maybe it makes sense to directly import that (which would mean using include ::java)?

Nov 20 2020, 9:10 AM
vsellier updated the test plan for D4534: Kafka needs a jre to run.
Nov 20 2020, 9:09 AM
vsellier updated the diff for D4534: Kafka needs a jre to run.

Use ::java instead of directly installing the jre package

Nov 20 2020, 9:08 AM

Nov 19 2020

vsellier accepted D4531: profile::mountpoints: only create directories if the mountpoint is enabled.
Nov 19 2020, 4:08 PM
vsellier accepted D4528: Carry over uffizi local storage/objstorage configs to saam.
Nov 19 2020, 4:05 PM
vsellier accepted D4532: Add mountpoints for saam.

LGTM

Nov 19 2020, 3:55 PM
vsellier added inline comments to D4531: profile::mountpoints: only create directories if the mountpoint is enabled.
Nov 19 2020, 3:55 PM
vsellier added a revision to T2790: [staging] deploy the journal infrastructure: D4534: Kafka needs a jre to run.
Nov 19 2020, 3:27 PM · System administration, Staging environment
vsellier created D4534: Kafka needs a jre to run.
Nov 19 2020, 3:27 PM
vsellier added a comment to D4497: Manage the parent directories of the kafka logdirs.

I will not land this now: there seems to be another issue with the startup of kafka when the logdir already exists but is empty (i.e. created by puppet). I need to dig further

Nov 19 2020, 2:36 PM
vsellier accepted D4523: Multipath setup for saam.

The systemd configuration looks good.

Nov 19 2020, 2:11 PM
vsellier added a comment to D4516: Move archive storage to a new server.

Looks fine to me except for the vagrant bit.

thanks, it's fixed

Nov 19 2020, 9:52 AM
vsellier updated the diff for D4516: Move archive storage to a new server.

fix misdirected removal

Nov 19 2020, 9:50 AM
vsellier added a comment to D4517: Run 'docker-compose up' in the background instead of detached, to show logs..

up -d waits until the containers are "started" before returning control, so you can be sure the execs from line 169 onwards can be executed.
With a background run, you will also lose the return code of the docker-compose up command
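
The return code point can be illustrated with a small sketch. It uses a stand-in command that exits with status 3 instead of docker-compose, so it only shows the general difference between a blocking call and a background process:

```python
import subprocess
import sys

# Stand-in for a failing command (not docker-compose itself).
failing = [sys.executable, "-c", "import sys; sys.exit(3)"]

# Foreground: the call blocks and the exit status is known on return.
foreground = subprocess.run(failing)
print(foreground.returncode)

# Background: Popen returns immediately and the status is not collected yet.
background = subprocess.Popen(failing)
status_before_wait = background.returncode  # None until poll() or wait()

# The status only becomes available once the process is waited for.
status_after_wait = background.wait()
print(status_before_wait, status_after_wait)
```

A script that backgrounds the command and never waits for it cannot tell a failed start from a successful one.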

Nov 19 2020, 9:42 AM

Nov 18 2020

vsellier created D4516: Move archive storage to a new server.
Nov 18 2020, 9:32 PM
vsellier committed rSPSITEdec921518d0b: declare the new proxmox hypervisor (authored by vsellier).
declare the new proxmox hypervisor
Nov 18 2020, 4:16 PM
vsellier committed rSPSITE61c61c95a68e: Declare new storage server (authored by vsellier).
Declare new storage server
Nov 18 2020, 2:50 PM

Nov 17 2020

vsellier added a revision to T2790: [staging] deploy the journal infrastructure: D4497: Manage the parent directories of the kafka logdirs.
Nov 17 2020, 6:20 PM · System administration, Staging environment
vsellier created D4497: Manage the parent directories of the kafka logdirs.
Nov 17 2020, 6:20 PM
vsellier added a revision to T2790: [staging] deploy the journal infrastructure: D4496: vagrant: Add staging-journal0 host.
Nov 17 2020, 5:36 PM · System administration, Staging environment
vsellier created D4496: vagrant: Add staging-journal0 host.
Nov 17 2020, 5:36 PM
vsellier added a comment to T2790: [staging] deploy the journal infrastructure.

Correction: kafka is installed on the node, but it seems the configuration is not complete

Nov 17 2020, 5:30 PM · System administration, Staging environment
vsellier changed the status of T2790: [staging] deploy the journal infrastructure, a subtask of T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage), from Open to Work in Progress.
Nov 17 2020, 3:09 PM · Staging environment, System administration
vsellier changed the status of T2790: [staging] deploy the journal infrastructure from Open to Work in Progress.
Nov 17 2020, 3:09 PM · System administration, Staging environment
vsellier added a project to T2790: [staging] deploy the journal infrastructure: System administration.
Nov 17 2020, 2:53 PM · System administration, Staging environment
vsellier added a project to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage): Staging environment.
Nov 17 2020, 2:53 PM · Staging environment, System administration