
Apr 21 2021

ardumont moved T2770: Fix all icinga checks on staging webapp from deployed/landed/monitoring to done on the System administration board.
Apr 21 2021, 6:57 PM · Monitoring, System administration, Staging environment

Apr 20 2021

vsellier added a comment to T3243: Replace /dev/sdb and /dev/sdc on storage1.staging.

The 2 disks were removed from the server and packaged to be sent to seagate.

Apr 20 2021, 5:32 PM · System administration, Staging environment

Apr 16 2021

vsellier moved T3243: Replace /dev/sdb and /dev/sdc on storage1.staging from Backlog to in-progress on the System administration board.
Apr 16 2021, 10:12 AM · System administration, Staging environment

Apr 15 2021

vsellier added a comment to T3243: Replace /dev/sdb and /dev/sdc on storage1.staging.

Email sent to the dsi to launch the replacement.

Apr 15 2021, 3:03 PM · System administration, Staging environment
vsellier added a comment to T3243: Replace /dev/sdb and /dev/sdc on storage1.staging.

In preparation for the disk replacement, the LEDs of the disks must be activated so that their slots can be identified:

  • Ensure all the LEDs are off
root@storage1:~# ls /dev/sd* | grep -e "[a-z]$" | xargs -n1 -t -i{} ledctl normal={} 
ledctl normal=/dev/sda 
ledctl normal=/dev/sdb 
ledctl normal=/dev/sdc 
ledctl normal=/dev/sdd 
ledctl normal=/dev/sde 
ledctl normal=/dev/sdf 
ledctl normal=/dev/sdg 
ledctl normal=/dev/sdh 
ledctl normal=/dev/sdi 
ledctl normal=/dev/sdj 
ledctl normal=/dev/sdk 
ledctl normal=/dev/sdl 
ledctl normal=/dev/sdm 
ledctl normal=/dev/sdn
  • Turn on the locate LED of the two disks to replace
root@storage1:~# ledctl locate=/dev/sdb
root@storage1:~# ledctl locate=/dev/sdc
Apr 15 2021, 2:31 PM · System administration, Staging environment
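
The LED procedure in the comment above can be wrapped in a small dry-run helper. This is only a sketch: `whole_disks` and `led_plan` are hypothetical names, and the helper prints the `ledctl` commands instead of running them, so the plan can be reviewed before it is executed.

```shell
#!/bin/sh
# Keep only whole-disk names (ending in a letter), dropping partitions such as /dev/sdb1.
whole_disks() {
    grep -e '[a-z]$'
}

# Read device names on stdin and print the ledctl commands:
# first reset every LED to normal, then turn on the locate LED of each argument.
led_plan() {
    whole_disks | while read -r dev; do
        echo "ledctl normal=$dev"
    done
    for target in "$@"; do
        echo "ledctl locate=$target"
    done
}

# Example (prints the commands only):
#   ls /dev/sd* | led_plan /dev/sdb /dev/sdc
```

Piping the printed plan to `sh` as root would then apply it.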

Apr 12 2021

vsellier changed the status of T3243: Replace /dev/sdb and /dev/sdc on storage1.staging, a subtask of T3236: staging: Disk error on storage1, from Open to Work in Progress.
Apr 12 2021, 7:31 PM · System administration, Staging environment
vsellier changed the status of T3243: Replace /dev/sdb and /dev/sdc on storage1.staging from Open to Work in Progress.

The disks are removed from the zfs pool. The replacement can now be done.

Apr 12 2021, 7:31 PM · System administration, Staging environment
vsellier closed T3236: staging: Disk error on storage1 as Resolved.
Apr 12 2021, 7:30 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

The mirror is removed from the pool:

root@storage1:~# zpool list
NAME   SIZE  ALLOC   FREE  CKPOINT  EXPANDSZ   FRAG    CAP  DEDUP    HEALTH  ALTROOT
data  21.8T  2.50T  19.3T        -         -    20%    11%  1.00x    ONLINE  -
Apr 12 2021, 7:30 PM · System administration, Staging environment
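
To check the pool state from a script rather than by eye, the HEALTH column of the `zpool list` output shown above can be extracted. A minimal sketch, assuming the default tabular output; `pool_health` is a hypothetical helper name.

```shell
#!/bin/sh
# Read `zpool list` output on stdin and print the HEALTH value of pool $1.
# The column is located from the header line, so extra columns do not matter.
pool_health() {
    awk -v pool="$1" '
        NR == 1 { for (i = 1; i <= NF; i++) if ($i == "HEALTH") col = i; next }
        $1 == pool && col { print $col }
    '
}

# Example:
#   zpool list | pool_health data
```

`zpool list -H -o health data` should give the same value directly; the parsing variant is mainly useful on saved output.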
vsellier added a comment to T3243: Replace /dev/sdb and /dev/sdc on storage1.staging.

Ticket opened on the Seagate site for the replacement of these 2 disks. The information will be transferred to the DSI for the packaging as soon as the disks are removed from the pool.

Apr 12 2021, 4:14 PM · System administration, Staging environment
vsellier triaged T3243: Replace /dev/sdb and /dev/sdc on storage1.staging as High priority.
Apr 12 2021, 2:26 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

The mirror-1 removal is in progress:

root@storage1:~# zpool remove data mirror-1
Apr 12 2021, 2:19 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

There are 2 disks with errors that should now be replaced:

  • /dev/sdb (wwn-0x5000c500a23e3868): an old one
  • /dev/sdc (wwn-0x5000c500a22f48c9): the disk just removed from the pool
Apr 12 2021, 12:56 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

The failing disk was removed from the pool:

root@storage1:~# zpool detach data wwn-0x5000c500a22f48c9
Apr 12 2021, 12:49 PM · System administration, Staging environment
vsellier added a comment to T3236: staging: Disk error on storage1.

The new failing drive is /dev/sdc

root@storage1:~# ls -al /dev/disk/by-id/ | grep wwn-0x5000c500a22f48c9
lrwxrwxrwx 1 root root    9 Apr 11 03:42 wwn-0x5000c500a22f48c9 -> ../../sdc
lrwxrwxrwx 1 root root   10 Mar 11 17:08 wwn-0x5000c500a22f48c9-part1 -> ../../sdc1
lrwxrwxrwx 1 root root   10 Mar 11 17:08 wwn-0x5000c500a22f48c9-part9 -> ../../sdc9
Apr 12 2021, 12:46 PM · System administration, Staging environment
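
The lookup above (from the wwn identifier reported by zfs to the kernel device name) can be scripted by reading the symlink. A minimal sketch; `resolve_disk_id` is a hypothetical helper, and the directory argument exists only so the function can be tried outside `/dev/disk/by-id`.

```shell
#!/bin/sh
# Print the device name (e.g. sdc) that a by-id entry points to.
# $1: identifier such as wwn-0x5000c500a22f48c9
# $2: directory holding the symlinks (defaults to /dev/disk/by-id)
resolve_disk_id() {
    dir=${2:-/dev/disk/by-id}
    target=$(readlink "$dir/$1") || return 1
    basename "$target"
}

# Example:
#   resolve_disk_id wwn-0x5000c500a22f48c9
```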
vsellier changed the status of T3236: staging: Disk error on storage1 from Open to Work in Progress.
Apr 12 2021, 12:09 PM · System administration, Staging environment
vsellier triaged T3236: staging: Disk error on storage1 as High priority.
Apr 12 2021, 9:36 AM · System administration, Staging environment

Mar 22 2021

vsellier closed T3159: Deploy swh-counters:v0.1.0 in staging as Resolved.

A new vm counters0.internal.staging.swh.network is deployed and hosting redis, swh-counters and its journal-client.
The lag in staging will be recovered in a couple of hours.

Mar 22 2021, 5:34 PM · Staging environment, System administration, Monitoring
vsellier added a revision to T3159: Deploy swh-counters:v0.1.0 in staging: D5297: staging: Add counters0 vm.
Mar 22 2021, 3:40 PM · Staging environment, System administration, Monitoring
vsellier added a revision to T3159: Deploy swh-counters:v0.1.0 in staging: D5296: Add swh-counters deployment configuration.
Mar 22 2021, 8:32 AM · Staging environment, System administration, Monitoring

Mar 19 2021

vsellier moved T3159: Deploy swh-counters:v0.1.0 in staging from Backlog to in-progress on the System administration board.
Mar 19 2021, 12:39 PM · Staging environment, System administration, Monitoring
vsellier changed the status of T3159: Deploy swh-counters:v0.1.0 in staging from Open to Work in Progress.
Mar 19 2021, 12:39 PM · Staging environment, System administration, Monitoring

Feb 5 2021

vsellier added a comment to T2231: Continuous deployment.

I started to jot down some ideas in this document: https://hedgedoc.softwareheritage.org/Fi2pq7zkSw6aVAJwk9Xhqw

Feb 5 2021, 5:48 PM · meta-task, Roadmap 2022, Staging environment, Roadmap 2020

Jan 29 2021

ardumont added a comment to T2920: Document staging infrastructure.

awesome, thanks.

Jan 29 2021, 12:24 PM · Documentation, System administration, Staging environment
vsellier moved T2920: Document staging infrastructure from in-progress to done on the System administration board.
Jan 29 2021, 12:21 PM · Documentation, System administration, Staging environment
vsellier closed T2920: Document staging infrastructure as Resolved.
  • Inventory updated to ensure all the components are associated to the staging environment
  • Staging page on the intranet updated [1]
  • Staging section on the network page [2] on the intranet updated
Jan 29 2021, 12:20 PM · Documentation, System administration, Staging environment

Jan 27 2021

vsellier added a comment to T2920: Document staging infrastructure.

This is a tryout to generate a global schema of the staging environment (P929):

Jan 27 2021, 6:09 PM · Documentation, System administration, Staging environment

Jan 25 2021

vsellier changed the status of T2920: Document staging infrastructure from Open to Work in Progress.
Jan 25 2021, 3:32 PM · Documentation, System administration, Staging environment

Jan 20 2021

moranegg added a project to T2920: Document staging infrastructure: Documentation.
Jan 20 2021, 10:33 AM · Documentation, System administration, Staging environment

Jan 18 2021

vsellier moved T2920: Document staging infrastructure from Backlog to Weekly backlog on the System administration board.
Jan 18 2021, 7:13 PM · Documentation, System administration, Staging environment
vsellier added a project to T2920: Document staging infrastructure: System administration.
Jan 18 2021, 7:13 PM · Documentation, System administration, Staging environment

Jan 6 2021

ardumont added a comment to T2770: Fix all icinga checks on staging webapp.

The last check no longer appears in icinga.

Jan 6 2021, 4:36 PM · Monitoring, System administration, Staging environment
ardumont closed T2770: Fix all icinga checks on staging webapp as Resolved.
Jan 6 2021, 4:36 PM · Monitoring, System administration, Staging environment
ardumont changed the status of T2770: Fix all icinga checks on staging webapp from Open to Work in Progress.
Jan 6 2021, 4:36 PM · Monitoring, System administration, Staging environment
ardumont moved T2877: Investigate spurious deposit logs from Backlog to deployed/landed/monitoring on the System administration board.
Jan 6 2021, 3:45 PM · System administration, Staging environment, SWORD deposit

Jan 4 2021

vsellier closed T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage) as Resolved.

Closing this task as all the direct work is done.
The documentation will be addressed in T2920

Jan 4 2021, 12:33 PM · Staging environment, System administration
vsellier triaged T2920: Document staging infrastructure as Normal priority.
Jan 4 2021, 12:32 PM · Documentation, System administration, Staging environment

Dec 22 2020

vsellier added a comment to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).

Everything looks good, let's try to add some documentation before closing the issue

Dec 22 2020, 9:56 AM · Staging environment, System administration
vsellier updated the task description for T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).
Dec 22 2020, 9:54 AM · Staging environment, System administration

Dec 21 2020

vsellier added a comment to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).
  • A new vm objstorage0.internal.staging.swh.network is configured with a read-only object storage service
  • It's exposed to the internet via the reverse proxy at https://objstorage.staging.swh.network (this is quite different from the usual objstorage:5003 URL, but it allows exposing the service without any new network configuration)
  • DNS entry added on gandi
  • Inventory updated
Dec 21 2020, 7:32 PM · Staging environment, System administration
vsellier added a revision to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage): D4776: [staging] Configure and expose to internet a read-only objstorage.
Dec 21 2020, 6:01 PM · Staging environment, System administration
vsellier added a revision to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage): D4775: Add objstorage0.staging.swh.network node to expose a r/o objstorage node.
Dec 21 2020, 4:48 PM · Staging environment, System administration
vsellier updated the task description for T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).
Dec 21 2020, 12:58 PM · Staging environment, System administration
vsellier added a comment to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).

A user was correctly configured and a read test performed :

Dec 21 2020, 12:57 PM · Staging environment, System administration
vsellier updated the task description for T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).
Dec 21 2020, 12:38 PM · Staging environment, System administration
vsellier added a comment to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).

The network configuration is done. The server is now accessible from the internet at broker0.journal.staging.swh.network:9093

Dec 21 2020, 12:25 PM · Staging environment, System administration
vsellier updated the task description for T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).
Dec 21 2020, 12:24 PM · Staging environment, System administration

Dec 18 2020

vsellier updated the task description for T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).
Dec 18 2020, 4:59 PM · Staging environment, System administration
vsellier added a comment to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).

The request to expose the journal to the internet was sent to the DSI this afternoon.

Dec 18 2020, 4:57 PM · Staging environment, System administration

Dec 17 2020

vsellier closed T2897: [staging] kafka data dir over 80%, a subtask of T2790: [staging] deploy the journal infrastructure, as Resolved.
Dec 17 2020, 10:00 AM · System administration, Staging environment
vsellier closed T2897: [staging] kafka data dir over 80% as Resolved.
Dec 17 2020, 10:00 AM · System administration, Staging environment
vsellier added a comment to T2897: [staging] kafka data dir over 80%.

After one week, the disk used by kafka was around 85% full:

root@journal0:/tmp# df -h /srv/kafka/logdir
Filesystem      Size  Used Avail Use% Mounted on
kafka-volume    481G  409G   73G  85% /srv/kafka/logdir

Unlike in production, compression was not activated on the zfs pool:

root@kafka1:~#  zfs get all data/kafka  | grep compress
data/kafka  compressratio         1.55x                  -
data/kafka  compression           lz4                    inherited from data
data/kafka  refcompressratio      1.55x                  -
root@journal0:/tmp# zfs get all  | grep compress
kafka-volume  compressratio         1.00x                  -
kafka-volume  compression           off                    default
kafka-volume  refcompressratio      1.00x                  -

So the compression was activated :

root@journal0:/tmp# zfs set compression=lz4 kafka-volume
root@journal0:/tmp# zfs get all  | grep compress
kafka-volume  compressratio         1.00x                  -
kafka-volume  compression           lz4                    local
kafka-volume  refcompressratio      1.00x                  -

As this parameter only applies to newly written data, we forced a compaction on the biggest topics: `directory`, `revision` and `content`

 % ./kafka-topics.sh --zookeeper $ZK  --alter --topic swh.journal.objects.revision --config min.cleanable.dirty.ratio=0.01
WARNING: Altering topic configuration from this script has been deprecated and may be removed in future releases.
         Going forward, please use kafka-configs.sh for this functionality
Updated config for topic swh.journal.objects.revision.
vsellier@journal0 /opt/kafka/bin
 % ./kafka-topics.sh --zookeeper $ZK  --alter --topic swh.journal.objects_privileged.revision --config min.cleanable.dirty.ratio=0.01
WARNING: Altering topic configuration from this script has been deprecated and may be removed in future releases.
         Going forward, please use kafka-configs.sh for this functionality
Updated config for topic swh.journal.objects_privileged.revision.
Dec 17 2020, 10:00 AM · System administration, Staging environment
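
The deprecation warning in the transcript above points to `kafka-configs.sh`. A hedged sketch of the equivalent call is given here as a dry-run helper that only prints the commands; `compaction_plan` is a hypothetical name, and the `--zookeeper` form is an assumption matching the Kafka version used in the transcript (newer releases expect `--bootstrap-server` instead).

```shell
#!/bin/sh
# Print one kafka-configs.sh command per topic, lowering min.cleanable.dirty.ratio
# so that the log cleaner compacts the topic sooner.
# $1: zookeeper connection string, remaining arguments: topic names.
compaction_plan() {
    zk=$1
    shift
    for topic in "$@"; do
        echo "./kafka-configs.sh --zookeeper $zk --entity-type topics" \
             "--entity-name $topic --alter --add-config min.cleanable.dirty.ratio=0.01"
    done
}

# Example (prints the commands only):
#   compaction_plan "$ZK" swh.journal.objects.revision swh.journal.objects_privileged.revision
```

Once the compaction has run, the override can presumably be dropped again with `--delete-config min.cleanable.dirty.ratio` so the topic returns to its default ratio.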
vsellier changed the status of T2897: [staging] kafka data dir over 80% from Open to Work in Progress.
Dec 17 2020, 9:58 AM · System administration, Staging environment

Dec 14 2020

vsellier added a comment to T2817: Enable the swh-search environment in staging.

With the "optimized" configuration, the import is much faster:

root@search-esnode0:~# curl -XPOST -H "Content-Type: application/json" http://${ES_SERVER}/_reindex\?pretty\&refresh=true\&requests_per_second=-1\&\&wait_for_completion=true -d @/tmp/reindex-production.json    
{
  "took" : 10215280,
  "timed_out" : false,
  "total" : 91517657,
  "updated" : 0,
  "created" : 91517657,
  "deleted" : 0,
  "batches" : 91518,
  "version_conflicts" : 0,
  "noops" : 0,
  "retries" : {
    "bulk" : 0,
    "search" : 0
  },
  "throttled_millis" : 0,
  "requests_per_second" : -1.0,
  "throttled_until_millis" : 0,
  "failures" : [ ]
}

"took" : 10215280 (milliseconds) => about 2h50

Dec 14 2020, 9:47 AM · System administrators, Staging environment, Journal, Archive search
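
The `took` field of the reindex response is expressed in milliseconds. A small sketch of the conversion to a duration and to an average indexing rate, using integer shell arithmetic; `ms_to_hm` and `docs_per_second` are hypothetical helper names.

```shell
#!/bin/sh
# Convert a duration in milliseconds to a "<hours>h<minutes>" string (seconds are dropped).
ms_to_hm() {
    secs=$(( $1 / 1000 ))
    printf '%dh%02d\n' $(( secs / 3600 )) $(( secs % 3600 / 60 ))
}

# Average number of documents indexed per second (rounded down).
# $1: number of documents, $2: duration in milliseconds.
docs_per_second() {
    echo $(( $1 / ($2 / 1000) ))
}

# Values taken from the reindex response above.
ms_to_hm 10215280
docs_per_second 91517657 10215280
```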

Dec 11 2020

vsellier added a comment to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage).
  • diff landed and applied on the server
  • VIP 128.93.166.40 configured on the firewall
  • NAT Port forward of port 9093 from public ip to internal journal0 declared on the firewall
  • DNS declaration of broker0.journal.staging.swh.network in gandi
  • Ask the DSI to apply the kafka firewall profile to 128.93.166.40
  • Configure a user to test the pipeline
Dec 11 2020, 6:11 PM · Staging environment, System administration
ardumont moved T2877: Investigate spurious deposit logs from Backlog to Deployed on the SWORD deposit board.
Dec 11 2020, 3:22 PM · System administration, Staging environment, SWORD deposit
vsellier added a revision to T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage): D4726: kafka: activate the authentication on the public network.
Dec 11 2020, 3:17 PM · Staging environment, System administration
ardumont placed T2877: Investigate spurious deposit logs up for grabs.
Dec 11 2020, 3:03 PM · System administration, Staging environment, SWORD deposit
ardumont closed T2877: Investigate spurious deposit logs as Resolved.
Dec 11 2020, 3:03 PM · System administration, Staging environment, SWORD deposit
ardumont claimed T2877: Investigate spurious deposit logs.

And now spurious logs are gone for the deposit.

Dec 11 2020, 3:03 PM · System administration, Staging environment, SWORD deposit
ardumont added a comment to T2877: Investigate spurious deposit logs.

Deployed (rp0.staging, webapp0.azure, moma).

Dec 11 2020, 3:02 PM · System administration, Staging environment, SWORD deposit
vsellier added a comment to T2877: Investigate spurious deposit logs.

I agree for the default site but we have several legit requests from the monitoring not correctly routed so the configuration needs to be adapted.

Dec 11 2020, 11:46 AM · System administration, Staging environment, SWORD deposit
ardumont updated the task description for T2877: Investigate spurious deposit logs.
Dec 11 2020, 11:45 AM · System administration, Staging environment, SWORD deposit
vsellier added a revision to T2877: Investigate spurious deposit logs: D4719: varnish: Correctly handle the vhost when the port number is included.
Dec 11 2020, 11:42 AM · System administration, Staging environment, SWORD deposit
vlorentz added a comment to T2877: Investigate spurious deposit logs.

You could just add a 00-default vhost that shows a generic error message. (that's not even a hack to rely on alphabetical order for vhost configs)

Dec 11 2020, 11:35 AM · System administration, Staging environment, SWORD deposit
ardumont triaged T2877: Investigate spurious deposit logs as Normal priority.
Dec 11 2020, 11:17 AM · System administration, Staging environment, SWORD deposit
vsellier added a comment to T2817: Enable the swh-search environment in staging.

The production origin index was correctly copied from the production cluster, but apparently without the configuration meant to optimize the copy.
We keep this copy and try a new optimized one to check whether the server still crashes with an OOM under the new cpu and memory settings.

Dec 11 2020, 10:15 AM · System administrators, Staging environment, Journal, Archive search

Dec 10 2020

vsellier changed the status of T2682: Deploy a small publicly available kafka server (with some content) on a staging (+ the related objstorage) from Open to Work in Progress.
Dec 10 2020, 5:41 PM · Staging environment, System administration
vsellier added a comment to T2817: Enable the swh-search environment in staging.

FYI: The origin index was recreated with the "official" mapping and a backfill was performed (necessary after the test of the flattened mapping)

Dec 10 2020, 3:42 PM · System administrators, Staging environment, Journal, Archive search
vsellier closed T2817: Enable the swh-search environment in staging as Resolved.

The deployment manifests are ok and deployed in staging, so this task can be resolved.
We will work on reactivating search-journal-client for the metadata in another task once T2876 is resolved.

Dec 10 2020, 3:29 PM · System administrators, Staging environment, Journal, Archive search
vsellier updated the task description for T2817: Enable the swh-search environment in staging.
Dec 10 2020, 3:19 PM · System administrators, Staging environment, Journal, Archive search
ardumont updated the task description for T2817: Enable the swh-search environment in staging.
Dec 10 2020, 1:21 PM · System administrators, Staging environment, Journal, Archive search
ardumont added a revision to T2817: Enable the swh-search environment in staging: D4712: staging: Increase elasticsearch jvm heap size to half its memory.
Dec 10 2020, 11:47 AM · System administrators, Staging environment, Journal, Archive search
vsellier added a comment to T2817: Enable the swh-search environment in staging.

The copy of the production index is restarted.
To improve the speed of the copy, the index was tuned to reduce the disk pressure (it's a temporary configuration and should not be used in a normal case as it's not safe) :

cat >/tmp/config.json <<EOF
{
  "index" : {
    "translog.sync_interval" : "60s",
    "translog.durability": "async",
    "refresh_interval": "60s"
  }
}
EOF
Dec 10 2020, 11:14 AM · System administrators, Staging environment, Journal, Archive search
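
The comment above shows the settings file but not how it is applied. A minimal sketch using the Elasticsearch index settings endpoint (`PUT /<index>/_settings`); the index name `origin`, the helper name `apply_index_settings` and the `CURL` override are assumptions made for illustration.

```shell
#!/bin/sh
# Command used to talk to Elasticsearch; set CURL=echo to print the request instead.
CURL=${CURL:-curl}

# Apply a JSON settings file to an index.
# $1: index name, $2: path to the settings file. ES_SERVER must hold host:port.
apply_index_settings() {
    "$CURL" -XPUT -H "Content-Type: application/json" \
        "http://${ES_SERVER}/$1/_settings" -d "@$2"
}

# Example:
#   ES_SERVER=localhost:9200 apply_index_settings origin /tmp/config.json
```

Since these values trade durability for speed, they should be reverted after the copy; sending the same keys with a `null` value resets them to the Elasticsearch defaults.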
vsellier added a comment to T2817: Enable the swh-search environment in staging.
  • Partition and memory extended with terraform.
  • The disk resize needed some console actions to be extended :
Dec 10 2020, 10:39 AM · System administrators, Staging environment, Journal, Archive search
vsellier added a comment to T2817: Enable the swh-search environment in staging.

The production index import failed because the 90% disk usage limit was reached at some point, before usage fell back to around 60G after a compaction.
The import had processed 80M of the 91M documents.

Dec 10 2020, 9:59 AM · System administrators, Staging environment, Journal, Archive search

Dec 9 2020

ardumont added a revision to T2817: Enable the swh-search environment in staging: D4710: search.journal_client: Fix key error.
Dec 9 2020, 10:26 PM · System administrators, Staging environment, Journal, Archive search
ardumont added a revision to T2817: Enable the swh-search environment in staging: D4709: indexer_storage: Publish indexer computation to journal topics.
Dec 9 2020, 10:09 PM · System administrators, Staging environment, Journal, Archive search
ardumont added a revision to T2817: Enable the swh-search environment in staging: D4704: docker-compose.search.yml: Add journal client for indexed values.
Dec 9 2020, 6:19 PM · System administrators, Staging environment, Journal, Archive search
vsellier added a revision to T2817: Enable the swh-search environment in staging: D4701: Allow configuration through cli or config file.
Dec 9 2020, 5:57 PM · System administrators, Staging environment, Journal, Archive search
ardumont added a revision to T2817: Enable the swh-search environment in staging: D4699: search: Deploy multiple search journal client instances.
Dec 9 2020, 5:20 PM · System administrators, Staging environment, Journal, Archive search
ardumont updated the task description for T2817: Enable the swh-search environment in staging.
Dec 9 2020, 11:39 AM · System administrators, Staging environment, Journal, Archive search
vsellier added a comment to T2817: Enable the swh-search environment in staging.

The search rpc backend and the journal client listening on origin and origin_visit topics are deployed.
The inventory is up to date for both hosts [1][2]

Dec 9 2020, 9:51 AM · System administrators, Staging environment, Journal, Archive search
vsellier updated the task description for T2817: Enable the swh-search environment in staging.
Dec 9 2020, 9:35 AM · System administrators, Staging environment, Journal, Archive search

Dec 8 2020

ardumont added a revision to T2817: Enable the swh-search environment in staging: D4687: search: Add initialization step on install or upgrade.
Dec 8 2020, 4:06 PM · System administrators, Staging environment, Journal, Archive search
vsellier added a comment to T2817: Enable the swh-search environment in staging.

A dashboard to monitor the ES cluster behavior has been created on grafana [1]
It will be improved during the swh-search tests

Dec 8 2020, 10:49 AM · System administrators, Staging environment, Journal, Archive search

Dec 7 2020

vsellier added a comment to T2817: Enable the swh-search environment in staging.

Interesting note about how to size the shards of an index : https://www.elastic.co/guide/en/elasticsearch/reference/7.x//size-your-shards.html

Dec 7 2020, 6:15 PM · System administrators, Staging environment, Journal, Archive search

Dec 4 2020

ardumont added a revision to T2817: Enable the swh-search environment in staging: D4668: Add swh-search-journal-client to swh_search_with_journal_client role.
Dec 4 2020, 7:27 PM · System administrators, Staging environment, Journal, Archive search
ardumont added a revision to T2817: Enable the swh-search environment in staging: D4666: staging: Deploy swh-search rpc backend on search0.
Dec 4 2020, 4:54 PM · System administrators, Staging environment, Journal, Archive search
ardumont added a comment to T2817: Enable the swh-search environment in staging.

We added a 100GiB volume to search-esnode0 through terraform (D4663),
so that /srv/elasticsearch can be mounted as a zfs volume.

Dec 4 2020, 12:44 PM · System administrators, Staging environment, Journal, Archive search
ardumont added a revision to T2817: Enable the swh-search environment in staging: D4664: search0: Add swh-search rpc backend node.
Dec 4 2020, 12:11 PM · System administrators, Staging environment, Journal, Archive search
ardumont added a revision to T2817: Enable the swh-search environment in staging: D4663: search-esnode0: Add a 100Gib storage disk.
Dec 4 2020, 12:04 PM · System administrators, Staging environment, Journal, Archive search
vsellier added a comment to T2817: Enable the swh-search environment in staging.

dedicated ES node for staging deployed (search-esnode0.internal.staging.swh.network) with D4658 and D4651

Dec 4 2020, 11:46 AM · System administrators, Staging environment, Journal, Archive search
vsellier updated the task description for T2817: Enable the swh-search environment in staging.
Dec 4 2020, 11:44 AM · System administrators, Staging environment, Journal, Archive search

Dec 3 2020

ardumont added a revision to T2817: Enable the swh-search environment in staging: D4658: staging: Add search-esnode0.
Dec 3 2020, 5:59 PM · System administrators, Staging environment, Journal, Archive search
vsellier added a revision to T2817: Enable the swh-search environment in staging: D4654: -wip- Switch to the official elasticsearch plugin.
Dec 3 2020, 12:21 PM · System administrators, Staging environment, Journal, Archive search

Dec 2 2020

ardumont added a revision to T2817: Enable the swh-search environment in staging: D4651: Puppetize elasticsearch nodes.
Dec 2 2020, 4:53 PM · System administrators, Staging environment, Journal, Archive search
vsellier added a comment to T2761: Install webapp counters in the staging webapp/storage.

After T2828, it is clearer what must be deployed to have the counters working on staging:

  • the counters can be initialized via the /stat/refresh endpoint of the storage api (Note: it will create more counters than in production, as directory_entry_* and revision_history are not counted in production)
  • Add a script/service to execute the `swh_update_counter_bucketed` in an infinite loop
  • Create the buckets in the object_counts_bucketed
    • per object type: identifier|bucket_start|bucket_end. value and last_update will be updated by the stored procedures.
  • configure prometheus sql exporter for db1.staging [1]
  • configure profile_exporter on pergamon
    • Update the script to ensure the data are filtered by environment (to avoid staging data being included in production counts [2])
    • Configure a new cron
      • loading an empty file for historical data
      • creating a new export_file
  • update webapp to be able to configure the counter origin
Dec 2 2020, 9:55 AM · Storage manager, Web app, Staging environment
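
The "script/service" item in the list above could look like the following loop. This is only a sketch under several assumptions: that `swh_update_counter_bucketed` is a stored procedure invoked with `CALL`, that `psql` can connect to the storage database without extra options, and that the database name is passed by the caller. `update_counters_loop` and the `PSQL` override are hypothetical names.

```shell
#!/bin/sh
# Client used to reach the database; set PSQL=echo to print the calls instead.
PSQL=${PSQL:-psql}

# Call the stored procedure repeatedly.
# $1: database name, $2: pause in seconds between calls (default 1),
# $3: optional maximum number of iterations (empty means loop forever).
update_counters_loop() {
    db=$1
    pause=${2:-1}
    max=${3:-}
    n=0
    while [ -z "$max" ] || [ "$n" -lt "$max" ]; do
        "$PSQL" -d "$db" -c 'CALL swh_update_counter_bucketed()' || return 1
        n=$(( n + 1 ))
        sleep "$pause"
    done
}

# Example (runs until interrupted; the database name is a placeholder):
#   update_counters_loop <database> 1
```

Run under a service manager, the loop would be restarted automatically if the database connection fails.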
ardumont raised the priority of T2761: Install webapp counters in the staging webapp/storage from Low to Normal.
Dec 2 2020, 9:41 AM · Storage manager, Web app, Staging environment
ardumont updated the task description for T2761: Install webapp counters in the staging webapp/storage.
Dec 2 2020, 9:40 AM · Storage manager, Web app, Staging environment