Page MenuHomeSoftware Heritage

vsellier (Vincent Sellier)
UserAdministrator

User Details

User Since
Mar 21 2019, 4:14 PM (185 w, 11 h)
Roles
Administrator

Recent Activity

Today

vsellier committed rSKCONF5058d885aed1: gitlab-staging: upgrade operator to 0.12.4 (authored by vsellier).
gitlab-staging: upgrade operator to 0.12.4
Fri, Oct 7, 12:14 AM
vsellier updated the task description for T4610: upgrade staging instance to 15.4.
Fri, Oct 7, 12:13 AM · meta-task, Roadmap 2020

Yesterday

vsellier committed rSKCONFabefae9c9438: gitlab-staging: fix secrets cluster ip (authored by vsellier).
gitlab-staging: fix secrets cluster ip
Thu, Oct 6, 11:33 PM
vsellier committed rSKCONF1e5488d5005a: gitlab-staging: upgrade operator to 0.12.0 (authored by vsellier).
gitlab-staging: upgrade operator to 0.12.0
Thu, Oct 6, 9:35 PM
vsellier closed D8635: gitlab-staging: Add the configuration to install the gitlab operator.
Thu, Oct 6, 9:32 PM
vsellier committed rSKCONFfef45a541365: gitlab-staging: Add the configuration to install the gitlab operator (authored by vsellier).
gitlab-staging: Add the configuration to install the gitlab operator
Thu, Oct 6, 9:32 PM
vsellier updated the task description for T4610: upgrade staging instance to 15.4.
Thu, Oct 6, 9:31 PM · meta-task, Roadmap 2020
vsellier updated the task description for T4610: upgrade staging instance to 15.4.
Thu, Oct 6, 9:30 PM · meta-task, Roadmap 2020
vsellier changed the status of T4610: upgrade staging instance to 15.4 from Open to Work in Progress.
Thu, Oct 6, 9:17 PM · meta-task, Roadmap 2020
vsellier updated the diff for D8635: gitlab-staging: Add the configuration to install the gitlab operator.
  • fix the version number
  • remove the debug command line
Thu, Oct 6, 8:08 PM
vsellier added inline comments to D8635: gitlab-staging: Add the configuration to install the gitlab operator.
Thu, Oct 6, 8:05 PM
vsellier added inline comments to D8635: gitlab-staging: Add the configuration to install the gitlab operator.
Thu, Oct 6, 1:32 PM
vsellier requested review of D8635: gitlab-staging: Add the configuration to install the gitlab operator.
Thu, Oct 6, 11:03 AM
vsellier added a revision to T4063: Deploy gitlab instance for production: D8635: gitlab-staging: Add the configuration to install the gitlab operator.
Thu, Oct 6, 11:03 AM · System administration, GitLab migration

Wed, Oct 5

vsellier added a comment to T4358: Upgrade AKS versions.

Looks like the gitlab operator is now compatible with kubernetes 1.22.
https://docs.gitlab.com/operator/installation.html#kubernetes

Wed, Oct 5, 2:24 PM · System administration
vsellier planned changes to D8617: thanos: Declare archive-production thanos for live data querying.

unfortunately it can't work without T4604 as the store is configured to use the letencrypt certificate:

Wed, Oct 5, 10:52 AM
vsellier triaged T4604: [dynamic infra] Manage SSL certificates as Normal priority.
Wed, Oct 5, 10:11 AM · System administration
vsellier renamed T4063: Deploy gitlab instance for production from Deploy gitlab instance to Deploy gitlab instance for production.
Wed, Oct 5, 9:57 AM · System administration, GitLab migration

Tue, Oct 4

vsellier requested review of D8617: thanos: Declare archive-production thanos for live data querying.
Tue, Oct 4, 6:45 PM
vsellier added a revision to T4385: Federate prometheus instances through thanos: D8617: thanos: Declare archive-production thanos for live data querying.
Tue, Oct 4, 6:45 PM · meta-task, System administration, Roadmap 2022
vsellier closed D8599: k8s-archive-production: Add an internal ingress to expose reaper webui.
Tue, Oct 4, 6:42 PM
vsellier committed rSPSITE9065bc6afddd: k8s-archive-production: Add an internal ingress to expose reaper webui (authored by vsellier).
k8s-archive-production: Add an internal ingress to expose reaper webui
Tue, Oct 4, 6:42 PM
vsellier updated the diff for D8599: k8s-archive-production: Add an internal ingress to expose reaper webui.

rebase

Tue, Oct 4, 6:42 PM
vsellier added a comment to T4251: [swh-search] Investigate long search queries response time.

@KShivendu Any news regarding these profiling ?

Tue, Oct 4, 6:22 PM · System administration, Archive search
vsellier triaged T4603: move graphql to a sub url instead of the standalone vhost as Normal priority.
Tue, Oct 4, 6:20 PM · System administration, GraphQL API
vsellier closed T4132: Add the graphql service in the docker environment, a subtask of T4131: Graphql service in staging, as Resolved.
Tue, Oct 4, 6:16 PM · System administration, GraphQL API
vsellier closed T4132: Add the graphql service in the docker environment as Resolved.
Tue, Oct 4, 6:16 PM · System administration, GraphQL API
vsellier updated the task description for T4132: Add the graphql service in the docker environment.
Tue, Oct 4, 6:16 PM · System administration, GraphQL API
vsellier closed T4497: [sentry] Out of disk space as Resolved.

Closing as there is no alerts since almost one month

Tue, Oct 4, 6:15 PM · Sentry, System administration
vsellier committed rSKCONFee77cc15f039: argocd: ignore argocd-cm and argocd-rabc-cm changes (authored by vsellier).
argocd: ignore argocd-cm and argocd-rabc-cm changes
Tue, Oct 4, 4:17 PM
vsellier closed T4534: Evaluate MetalLB as inbound loadbalancer, a subtask of T4523: Dynamic infrastructure, as Resolved.
Tue, Oct 4, 3:03 PM · meta-task, System administration
vsellier closed T4534: Evaluate MetalLB as inbound loadbalancer as Resolved.

regarding the last tests, we can start using it to battle proof its usage.
I found in several documentations where it's the tool recommended to manage load balancing on on-premise kubernetes deployments, for example: https://kubernetes.github.io/ingress-nginx/deploy/baremetal/#a-pure-software-solution-metallb

Tue, Oct 4, 3:03 PM · System administration
vsellier added a comment to T4385: Federate prometheus instances through thanos.

thanos expose on production cluster in this commit: rSPRE8fade05553ed4a01e54e1b8481150c0e055e3f34

Tue, Oct 4, 2:32 PM · meta-task, System administration, Roadmap 2022
vsellier committed rSPRE8fade05553ed: Export the grpc port of thanos through an ingress configuration (authored by vsellier).
Export the grpc port of thanos through an ingress configuration
Tue, Oct 4, 2:20 PM
vsellier committed R260:e79a98efcf72: cassandra-replay: increase revision replayer count (authored by vsellier).
cassandra-replay: increase revision replayer count
Tue, Oct 4, 9:32 AM

Mon, Oct 3

vsellier created P1468 (An Untitled Masterwork).
Mon, Oct 3, 4:03 PM
vsellier committed rSPREfa50a5ff1cb2: k8s-archive-production: active thanos sidecard and services (authored by vsellier).
k8s-archive-production: active thanos sidecard and services
Mon, Oct 3, 3:43 PM
vsellier committed R260:b4067e7cf048: cassandra-replay: reduce load on production cluster during metallb tests (authored by vsellier).
cassandra-replay: reduce load on production cluster during metallb tests
Mon, Oct 3, 3:05 PM
vsellier requested review of D8599: k8s-archive-production: Add an internal ingress to expose reaper webui.
Mon, Oct 3, 1:55 PM
vsellier added a revision to T4458: Test reaper to automate the cassandra repair actions: D8599: k8s-archive-production: Add an internal ingress to expose reaper webui.
Mon, Oct 3, 1:55 PM · System administration
vsellier committed rSKCONF110115991c54: reaper: Add an ingress to internally expose the webui (authored by vsellier).
reaper: Add an ingress to internally expose the webui
Mon, Oct 3, 1:49 PM

Sat, Oct 1

vsellier committed R260:de634df57950: cassandra-replay: redispatch replayers after snapshot catchup (authored by vsellier).
cassandra-replay: redispatch replayers after snapshot catchup
Sat, Oct 1, 12:06 PM

Fri, Sep 30

vsellier committed rSKCONF39fd005dd4af: archive-production: configure metallb to allow several service on same ip (authored by vsellier).
archive-production: configure metallb to allow several service on same ip
Fri, Sep 30, 8:01 PM
vsellier committed rSPRE3147328ddfa6: k8s-archive-production: configure thanos sidecar to push on azure (authored by vsellier).
k8s-archive-production: configure thanos sidecar to push on azure
Fri, Sep 30, 5:55 PM
vsellier closed T4461: Move argocd to a private admin url as Resolved.
Fri, Sep 30, 5:27 PM · System administration
vsellier committed rSPSITE4b2879647a52: argocd: fix the missing dot in the dns declaration (authored by vsellier).
argocd: fix the missing dot in the dns declaration
Fri, Sep 30, 5:22 PM
vsellier updated the task description for T4385: Federate prometheus instances through thanos.
Fri, Sep 30, 3:50 PM · meta-task, System administration, Roadmap 2022
vsellier closed D8577: Disable ping on hosts/ips managed by metallb.
Fri, Sep 30, 9:28 AM
vsellier committed rSPSITEbee3d6dc026c: Disable ping on hosts/ips managed by metallb (authored by vsellier).
Disable ping on hosts/ips managed by metallb
Fri, Sep 30, 9:28 AM

Thu, Sep 29

vsellier requested review of D8577: Disable ping on hosts/ips managed by metallb.
Thu, Sep 29, 10:42 AM
vsellier added a revision to T4534: Evaluate MetalLB as inbound loadbalancer: D8577: Disable ping on hosts/ips managed by metallb.
Thu, Sep 29, 10:42 AM · System administration

Wed, Sep 28

vsellier committed rSKCONF01abd45d6427: argocd: Force the redirect to https for the internal ingress (authored by vsellier).
argocd: Force the redirect to https for the internal ingress
Wed, Sep 28, 7:23 PM
vsellier committed rSKCONF246fb2ffa395: argocd: Delete public ingress (authored by vsellier).
argocd: Delete public ingress
Wed, Sep 28, 7:23 PM
vsellier closed D8567: argocd: Remove public site.
Wed, Sep 28, 7:08 PM
vsellier committed rSPSITEdd8df9301d75: argocd: Remove public site (authored by vsellier).
argocd: Remove public site
Wed, Sep 28, 7:08 PM
vsellier updated the test plan for D8567: argocd: Remove public site.
Wed, Sep 28, 5:48 PM
vsellier updated the diff for D8567: argocd: Remove public site.
  • Add monitoring of the internal service
Wed, Sep 28, 5:47 PM
vsellier requested review of D8567: argocd: Remove public site.
Wed, Sep 28, 5:29 PM
vsellier added a revision to T4461: Move argocd to a private admin url: D8567: argocd: Remove public site.
Wed, Sep 28, 5:29 PM · System administration
vsellier added a comment to T4461: Move argocd to a private admin url.

Argo is now reachable on the internal network at https://argocd.internal.admin.swh.network/ but it uses an self-signed certificated

Wed, Sep 28, 12:28 PM · System administration
vsellier committed rSKCONF3bf578f9b891: Use the official kubernetes nginx ingress controller (authored by vsellier).
Use the official kubernetes nginx ingress controller
Wed, Sep 28, 12:01 PM
vsellier committed rSKCONF4b7b571bba07: argocd: don't try to override the default nginx ingress service (authored by vsellier).
argocd: don't try to override the default nginx ingress service
Wed, Sep 28, 10:51 AM
vsellier closed D8559: argocd: Prepare the configuration to migrate to the internal admin network.
Wed, Sep 28, 10:35 AM
vsellier committed rSPSITE73ae99dafa58: argocd: Prepare the configuration to migrate to the internal admin network (authored by vsellier).
argocd: Prepare the configuration to migrate to the internal admin network
Wed, Sep 28, 10:35 AM
vsellier updated the diff for D8559: argocd: Prepare the configuration to migrate to the internal admin network.

rebase

Wed, Sep 28, 10:29 AM
vsellier committed rSPRE6b8883e1f458: argocd: disable default rancher ingress controller (authored by vsellier).
argocd: disable default rancher ingress controller
Wed, Sep 28, 10:27 AM
vsellier requested review of D8559: argocd: Prepare the configuration to migrate to the internal admin network.
Wed, Sep 28, 10:16 AM
vsellier added a revision to T4461: Move argocd to a private admin url: D8559: argocd: Prepare the configuration to migrate to the internal admin network.
Wed, Sep 28, 10:16 AM · System administration
vsellier added a comment to T4461: Move argocd to a private admin url.

hum I completely forgot we'll have to think how to manage the ssl termination.
As is, there is not cert manager deployed on the cluster and I'm not sure we want to do it.

Wed, Sep 28, 10:09 AM · System administration
vsellier committed rSKCONF311693349c85: argocd: migrate to an internal domain (authored by vsellier).
argocd: migrate to an internal domain
Wed, Sep 28, 9:56 AM
vsellier changed the status of T4461: Move argocd to a private admin url from Open to Work in Progress.

As metallb is configured, the internal domain can used the VIP

Wed, Sep 28, 9:56 AM · System administration
vsellier added a comment to D8551: first add forge now requests added to changelog.

Sure there is (and it must have) nothing secret logged in the task.

Wed, Sep 28, 9:29 AM

Tue, Sep 27

vsellier committed rSKCONF8862c36c5e25: metallb: ignore fields updated during runtime (authored by vsellier).
metallb: ignore fields updated during runtime
Tue, Sep 27, 11:40 PM
vsellier added a comment to T4534: Evaluate MetalLB as inbound loadbalancer.

A new test with a node completely down, it seems it recover after

Tue Sep 27 16:31:59 UTC 2022
60 bytes from 2e:81:20:19:02:4a (192.168.100.119): index=2926 time=1.679 msec
Tue Sep 27 16:32:01 UTC 2022
Timeout
Timeout
Tue Sep 27 16:32:03 UTC 2022
...
Tue Sep 27 16:37:56 UTC 2022
Timeout
Timeout
Tue Sep 27 16:37:58 UTC 2022
60 bytes from 2e:84:a0:44:9e:c9 (192.168.100.119): index=2927 time=814.409 msec
60 bytes from 2e:84:a0:44:9e:c9 (192.168.100.119): index=2928 time=864.574 usec
60 bytes from 2e:84:a0:44:9e:c9 (192.168.100.119): index=2929 time=973.083 msec
60 bytes from 2e:84:a0:44:9e:c9 (192.168.100.119): index=2930 time=32.151 msec
Tue, Sep 27, 6:42 PM · System administration
vsellier added a comment to T4534: Evaluate MetalLB as inbound loadbalancer.

If a node is drained out of the cluster, the rebalancing occurs in ~10s which it's what it's announced in the documentation

Tue Sep 27 16:17:21 UTC 2022
60 bytes from 2e:84:a0:44:9e:c9 (192.168.100.119): index=1985 time=1.710 msec
60 bytes from 2e:84:a0:44:9e:c9 (192.168.100.119): index=1986 time=1.376 msec
Tue Sep 27 16:17:23 UTC 2022
Timeout
Tue Sep 27 16:17:25 UTC 2022
Timeout
Timeout
Tue Sep 27 16:17:27 UTC 2022
Timeout
Timeout
Tue Sep 27 16:17:29 UTC 2022
Timeout
Timeout
Tue Sep 27 16:17:31 UTC 2022
Timeout
Timeout
Tue Sep 27 16:17:33 UTC 2022
Timeout
Timeout
60 bytes from 2e:81:20:19:02:4a (192.168.100.119): index=1987 time=669.150 msec
Tue Sep 27 16:17:35 UTC 2022
60 bytes from 2e:81:20:19:02:4a (192.168.100.119): index=1988 time=959.452 usec
Tue, Sep 27, 6:19 PM · System administration
vsellier added a comment to T4534: Evaluate MetalLB as inbound loadbalancer.

With the ingress controller correctly configured and an ingress declared, everything seems to work correctly:

vsellier@pergamon ~ % cat test-ingress.txt
GET /graphql/ HTTP/1.0
Host: archive.softwareheritage.org
Tue, Sep 27, 6:04 PM · System administration
vsellier committed rSKCONF27bee789d015: production: Ensure the ingress class created by nginx is the default (authored by vsellier).
production: Ensure the ingress class created by nginx is the default
Tue, Sep 27, 5:31 PM
vsellier committed R260:581d22afb81d: production: enabled graphql ingress (authored by vsellier).
production: enabled graphql ingress
Tue, Sep 27, 4:40 PM
vsellier committed R260:f4cb0216d314: production: Add missing configuration entry to deploy graphql (authored by vsellier).
production: Add missing configuration entry to deploy graphql
Tue, Sep 27, 4:37 PM
vsellier committed R260:1f07eaf3cece: production add mandatory values for graphql configuration (authored by vsellier).
production add mandatory values for graphql configuration
Tue, Sep 27, 4:31 PM
vsellier committed R260:5f413a4e1350: production: test ingress / metallb by deploying graphql (authored by vsellier).
production: test ingress / metallb by deploying graphql
Tue, Sep 27, 4:23 PM
vsellier committed rSKCONF9d4c4220af86: production: fix nginx ingress chart version (authored by vsellier).
production: fix nginx ingress chart version
Tue, Sep 27, 4:11 PM
vsellier committed rSKCONF6c40e8fe60e3: production: fix nginx ingress chart version (authored by vsellier).
production: fix nginx ingress chart version
Tue, Sep 27, 3:59 PM
vsellier committed rSKCONF28387e6c1fd9: production-cluster: Deploy the nginx ingress controller (authored by vsellier).
production-cluster: Deploy the nginx ingress controller
Tue, Sep 27, 3:51 PM
vsellier committed rSPREacbc844f9b54: Don't use the default rancher ingress manager (authored by vsellier).
Don't use the default rancher ingress manager
Tue, Sep 27, 3:46 PM
vsellier committed rSPRE833b5359f64f: temporary increase prometheus retention (authored by vsellier).
temporary increase prometheus retention
Tue, Sep 27, 3:22 PM
vsellier committed rSPREea34a207f927: Update storage configuration to match the reality (authored by vsellier).
Update storage configuration to match the reality
Tue, Sep 27, 3:22 PM
vsellier committed R260:3a2dd951b885: Upgrade storage replayers to last available version (authored by vsellier).
Upgrade storage replayers to last available version
Tue, Sep 27, 11:48 AM
vsellier committed R259:ce042e30d7eb: Upgrade storage replayers to use the last swh-model optimisations (authored by vsellier).
Upgrade storage replayers to use the last swh-model optimisations
Tue, Sep 27, 11:41 AM
vsellier committed rSKCONF6ccb84be453b: Declare metallb on production cluster (authored by vsellier).
Declare metallb on production cluster
Tue, Sep 27, 10:43 AM
vsellier accepted D8534: rancher-node-production-worker*: Update nameserver entries with ns0.

Thanks!

Tue, Sep 27, 8:37 AM

Mon, Sep 26

vsellier closed T4541: Deploy git loaders dedicated to load add forge now requests, a subtask of T4523: Dynamic infrastructure, as Resolved.
Mon, Sep 26, 1:54 PM · meta-task, System administration
vsellier closed T4541: Deploy git loaders dedicated to load add forge now requests as Resolved.
Mon, Sep 26, 1:54 PM · Add Forge Now , System administration
vsellier added a comment to D8532: Set rancher-node-production worker to high load profile.

Sure, they should also declare ns0 for more redundancy.

Mon, Sep 26, 12:26 PM

Fri, Sep 23

vsellier committed rSKCONF25a3f0a22c4c: metallb: deploy metallb on the argocd cluster (authored by vsellier).
metallb: deploy metallb on the argocd cluster
Fri, Sep 23, 7:38 PM
vsellier changed the status of T4534: Evaluate MetalLB as inbound loadbalancer, a subtask of T4523: Dynamic infrastructure, from Open to Work in Progress.
Fri, Sep 23, 7:38 PM · meta-task, System administration
vsellier changed the status of T4534: Evaluate MetalLB as inbound loadbalancer from Open to Work in Progress.
Fri, Sep 23, 7:38 PM · System administration
vsellier added a reverting change for rSKCONF0c2373f0112d: Revert "change the argocd ingress name to something more explicit": rSKCONF8797d2440cdd: Revert "Revert "change the argocd ingress name to something more explicit"".
Fri, Sep 23, 6:36 PM
vsellier committed rSKCONF8797d2440cdd: Revert "Revert "change the argocd ingress name to something more explicit"" (authored by vsellier).
Revert "Revert "change the argocd ingress name to something more explicit""
Fri, Sep 23, 6:36 PM
vsellier committed rSKCONF36a731d61b10: Revert "argocd: upgrade to v2.4.12" aka wrong file committed (authored by vsellier).
Revert "argocd: upgrade to v2.4.12" aka wrong file committed
Fri, Sep 23, 6:36 PM