Page MenuHomeSoftware Heritage

System administrationFolder
ActivePublic

Milestones

Members

  • This project does not have any members.
  • View All

Watchers

  • This project does not have any watchers.
  • View All

Details

Description

general system administration tasks, not specific to any product

Recent Activity

Today

vsellier closed T3357: Perform some tests of the cassandra storage on Grid5000 as Resolved.

The slide of the restrospective of the experiment are available at : https://hedgedoc.softwareheritage.org/VOP9qh1MTqm4DjPQfFgNbQ

Thu, Dec 2, 10:10 AM · System administration, Storage manager
vsellier closed T3573: [cassandra] directory and content read benchmarks, a subtask of T3357: Perform some tests of the cassandra storage on Grid5000, as Resolved.
Thu, Dec 2, 10:08 AM · System administration, Storage manager
vsellier closed T3573: [cassandra] directory and content read benchmarks as Resolved.

It was not easy to know if it's a lot of call or long running calls because it's regular sample and we don't have this granularity.

Thu, Dec 2, 10:08 AM · System administration, Storage manager

Yesterday

olasd added a comment to T3032: Fix and/or deploy production listers paper cuts.

Two possible solutions here:

  • use stretch/updates, buster/updates, ... as suite names
  • add specific URL template processing in debian lister implementation

I would go for the second one.

Wed, Dec 1, 5:46 PM · System administration, Lister
zack moved T2096: CNAME for graph service: graph.internal.softwareheritage.org (?) from Backlog to Deployed on the Graph service board.
Wed, Dec 1, 5:00 PM · Graph service, System administration
zack moved T2589: expose swh-graph API at archive.s.o/api/1/graph/ from Backlog to Deployed on the Graph service board.
Wed, Dec 1, 4:36 PM · System administration, Web app, Graph service
zack moved T2900: Public graph/ API does not handle streaming results from endpoints from Backlog to Deployed on the Graph service board.
Wed, Dec 1, 4:36 PM · System administration, Graph service, Web app
zack moved T3624: Update swh-graph from 0.3.0 to 0.5.0 on granet from Backlog to Deployed on the Graph service board.
Wed, Dec 1, 4:34 PM · Graph service, System administration
zack moved T3564: Puppetize graph service and add icinga alert from Backlog to Deployed on the Graph service board.
Wed, Dec 1, 4:34 PM · System administration, Graph service, Puppet recipes

Thu, Nov 25

vsellier added a comment to T3750: Upgrade ELK stack to bullseye.

The upgrade by itself was made with the same command as explained previously.
The shard allocation was disabled during the process to avoid unnecessary movements of shard in the cluster

Thu, Nov 25, 7:19 PM · System administration (Component upgrades)
vsellier added a comment to T3750: Upgrade ELK stack to bullseye.
  1. kibana0
  2. The server was updated using the same procedure used for logstash0
  3. there is no error detected when puppet is running
  4. all the services are correctly started
Thu, Nov 25, 3:52 PM · System administration (Component upgrades)
vsellier added a comment to T3750: Upgrade ELK stack to bullseye.
  1. logstash0 upgrade
Thu, Nov 25, 2:50 PM · System administration (Component upgrades)
vsellier added a revision to T3750: Upgrade ELK stack to bullseye: D6689: logstash: remove unecessary dependency to the openjdk package.
Thu, Nov 25, 12:37 PM · System administration (Component upgrades)
vsellier changed the status of T3750: Upgrade ELK stack to bullseye from Open to Work in Progress.
Thu, Nov 25, 9:53 AM · System administration (Component upgrades)

Wed, Nov 24

vsellier closed T3741: swh-search - upgrade elasticsearch backend as Resolved.
Wed, Nov 24, 6:11 PM · System administration, Archive search
vsellier added a comment to T3741: swh-search - upgrade elasticsearch backend.

production nodes are upgraded :
For each node :

  • disable shard allocation:
cat > /tmp/shard_allocation.json <<EOF
{
  "persistent": {
    "cluster.routing.allocation.enable": "primaries"
  }
}
EOF
Wed, Nov 24, 5:26 PM · System administration, Archive search
vsellier added a revision to T3741: swh-search - upgrade elasticsearch backend: D6685: swh-search: upgrade elasticsearch to 7.15.2.
Wed, Nov 24, 4:24 PM · System administration, Archive search
vsellier added a revision to T3741: swh-search - upgrade elasticsearch backend: D6682: swh-search: Upgrade elasticsearch to 7.15.2.
Wed, Nov 24, 11:53 AM · System administration, Archive search
vsellier added a comment to T3741: swh-search - upgrade elasticsearch backend.

The staging elasticsearch is migrated to 7.15.2, everything looks good.

Wed, Nov 24, 10:39 AM · System administration, Archive search

Tue, Nov 23

vsellier added a revision to T3741: swh-search - upgrade elasticsearch backend: D6677: staging: upgrade swh-search elasticsearch to 7.15.2.
Tue, Nov 23, 7:53 PM · System administration, Archive search

Mon, Nov 22

ardumont renamed T3746: staging: Deploy maven indexer/lister/loader from staging: Deploy maven exporter/lister/loader to staging: Deploy maven indexer/lister/loader.
Mon, Nov 22, 2:38 PM · System administration, Archive coverage
ardumont closed T3745: production: Deploy package loader v1.1, deposit server v0.16, lister v2.3 as Resolved.
Mon, Nov 22, 2:29 PM · System administration, Package Loader, Data Model, Archive content
ardumont moved T3745: production: Deploy package loader v1.1, deposit server v0.16, lister v2.3 from in-progress to deployed/landed on the System administration board.
Mon, Nov 22, 2:05 PM · System administration, Package Loader, Data Model, Archive content
ardumont changed the status of T3745: production: Deploy package loader v1.1, deposit server v0.16, lister v2.3 from Open to Work in Progress.
Mon, Nov 22, 2:05 PM · System administration, Package Loader, Data Model, Archive content
ardumont added a project to T3745: production: Deploy package loader v1.1, deposit server v0.16, lister v2.3: System administration.
Mon, Nov 22, 2:04 PM · System administration, Package Loader, Data Model, Archive content
olasd closed T1844: make archive.s.o point to the Azure-hosted webapp as Wontfix.

We haven't had the capacity of making the azure infrastructure perform as well as the in-house one, to the point where the web frontend wasn't really useable.

Mon, Nov 22, 1:31 PM · System administration
olasd closed T1844: make archive.s.o point to the Azure-hosted webapp, a subtask of T1843: make sure front-end services work when the Inria infra is down, as Wontfix.
Mon, Nov 22, 1:31 PM · System administration
olasd closed T2543: Some messages in the (azure) kafka cluster are too large for rdkafka clients to be able to decompress them as Invalid.

There is no azure kafka cluster anymore...

Mon, Nov 22, 1:16 PM · System administration, Journal
ardumont triaged T3746: staging: Deploy maven indexer/lister/loader as Normal priority.
Mon, Nov 22, 12:01 PM · System administration, Archive coverage

Fri, Nov 19

vsellier triaged T3741: swh-search - upgrade elasticsearch backend as Normal priority.
Fri, Nov 19, 5:01 PM · System administration, Archive search
vsellier added a revision to T3621: Create a production read-only objstorage: D6663: varnish: specify the authentication type in case of 401.
Fri, Nov 19, 2:13 PM · System administration
vsellier updated the task description for T3738: Replace failing disks on db1 and storage1 (before the end of february 2022).
Fri, Nov 19, 11:37 AM · System administration
vsellier added a project to T3738: Replace failing disks on db1 and storage1 (before the end of february 2022): System administration.
Fri, Nov 19, 11:35 AM · System administration

Thu, Nov 18

ardumont moved T3374: Ingest sourceforge repositories (origins of type git, svn, hg) from code-review/monitoring to done on the System administration board.
Thu, Nov 18, 3:17 PM · System administration, Archive coverage, Origin-SourceForge
ardumont moved T3599: List and ingest heptapod instances from code-review/monitoring to done on the System administration board.
Thu, Nov 18, 3:17 PM · System administration, Archive coverage
ardumont moved T3455: Make bitbucket origins ingestion concurrent from code-review/monitoring to done on the System administration board.
Thu, Nov 18, 3:17 PM · System administration, Mercurial loader
ardumont moved T3338: Load the archived bitbucket mercurial repositories from deployed/landed to done on the System administration board.
Thu, Nov 18, 3:16 PM · System administration, Mercurial loader
ardumont moved T3507: prod: vault: Deploy v1.0.0 from deployed/landed to done on the System administration board.
Thu, Nov 18, 3:16 PM · System administration, Vault, Web app
ardumont closed T3717: Ingest opam instance https://coq.inria.fr/opam/released/ as Resolved.
Thu, Nov 18, 3:16 PM · System administration, Archive coverage, Opam
ardumont moved T3717: Ingest opam instance https://coq.inria.fr/opam/released/ from code-review/monitoring to deployed/landed on the System administration board.
Thu, Nov 18, 3:16 PM · System administration, Archive coverage, Opam
ardumont closed T3734: Deploy latest swh.vault v1.3.0 as Resolved.
Thu, Nov 18, 11:40 AM · Vault, System administration

Wed, Nov 17

ardumont moved T3734: Deploy latest swh.vault v1.3.0 from in-progress to deployed/landed on the System administration board.
Wed, Nov 17, 6:52 PM · Vault, System administration
ardumont updated the task description for T3734: Deploy latest swh.vault v1.3.0.
Wed, Nov 17, 6:52 PM · Vault, System administration
ardumont updated the task description for T3734: Deploy latest swh.vault v1.3.0.
Wed, Nov 17, 3:09 PM · Vault, System administration
ardumont updated the task description for T3734: Deploy latest swh.vault v1.3.0.
Wed, Nov 17, 3:08 PM · Vault, System administration
ardumont updated the task description for T3716: Migrate giverny from stretch to buster.
Wed, Nov 17, 3:06 PM · System administration (Component upgrades)
ardumont moved T3717: Ingest opam instance https://coq.inria.fr/opam/released/ from in-progress to code-review/monitoring on the System administration board.
Wed, Nov 17, 3:00 PM · System administration, Archive coverage, Opam
ardumont changed the status of T3734: Deploy latest swh.vault v1.3.0 from Open to Work in Progress.
Wed, Nov 17, 3:00 PM · Vault, System administration
ardumont updated the task description for T3734: Deploy latest swh.vault v1.3.0.
Wed, Nov 17, 3:00 PM · Vault, System administration
ardumont added projects to T3734: Deploy latest swh.vault v1.3.0: System administration, Vault.
Wed, Nov 17, 2:51 PM · Vault, System administration