
Apr 7 2021

vsellier reopened T3165: Generate historical data from the new counters series as "Work in Progress".

Reopening as the release is not working on the stable branch

Apr 7 2021, 5:42 PM · System administration, Monitoring
vsellier changed the status of T3215: Deploy the new counters in staging from Open to Work in Progress.
Apr 7 2021, 5:14 PM · System administration, Monitoring, Web app
vsellier committed rDCNT4d940936d8f2: Use an intermediate temporary file to generate the historical data (authored by vsellier).
Use an intermediate temporary file to generate the historical data
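A common reason for going through a temporary file is that a reader never sees a half-written result. The commit itself is not shown in this feed, so the following is only a minimal sketch of that general pattern; the function name and the JSON payload are hypothetical.

```python
import json
import os
import tempfile


def write_history_atomically(data: dict, destination: str) -> None:
    """Write `data` as JSON to `destination` through a temporary file."""
    directory = os.path.dirname(os.path.abspath(destination))
    # The temporary file lives next to the destination so that the final
    # rename stays on the same filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as tmp_file:
            json.dump(data, tmp_file)
        # Swap the finished file into place in a single step.
        os.replace(tmp_path, destination)
    except BaseException:
        # Do not leave a partial temporary file behind on failure.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

With this shape, a consumer reading the destination path gets either the previous complete file or the new complete file.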
Apr 7 2021, 5:14 PM
vsellier closed T3165: Generate historical data from the new counters series as Resolved.
Apr 7 2021, 5:13 PM · System administration, Monitoring
vsellier closed T3165: Generate historical data from the new counters series, a subtask of T2912: Next generation archive counters, as Resolved.
Apr 7 2021, 5:13 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed D5444: Use an intermediate temporary file to generate the historical data.
Apr 7 2021, 5:05 PM
vsellier committed rDCNTf2fba43f087e: Use an intermediate temporary file to generate the historical data (authored by vsellier).
Use an intermediate temporary file to generate the historical data
Apr 7 2021, 5:05 PM
vsellier requested review of D5444: Use an intermediate temporary file to generate the historical data.
Apr 7 2021, 4:53 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5444: Use an intermediate temporary file to generate the historical data.
Apr 7 2021, 4:52 PM · System administration, Monitoring
vsellier closed D5442: Allow the webapp to retrieve the history file via a GET endpoint.
Apr 7 2021, 4:50 PM
vsellier committed rDCNT42157fd04885: Allow the webapp to retrieve the history file via a GET endpoint (authored by vsellier).
Allow the webapp to retrieve the history file via a GET endpoint
Apr 7 2021, 4:50 PM
vsellier updated the diff for D5442: Allow the webapp to retrieve the history file via a GET endpoint.

Register the history endpoint only if the configuration is present

Apr 7 2021, 4:47 PM
vsellier updated the diff for D5442: Allow the webapp to retrieve the history file via a GET endpoint.
  • Return a 404 if the requested file does not exist
  • Use a fixture to configure the tests to make them easier to read
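The 404 behaviour from the first bullet can be sketched independently of any web framework. This is an illustration only, not the code of D5442: the function name, the (status, body) return shape and the path-escape check are assumptions added here.

```python
from pathlib import Path
from typing import Tuple


def get_history_response(history_dir: str, filename: str) -> Tuple[int, bytes]:
    """Return an (HTTP status, body) pair for a requested history file."""
    base = Path(history_dir).resolve()
    target = (base / filename).resolve()
    # Answer 404 when the file is missing, and also when the requested
    # name would escape the configured directory (extra guard, assumed).
    if base not in target.parents or not target.is_file():
        return 404, b"file not found"
    return 200, target.read_bytes()
```

A route handler would then only translate the returned pair into the framework's response object.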
Apr 7 2021, 4:43 PM
vsellier added inline comments to D5442: Allow the webapp to retrieve the history file via a GET endpoint.
Apr 7 2021, 3:39 PM
vsellier requested review of D5442: Allow the webapp to retrieve the history file via a GET endpoint.
Apr 7 2021, 3:11 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5442: Allow the webapp to retrieve the history file via a GET endpoint.
Apr 7 2021, 3:10 PM · System administration, Monitoring
vsellier committed rDCNT9a0ae605748b: add python3-requests-mock dependency (authored by vsellier).
add python3-requests-mock dependency
Apr 7 2021, 12:40 PM
vsellier closed D5429: Manage and expose the historical data.
Apr 7 2021, 12:13 PM
vsellier committed rDCNT1b262c37b768: Manage and expose the historical data (authored by vsellier).
Manage and expose the historical data
Apr 7 2021, 12:13 PM
vsellier updated the diff for D5429: Manage and expose the historical data.

Update according to the review feedback

Apr 7 2021, 12:07 PM
vsellier committed rDSNIPeac990efacb0: Add staging environment diagram (authored by vsellier).
Add staging environment diagram
Apr 7 2021, 11:27 AM
vsellier updated the diff for D5429: Manage and expose the historical data.

fix typos on the commit message

Apr 7 2021, 10:10 AM

Apr 6 2021

vsellier updated the diff for D5429: Manage and expose the historical data.

fix another typo

Apr 6 2021, 7:40 PM
vsellier requested review of D5429: Manage and expose the historical data.
Apr 6 2021, 7:31 PM
vsellier closed T3211: staging: git loader can't load repositories hosted by github.com as Resolved.

The requested change was made by the DSI. Everything is working well now.

Apr 6 2021, 7:11 PM · System administration
vsellier closed D5428: Allow to use several backends with a RPCServerApp.
Apr 6 2021, 7:05 PM
vsellier committed rDCOREcbc9b86b2336: Allow to use several backends with a RPCServerApp (authored by vsellier).
Allow to use several backends with a RPCServerApp
Apr 6 2021, 7:05 PM
vsellier added a comment to D5428: Allow to use several backends with a RPCServerApp.

Why not a single backend class for both?

Because it is not the same backend implementation, so it does not cover the same concern/scope, as mentioned in the diff description.

Apr 6 2021, 5:03 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5429: Manage and expose the historical data.
Apr 6 2021, 4:25 PM · System administration, Monitoring
vsellier requested review of D5428: Allow to use several backends with a RPCServerApp.
Apr 6 2021, 4:11 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5428: Allow to use several backends with a RPCServerApp.
Apr 6 2021, 4:09 PM · System administration, Monitoring
vsellier added a comment to T3211: staging: git loader can't load repositories hosted by github.com.

The wrong network profile was requested for the staging gateway, so it seems it does not have full access to the internet.
A mail was sent to the DSI to request unfiltered access.

Apr 6 2021, 3:19 PM · System administration
vsellier changed the status of T3211: staging: git loader can't load repositories hosted by github.com from Open to Work in Progress.
Apr 6 2021, 2:01 PM · System administration

Apr 2 2021

vsellier triaged T3203: docs: Document the firewall installation and procedures as Normal priority.
Apr 2 2021, 10:44 AM · Documentation, System administration
vsellier renamed T3194: Upgrade opnsense firewalls from 20.7.4 to 21.1.4 from Upgrade opensense firewalls from 20.7.4 to 21.1.4 to Upgrade opnsense firewalls from 20.7.4 to 21.1.4.
Apr 2 2021, 10:41 AM · System administration
vsellier closed T3194: Upgrade opnsense firewalls from 20.7.4 to 21.1.4 as Resolved.
Apr 2 2021, 10:34 AM · System administration
vsellier added a comment to T3194: Upgrade opnsense firewalls from 20.7.4 to 21.1.4.

After solving the problems, the upgrade was pretty smooth. The firewall performs the following steps:

  • upgrade to the last minor version of the current major branch
  • upgrade to the first minor version of the next major branch
  • upgrade to the last minor version of that new major branch
Apr 2 2021, 10:29 AM · System administration
vsellier accepted D5406: prod/webapp: Deploy new production_db configuration.

lgtm

Apr 2 2021, 10:17 AM
vsellier added a comment to T3194: Upgrade opnsense firewalls from 20.7.4 to 21.1.4.

Before starting the upgrade, we discovered 2 problems we had to fix:

  1. The backup firewall had no access to the internet, which blocked the upgrade
  2. The master/backup switch was not working for 4 of the 8 VIPs
Apr 2 2021, 10:07 AM · System administration

Apr 1 2021

vsellier changed the status of T3194: Upgrade opnsense firewalls from 20.7.4 to 21.1.4 from Open to Work in Progress.
Apr 1 2021, 2:35 PM · System administration
vsellier closed T3190: counters: Error during directory topic ingestion, a subtask of T2912: Next generation archive counters, as Resolved.
Apr 1 2021, 2:18 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3190: counters: Error during directory topic ingestion as Resolved.
Apr 1 2021, 2:18 PM · System administration, Monitoring
vsellier closed D5399: counters: allow to consume big messages of the directory topic.
Apr 1 2021, 12:54 PM
vsellier committed rSPSITE2c2e7ed2403f: counters: allow to consume big messages of the directory topic (authored by vsellier).
counters: allow to consume big messages of the directory topic
Apr 1 2021, 12:54 PM
vsellier added a comment to D5398: postgresql/client: Fix redundant user entry setup.

Isn't the user used for the creation of the pgpass file?

Apr 1 2021, 12:49 PM
vsellier committed rSENV5bcf7c1cc9b3: Update octocatalog-diff facts (authored by vsellier).
Update octocatalog-diff facts
Apr 1 2021, 12:37 PM
vsellier requested review of D5399: counters: allow to consume big messages of the directory topic.
Apr 1 2021, 12:37 PM
vsellier added a revision to T3190: counters: Error during directory topic ingestion: D5399: counters: allow to consume big messages of the directory topic.
Apr 1 2021, 12:37 PM · System administration, Monitoring
vsellier added a comment to T3190: counters: Error during directory topic ingestion.

An improvement of the journal client is necessary to add support for this configuration, as for the producer:

Do you need such an improvement though? According to the code you linked, you could pass a
producer_config dict with that key and value.

Apr 1 2021, 12:31 PM · System administration, Monitoring
vsellier added a comment to T3191: journal-client: Add support of max message size configuration.

The journal client supports dynamic configuration via kwargs, so there is no need to improve it.
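As an illustration of what configuration via kwargs can look like, here is a minimal sketch. The class below is a hypothetical stand-in and not the real swh.journal client; the broker address and group id are made up. Only the message.max.bytes key and the 500 * 1024 * 1024 value come from the T3190 discussion.

```python
class SketchJournalClient:
    """Hypothetical stand-in, NOT the real swh.journal client class."""

    def __init__(self, brokers, group_id, **kwargs):
        # Base settings that this sketch always sets itself.
        self.consumer_config = {
            "bootstrap.servers": ",".join(brokers),
            "group.id": group_id,
        }
        # Every extra keyword argument is forwarded unchanged to the
        # consumer configuration, so new settings need no code change.
        self.consumer_config.update(kwargs)


# Keys containing dots cannot be written as plain keyword arguments,
# hence the ** unpacking of a dict.
client = SketchJournalClient(
    ["kafka1.example.org:9092"],  # made-up broker address
    "swh-counters-sketch",  # made-up consumer group
    **{"message.max.bytes": 500 * 1024 * 1024},
)
```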

Apr 1 2021, 12:11 PM · Journal
vsellier closed T3191: journal-client: Add support of max message size configuration as Invalid.
Apr 1 2021, 12:11 PM · Journal
vsellier changed the status of T3191: journal-client: Add support of max message size configuration from Open to Work in Progress.
Apr 1 2021, 11:52 AM · Journal
vsellier added a comment to T3190: counters: Error during directory topic ingestion.

It seems the problem is not present anymore with a higher max message size ('500 * 1024 * 1024').

Apr 1 2021, 11:35 AM · System administration, Monitoring
vsellier added a comment to T3190: counters: Error during directory topic ingestion.

For the record, increasing the property message.max.bytes to 100 * 1024 * 1024 in the consumer configuration does not solve the problem.

Apr 1 2021, 10:32 AM · System administration, Monitoring
vsellier added a comment to T3190: counters: Error during directory topic ingestion.

The same problem occurred during the PoC; these messages were ignored by using the consumer configuration "errors.tolerance": 'all' [1].
I will try to find out whether there is a more elegant way to deal with this issue ;)

Apr 1 2021, 10:03 AM · System administration, Monitoring
vsellier updated the task description for T3190: counters: Error during directory topic ingestion.
Apr 1 2021, 9:46 AM · System administration, Monitoring
vsellier changed the status of T3190: counters: Error during directory topic ingestion from Open to Work in Progress.
Apr 1 2021, 9:38 AM · System administration, Monitoring

Mar 31 2021

vsellier updated subscribers of T3041: [production] Provision enough space for the search ES cluster to ingest all intrinsic metadata.

After talking with @rdicosmo, we finally chose to replace, on each server, the 4 HDDs of 2.4 TB with 6 SSDs of 1.9 TB, to be sure we will have good performance and enough space for the future.
The quote will now be sent to the purchasing service according to the usual procedure [1]
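For reference, the raw capacity behind this change can be worked out from the figures above and the 3 servers of the Mar 30 quotation. This is a back-of-the-envelope sketch only: it uses decimal units and ignores RAID, filesystem overhead and Elasticsearch replicas.

```python
SERVERS = 3  # from the Mar 30 quotation

# Raw capacity per server, in GB (decimal units)
hdd_per_server_gb = 4 * 2400  # 4 HDDs of 2.4 TB
ssd_per_server_gb = 6 * 1900  # 6 SSDs of 1.9 TB

# Raw capacity for the whole cluster, in GB
hdd_cluster_gb = SERVERS * hdd_per_server_gb
ssd_cluster_gb = SERVERS * ssd_per_server_gb

print(f"HDD: {hdd_per_server_gb} GB per server, {hdd_cluster_gb} GB in total")
print(f"SSD: {ssd_per_server_gb} GB per server, {ssd_cluster_gb} GB in total")
```

That is 9.6 TB against 11.4 TB per server, or 28.8 TB against 34.2 TB across the three servers.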

Mar 31 2021, 3:21 PM · System administration, Archive search

Mar 30 2021

vsellier added a project to T3143: Migrate revision metadata to extid in the storage: System administration.
Mar 30 2021, 5:26 PM · System administration, Storage manager, Core Loader
vsellier added a comment to T3041: [production] Provision enough space for the search ES cluster to ingest all intrinsic metadata.

Final quotation sent for approval.
The details are:
3 PowerEdge R6515 (1u) with per server:

  • 10 disks enclosure
  • BOSS controller with 2 240 GB cards (for system)
  • 4 SAS 2.5" 10k 2.4 TB disks
  • SFP+ network card
  • 2 SFP cables
  • 2 power supplies with their cables
  • IDRac enterprise
  • Rack mount rails with cable management
Mar 30 2021, 2:53 PM · System administration, Archive search
vsellier accepted D5383: sys-info: Rework the how-to deploy to add details on architecture.

lgtm

Mar 30 2021, 2:14 PM
vsellier closed T3188: staging/journal: create douardda credentials as Resolved.

credentials sent by PM

Mar 30 2021, 12:52 PM · System administration
vsellier updated the task description for T3188: staging/journal: create douardda credentials.
Mar 30 2021, 12:44 PM · System administration
vsellier added a comment to T3188: staging/journal: create douardda credentials.
  • unprivileged user :
username=swh-douardda
password=XXXXX
Mar 30 2021, 12:43 PM · System administration
vsellier changed the status of T3188: staging/journal: create douardda credentials from Open to Work in Progress.
Mar 30 2021, 12:19 PM · System administration
vsellier triaged T3188: staging/journal: create douardda credentials as Normal priority.
Mar 30 2021, 12:18 PM · System administration
vsellier closed D5377: network: Remove unecessary route between internal network and VLAN1300.
Mar 30 2021, 10:00 AM
vsellier committed rSPSITE6935f3532507: network: Remove unecessary route between internal network and VLAN1300 (authored by vsellier).
network: Remove unecessary route between internal network and VLAN1300
Mar 30 2021, 10:00 AM
vsellier updated the diff for D5377: network: Remove unecessary route between internal network and VLAN1300.

rebase

Mar 30 2021, 9:59 AM

Mar 29 2021

vsellier requested review of D5377: network: Remove unecessary route between internal network and VLAN1300.
Mar 29 2021, 5:58 PM

Mar 26 2021

vsellier committed rDSNIP64291f923a5a: explicit a couple of relations (authored by vsellier).
explicit a couple of relations
Mar 26 2021, 2:37 PM
vsellier added a comment to T3165: Generate historical data from the new counters series.

The final counters architecture looks like this with this improvement:

Mar 26 2021, 12:38 PM · System administration, Monitoring
vsellier committed rDSNIPcf77331a0f8f: Upgrade counters architecture to handle the historical data management (authored by vsellier).
Upgrade counters architecture to handle the historical data management
Mar 26 2021, 12:33 PM
vsellier added a comment to T3165: Generate historical data from the new counters series.

An improvement idea came to me during the refactoring: the script can be split and integrated into the 'swh-counters' codebase.

Mar 26 2021, 11:40 AM · System administration, Monitoring
vsellier accepted D5343: docs/sys-info: Update deployment documentation.

lgtm

Mar 26 2021, 10:28 AM
vsellier added inline comments to D5343: docs/sys-info: Update deployment documentation.
Mar 26 2021, 10:23 AM
vsellier accepted D5342: docs/sys-info: Update information and rework sentence phrasing.

lgtm

Mar 26 2021, 10:17 AM

Mar 25 2021

vsellier added inline comments to D5342: docs/sys-info: Update information and rework sentence phrasing.
Mar 25 2021, 6:52 PM
vsellier requested changes to D5342: docs/sys-info: Update information and rework sentence phrasing.
Mar 25 2021, 6:51 PM
vsellier accepted D5341: docs: Unify doc and git READMEs.

LGTM

Mar 25 2021, 5:58 PM
vsellier closed T3175: Prepare production environment as Resolved.

Node counters1.internal.softwareheritage.org was deployed by terraform. The inventory section was created accordingly [1].
The journal_client is running.

Mar 25 2021, 5:36 PM · Roadmap 2021, System administration, Monitoring
vsellier closed T3175: Prepare production environment, a subtask of T2912: Next generation archive counters, as Resolved.
Mar 25 2021, 5:36 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier committed rSPRE17a1de991af9: production: add counters1 node (authored by vsellier).
production: add counters1 node
Mar 25 2021, 5:34 PM
vsellier closed D5338: counters: Declare production node.
Mar 25 2021, 3:57 PM
vsellier committed rSPSITE83ab9664ca65: counters: Declare production node (authored by vsellier).
counters: Declare production node
Mar 25 2021, 3:57 PM
vsellier requested review of D5338: counters: Declare production node.
Mar 25 2021, 3:33 PM
vsellier added a revision to T3175: Prepare production environment: D5338: counters: Declare production node.
Mar 25 2021, 3:33 PM · Roadmap 2021, System administration, Monitoring
vsellier committed rSENV02e36ac6de2b: vagrant: add prod-counters1 vm (authored by vsellier).
vagrant: add prod-counters1 vm
Mar 25 2021, 3:31 PM
vsellier changed the status of T3175: Prepare production environment from Open to Work in Progress.
Mar 25 2021, 2:52 PM · Roadmap 2021, System administration, Monitoring
vsellier changed the status of T3165: Generate historical data from the new counters series, a subtask of T2912: Next generation archive counters, from Open to Work in Progress.
Mar 25 2021, 2:38 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier changed the status of T3165: Generate historical data from the new counters series from Open to Work in Progress.
Mar 25 2021, 2:38 PM · System administration, Monitoring
vsellier closed T3164: Expose counters in prometheus format as Resolved.
Mar 25 2021, 2:38 PM · System administration, Monitoring
vsellier closed T3164: Expose counters in prometheus format, a subtask of T2912: Next generation archive counters, as Resolved.
Mar 25 2021, 2:38 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier accepted D5334: README: Update description.
Mar 25 2021, 2:16 PM
vsellier closed D5332: counters: count objects from more topics.
Mar 25 2021, 12:30 PM
vsellier committed rSPSITE9558e50ce902: counters: count objects from more topics (authored by vsellier).
counters: count objects from more topics
Mar 25 2021, 12:30 PM
vsellier added a comment to D5332: counters: count objects from more topics.

:)
thanks

Mar 25 2021, 12:29 PM
vsellier added a comment to T3164: Expose counters in prometheus format.

The counters are now exposed through a /metrics endpoint and ingested by Prometheus.
They are tagged per environment, so we will be able to isolate the counters for each one.
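As a rough illustration of per-environment tagging, the sketch below renders counters in the Prometheus text exposition format with an environment label. The metric name, label names and values are hypothetical, not the ones served by swh-counters, and label values are assumed to contain no quotes or backslashes.

```python
def render_metrics(counters: dict, environment: str) -> str:
    """Render counters in the Prometheus text exposition format."""
    # Metric name and label names are made up for this sketch.
    lines = [
        "# HELP swh_archive_object_total Number of objects counted per type",
        "# TYPE swh_archive_object_total gauge",
    ]
    for object_type, value in sorted(counters.items()):
        # One sample per object type, labelled with the environment so
        # that queries can filter on it.
        lines.append(
            f'swh_archive_object_total{{environment="{environment}",'
            f'object_type="{object_type}"}} {value}'
        )
    return "\n".join(lines) + "\n"


print(render_metrics({"revision": 2, "content": 5}, "staging"), end="")
```

On the Prometheus side, a label matcher such as environment="staging" is then enough to isolate one environment.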

Mar 25 2021, 12:27 PM · System administration, Monitoring
vsellier added a revision to T3164: Expose counters in prometheus format: D5332: counters: count objects from more topics.
Mar 25 2021, 12:14 PM · System administration, Monitoring
vsellier requested review of D5332: counters: count objects from more topics.
Mar 25 2021, 12:14 PM