Apr 8 2021

vsellier added a comment to T3219: No logs are ingested on elasticsearch since 2021-03-26.

The cluster is configured with the default value for cluster.max_shards_per_node [1], so it can have at most 3000 open shards (1000 per node * 3 nodes).
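
The arithmetic behind that limit can be sketched as follows. This is only an illustration: the value 1000 is the Elasticsearch default for the setting, the node count of 3 is read off the "1000*3" above, and the open shard count is hard-coded to the figure reported in the logstash error for this task rather than queried from the cluster.

```shell
# Assumptions: 1000 is the Elasticsearch default for cluster.max_shards_per_node,
# and 3 is the number of data nodes implied by the comment above.
max_shards_per_node=1000
data_nodes=3
shard_limit=$((max_shards_per_node * data_nodes))

# open_shards would normally come from the cluster, for example:
#   curl -s "http://$ES_NODE/_cluster/health?filter_path=active_shards"
# Here it is hard-coded to the value reported in the logstash error.
open_shards=3000

remaining=$((shard_limit - open_shards))
echo "limit=${shard_limit} open=${open_shards} remaining=${remaining}"
```

With no headroom left, any new daily index (which needs 2 shards according to the error message) is rejected.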

Apr 8 2021, 4:07 PM · System administrators
vsellier added a comment to T3219: No logs are ingested on elasticsearch since 2021-03-26.

I temporarily unblocked the ingestion by closing the systemlogs indices created before 2020-07-01:

curl -s http://$ES_NODE/_cat/indices\?s=index | grep -v close | grep systemlogs | awk '{print $3}' | grep 2020.05 | xargs -n1 -t -i{} curl -XPOST http://${ES_NODE}/{}/_close 
curl -s http://$ES_NODE/_cat/indices\?s=index | grep -v close | grep systemlogs | awk '{print $3}' | grep 2020.06 | xargs -n1 -t -i{} curl -XPOST http://${ES_NODE}/{}/_close
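
A possible variant of the selection step, shown here as a sketch rather than what was actually run: asking _cat/indices for explicit columns (h=index,status) avoids depending on the column position, which shifts for closed indices. The function name and the sample data below are hypothetical and stand in for the real API output.

```shell
# Read "index status" pairs on stdin and print the open systemlogs indices
# whose name starts with the given month prefix (for example 2020.05).
open_systemlogs_for_month() {
  awk -v prefix="systemlogs-$1" '$2 == "open" && index($1, prefix) == 1 { print $1 }'
}

# Hypothetical sample standing in for:
#   curl -s "http://$ES_NODE/_cat/indices?h=index,status&s=index"
sample='systemlogs-2020.05.30 open
systemlogs-2020.05.31 close
systemlogs-2020.06.01 open
otherlogs-2020.05.31 open'

selected=$(printf '%s\n' "$sample" | open_systemlogs_for_month 2020.05)
echo "$selected"
```

The selected names could then be piped to xargs -n1 -I{} curl -XPOST "http://${ES_NODE}/{}/_close", as in the commands above.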
Apr 8 2021, 12:43 PM · System administrators
vsellier added a comment to T3219: No logs are ingested on elasticsearch since 2021-03-26.

Interesting reading: https://www.elastic.co/blog/how-many-shards-should-i-have-in-my-elasticsearch-cluster

Apr 8 2021, 12:34 PM · System administrators
vsellier added a comment to T3219: No logs are ingested on elasticsearch since 2021-03-26.

It seems we have reached a limit on the cluster (from the logstash logs):

Apr 08 10:30:24 logstash0 logstash[1605158]: [2021-04-08T10:30:24,052][WARN ][logstash.outputs.elasticsearch][main][62d11c4234b8981da77a97955da92ac9de92b9a6dcd4582f407face31fd5c664] Could not index event to Elasticsearch. {:status=>400, :action=>["index", {:_id=>nil, :_index=>"systemlogs-2021.04.08", :routing=>nil, :_type=>"_doc"}, #<LogStash::Event:0x2ec8df34>], :response=>{"index"=>{"_index"=>"systemlogs-2021.04.08", "_type"=>"_doc", "_id"=>nil, "status"=>400, "error"=>{"type"=>"validation_exception", "reason"=>"Validation Failed: 1: this action would add [2] total shards, but this cluster currently has [3000]/[3000] maximum shards open;"}}}}
Apr 08 10:30:24 logstash0 logstash[1605158]: [2021-04-08T10:30:24,052][WARN ][logstash.outputs.elasticsearch][main][62d11c4234b8981da77a97955da92ac9de92b9a6dcd4582f407face31fd5c664] Could not index event to Elasticsearch. {:status=>400, :action=>["index", {:_id=>nil, :_index=>"systemlogs-2021.04.08", :routing=>nil, :_type=>"_doc"}, #<LogStash::Event:0x26cfcf58>], :response=>{"index"=>{"_index"=>"systemlogs-2021.04.08", "_type"=>"_doc", "_id"=>nil, "status"=>400, "error"=>{"type"=>"validation_exception", "reason"=>"Validation Failed: 1: this action would add [2] total shards, but this cluster currently has [3000]/[3000] maximum shards open;"}}}}
Apr 08 10:30:24 logstash0 logstash[1605158]: [2021-04-08T10:30:24,053][WARN ][logstash.outputs.elasticsearch][main][62d11c4234b8981da77a97955da92ac9de92b9a6dcd4582f407face31fd5c664] Could not index event to Elasticsearch. {:status=>400, :action=>["index", {:_id=>nil, :_index=>"systemlogs-2021.04.08", :routing=>nil, :_type=>"_doc"}, #<LogStash::Event:0x1ddd876d>], :response=>{"index"=>{"_index"=>"systemlogs-2021.04.08", "_type"=>"_doc", "_id"=>nil, "status"=>400, "error"=>{"type"=>"validation_exception", "reason"=>"Validation Failed: 1: this action would add [2] total shards, but this cluster currently has [3000]/[3000] maximum shards open;"}}}}
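
As a rough sketch, the current and maximum shard counts can be pulled out of such a log line with sed. The line below is a shortened copy of the "reason" field from the errors above, and the variable names are only illustrative.

```shell
# Shortened copy of the "reason" field from the logstash errors above.
line='Validation Failed: 1: this action would add [2] total shards, but this cluster currently has [3000]/[3000] maximum shards open;'

# Extract "<current> <maximum>" from the "[current]/[maximum]" part of the message.
usage=$(printf '%s\n' "$line" |
  sed -n 's/.*currently has \[\([0-9]*\)\]\/\[\([0-9]*\)\] maximum shards open.*/\1 \2/p')

current=${usage% *}
maximum=${usage#* }
echo "current=${current} maximum=${maximum}"
```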
Apr 8 2021, 12:31 PM · System administrators
vsellier changed the status of T3219: No logs are ingested on elasticsearch since 2021-03-26 from Open to Work in Progress.
Apr 8 2021, 12:29 PM · System administrators
vsellier closed D5447: attempt to fix the stable debian build.
Apr 8 2021, 12:19 PM
vsellier committed rDCNTdd8ad46f5f38: attempt to fix the stable debian build (authored by vsellier).
attempt to fix the stable debian build
Apr 8 2021, 12:19 PM
vsellier added a comment to D5447: attempt to fix the stable debian build.

Thanks for the tip to add the option --build-failed-commands %SBUILD_SHELL to the gbp command, which gives a shell in the chroot environment when a build fails.

Apr 8 2021, 11:32 AM
vsellier requested review of D5447: attempt to fix the stable debian build.
Apr 8 2021, 9:08 AM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5447: attempt to fix the stable debian build.
Apr 8 2021, 9:07 AM · System administration, Monitoring

Apr 7 2021

vsellier committed rDENV0df57e50f498: declare the counters stack (authored by vsellier).
declare the counters stack
Apr 7 2021, 8:05 PM
vsellier committed rDENV95507e1dbaca: adapt limits (authored by vsellier).
adapt limits
Apr 7 2021, 8:05 PM
vsellier committed rDENVe5027034a498: add missing parts for counters (authored by vsellier).
add missing parts for counters
Apr 7 2021, 8:05 PM
vsellier committed rDENVfd44cd7937cf: add scheduler journal client (authored by vsellier).
add scheduler journal client
Apr 7 2021, 8:05 PM
vsellier committed rDENV54a19c6f982c: Add search journal client (authored by vsellier).
Add search journal client
Apr 7 2021, 8:05 PM
vsellier committed rDENV2d99f729009f: Add missing dockerfile for the lister's image (authored by vsellier).
Add missing dockerfile for the lister's image
Apr 7 2021, 8:05 PM
vsellier committed rDENV09777c5212e9: add listers (authored by vsellier).
add listers
Apr 7 2021, 8:05 PM
vsellier committed rDENV2e864a6edb82: reorder skaffold configuration (authored by vsellier).
reorder skaffold configuration
Apr 7 2021, 8:05 PM
vsellier committed rDENV75160ff7fa6a: Increase storage limit to avoid OOM (authored by vsellier).
Increase storage limit to avoid OOM
Apr 7 2021, 8:05 PM
vsellier committed rDENVa90a68405482: add missing point on the documentation (authored by vsellier).
add missing point on the documentation
Apr 7 2021, 8:05 PM
vsellier committed rDENVe168a21730b9: configure the webapp to use postgresql (authored by vsellier).
configure the webapp to use postgresql
Apr 7 2021, 8:05 PM
vsellier committed rDENV947a640807c9: Improve elasticsearch startup (authored by vsellier).
Improve elasticsearch startup
Apr 7 2021, 8:05 PM
vsellier committed rDENV114597b9d02d: Adjust limits (authored by vsellier).
Adjust limits
Apr 7 2021, 8:05 PM
vsellier committed rDENV973aff5c4cfa: Drop scheduler-runner verbosity to INFO (authored by ardumont).
Drop scheduler-runner verbosity to INFO
Apr 7 2021, 8:05 PM
vsellier committed rDENVe1c35bc787ce: rabbitmq: Expose the rabbitmq manager port (authored by ardumont).
rabbitmq: Expose the rabbitmq manager port
Apr 7 2021, 8:05 PM
vsellier committed rDENVae39d7cd26df: Add swh-scheduler-listener service (authored by ardumont).
Add swh-scheduler-listener service
Apr 7 2021, 8:05 PM
vsellier committed rDENVc47f116df85d: Add swh search service (authored by ardumont).
Add swh search service
Apr 7 2021, 8:05 PM
vsellier committed rDENV703ffb8aa631: fix postgresql persistence (authored by vsellier).
fix postgresql persistence
Apr 7 2021, 8:05 PM
vsellier committed rDENVa608596b0bfd: Add prometheus and grafana (authored by vsellier).
Add prometheus and grafana
Apr 7 2021, 8:05 PM
vsellier committed rDENV664fe6567c65: Add elasticsearch service (authored by ardumont).
Add elasticsearch service
Apr 7 2021, 8:05 PM
vsellier committed rDENV11aac37e27dd: Add scheduler-runner service (authored by ardumont).
Add scheduler-runner service
Apr 7 2021, 8:05 PM
vsellier committed rDENV78389fd9dab3: add directory initialisation task (authored by vsellier).
add directory initialisation task
Apr 7 2021, 8:05 PM
vsellier committed rDENVd0d5bb82e592: allow kafka to start without being killed by an oom (authored by vsellier).
allow kafka to start without being killed by an oom
Apr 7 2021, 8:05 PM
vsellier committed rDENVd7a132d1954a: webapp: configure rolling upgrade (authored by vsellier).
webapp: configure rolling upgrade
Apr 7 2021, 8:05 PM
vsellier committed rDENV56efa6737c5c: persist zookeeper data to fix kafka restart (authored by vsellier).
persist zookeeper data to fix kafka restart
Apr 7 2021, 8:05 PM
vsellier committed rDENV760233376e19: webapp: correctly install source via pip (authored by vsellier).
webapp: correctly install source via pip
Apr 7 2021, 8:05 PM
vsellier committed rDENVf99c98b69fcd: Make webapp use an entrypoint and update to use postgres backend (authored by ardumont).
Make webapp use an entrypoint and update to use postgres backend
Apr 7 2021, 8:05 PM
vsellier committed rDENV958fac8529c6: objstorage: correctly install source via pip (authored by vsellier).
objstorage: correctly install source via pip
Apr 7 2021, 8:05 PM
vsellier committed rDENVcff6232837ff: Dockerfile: Clean up whitespaces (authored by ardumont).
Dockerfile: Clean up whitespaces
Apr 7 2021, 8:05 PM
vsellier committed rDENVbc7cbcdd5584: loaders: Decrease loaders verbosity (authored by ardumont).
loaders: Decrease loaders verbosity
Apr 7 2021, 8:05 PM
vsellier committed rDENVf5502cdeba20: add service declaration on the /etc/hosts (authored by vsellier).
add service declaration on the /etc/hosts
Apr 7 2021, 8:05 PM
vsellier committed rDENVab93ade7349f: declare the webapp (authored by vsellier).
declare the webapp
Apr 7 2021, 8:05 PM
vsellier committed rDENV4052c66c0c98: add util scripts on the base image (authored by vsellier).
add util scripts on the base image
Apr 7 2021, 8:04 PM
vsellier committed rDENV719c7130f512: cleanup base image (authored by vsellier).
cleanup base image
Apr 7 2021, 8:04 PM
vsellier committed rDENV3ae31f4ae4d9: deploy kafka (authored by vsellier).
deploy kafka
Apr 7 2021, 8:04 PM
vsellier committed rDENV32d3998c231f: Add rabbitmq service (authored by ardumont).
Add rabbitmq service
Apr 7 2021, 8:04 PM
vsellier committed rDENV146ea3db4074: README: Fix and unify host mount points creation (authored by ardumont).
README: Fix and unify host mount points creation
Apr 7 2021, 8:04 PM
vsellier committed rDENV472b6401eb87: Reference how to build the objstorage (authored by ardumont).
Reference how to build the objstorage
Apr 7 2021, 8:04 PM
vsellier committed rDENVfb8056da5d91: Add scheduler service (authored by ardumont).
Add scheduler service
Apr 7 2021, 8:04 PM
vsellier committed rDENVe2ac9999ba12: Readme: Update reference names (authored by ardumont).
Readme: Update reference names
Apr 7 2021, 8:04 PM
vsellier committed rDENVbe0ed90b044e: Cleanup whitespace noises (authored by ardumont).
Cleanup whitespace noises
Apr 7 2021, 8:04 PM
vsellier committed rDENV38b16b21c11c: docker: stop ignoring .git directory to allow pip to perform local install (authored by vsellier).
docker: stop ignoring .git directory to allow pip to perform local install
Apr 7 2021, 8:04 PM
vsellier committed rDENV4302860f21fa: restore missing python dependencies (authored by vsellier).
restore missing python dependencies
Apr 7 2021, 8:04 PM
vsellier committed rDENV969402e076e9: Specify the righ patch creation on the doc (authored by vsellier).
Specify the righ patch creation on the doc
Apr 7 2021, 8:04 PM
vsellier committed rDENV8f129c74ad87: fix zookeeper configuration (authored by vsellier).
fix zookeeper configuration
Apr 7 2021, 8:04 PM
vsellier committed rDENV01e921459c95: storage: Initialize database schema on startup (authored by vsellier).
storage: Initialize database schema on startup
Apr 7 2021, 8:04 PM
vsellier committed rDENVeb7ccfb85e08: add limit on containers (authored by vsellier).
add limit on containers
Apr 7 2021, 8:04 PM
vsellier committed rDENVd6f6342d48f8: fix database startup (authored by vsellier).
fix database startup
Apr 7 2021, 8:04 PM
vsellier committed rDENVc45eb07841bc: make modules discoverable for swh command line (authored by vsellier).
make modules discoverable for swh command line
Apr 7 2021, 8:04 PM
vsellier committed rDENV33a8e0935f6a: wip try journal (authored by vsellier).
wip try journal
Apr 7 2021, 8:04 PM
vsellier committed rDENV5155b4d64e03: add skaffold section on the readme (authored by vsellier).
add skaffold section on the readme
Apr 7 2021, 8:04 PM
vsellier committed rDENV8062a1f2e4cb: split image to efficiently manage dependencies (authored by vsellier).
split image to efficiently manage dependencies
Apr 7 2021, 8:04 PM
vsellier committed rDENV97bbf2724d8b: wip - kubernetes and skaffold initialization (authored by vsellier).
wip - kubernetes and skaffold initialization
Apr 7 2021, 8:04 PM
vsellier reopened T3165: Generate historical data from the new counters series, a subtask of T2912: Next generation archive counters, as Work in Progress.
Apr 7 2021, 5:42 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier reopened T3165: Generate historical data from the new counters series as "Work in Progress".

Reopening as the release is not working on the stable branch

Apr 7 2021, 5:42 PM · System administration, Monitoring
vsellier changed the status of T3215: Deploy the new counters in staging from Open to Work in Progress.
Apr 7 2021, 5:14 PM · System administration, Monitoring, Web app
vsellier committed rDCNT4d940936d8f2: Use an intermediate temporary file to generate the historical data (authored by vsellier).
Use an intermediate temporary file to generate the historical data
Apr 7 2021, 5:14 PM
vsellier closed T3165: Generate historical data from the new counters series as Resolved.
Apr 7 2021, 5:13 PM · System administration, Monitoring
vsellier closed T3165: Generate historical data from the new counters series, a subtask of T2912: Next generation archive counters, as Resolved.
Apr 7 2021, 5:13 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed D5444: Use an intermediate temporary file to generate the historical data.
Apr 7 2021, 5:05 PM
vsellier committed rDCNTf2fba43f087e: Use an intermediate temporary file to generate the historical data (authored by vsellier).
Use an intermediate temporary file to generate the historical data
Apr 7 2021, 5:05 PM
vsellier requested review of D5444: Use an intermediate temporary file to generate the historical data.
Apr 7 2021, 4:53 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5444: Use an intermediate temporary file to generate the historical data.
Apr 7 2021, 4:52 PM · System administration, Monitoring
vsellier closed D5442: Allow the webapp to retrieve the history file via a GET endpoint.
Apr 7 2021, 4:50 PM
vsellier committed rDCNT42157fd04885: Allow the webapp to retrieve the history file via a GET endpoint (authored by vsellier).
Allow the webapp to retrieve the history file via a GET endpoint
Apr 7 2021, 4:50 PM
vsellier updated the diff for D5442: Allow the webapp to retrieve the history file via a GET endpoint.

Register the history endpoint only if the configuration is present

Apr 7 2021, 4:47 PM
vsellier updated the diff for D5442: Allow the webapp to retrieve the history file via a GET endpoint.
  • Return a 404 if the requested file does not exist
  • Use a fixture to configure the tests to make them easier to read
Apr 7 2021, 4:43 PM
vsellier added inline comments to D5442: Allow the webapp to retrieve the history file via a GET endpoint.
Apr 7 2021, 3:39 PM
vsellier requested review of D5442: Allow the webapp to retrieve the history file via a GET endpoint.
Apr 7 2021, 3:11 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5442: Allow the webapp to retrieve the history file via a GET endpoint.
Apr 7 2021, 3:10 PM · System administration, Monitoring
vsellier committed rDCNT9a0ae605748b: add python3-requests-mock dependency (authored by vsellier).
add python3-requests-mock dependency
Apr 7 2021, 12:40 PM
vsellier closed D5429: Manage and expose the historical data.
Apr 7 2021, 12:13 PM
vsellier committed rDCNT1b262c37b768: Manage and expose the historical data (authored by vsellier).
Manage and expose the historical data
Apr 7 2021, 12:13 PM
vsellier updated the diff for D5429: Manage and expose the historical data.

Update according to the review feedback

Apr 7 2021, 12:07 PM
vsellier committed rDSNIPeac990efacb0: Add staging environment diagram (authored by vsellier).
Add staging environment diagram
Apr 7 2021, 11:27 AM
vsellier updated the diff for D5429: Manage and expose the historical data.

Fix typos in the commit message

Apr 7 2021, 10:10 AM

Apr 6 2021

vsellier updated the diff for D5429: Manage and expose the historical data.

fix another typo

Apr 6 2021, 7:40 PM
vsellier requested review of D5429: Manage and expose the historical data.
Apr 6 2021, 7:31 PM
vsellier closed T3211: staging: git loader can't load repositories hosted by github.com as Resolved.

The requested change was made by the DSI. Everything is working well now.

Apr 6 2021, 7:11 PM · System administration
vsellier closed D5428: Allow to use several backends with a RPCServerApp.
Apr 6 2021, 7:05 PM
vsellier committed rDCOREcbc9b86b2336: Allow to use several backends with a RPCServerApp (authored by vsellier).
Allow to use several backends with a RPCServerApp
Apr 6 2021, 7:05 PM
vsellier added a comment to D5428: Allow to use several backends with a RPCServerApp.

Why not a single backend class for both?

Because it's not the same backend implementation, so it's not the same concern/scope, as mentioned in the diff description.

Apr 6 2021, 5:03 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5429: Manage and expose the historical data.
Apr 6 2021, 4:25 PM · System administration, Monitoring
vsellier requested review of D5428: Allow to use several backends with a RPCServerApp.
Apr 6 2021, 4:11 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5428: Allow to use several backends with a RPCServerApp.
Apr 6 2021, 4:09 PM · System administration, Monitoring
vsellier added a comment to T3211: staging: git loader can't load repositories hosted by github.com.

The wrong network profile was requested for the staging gateway, so it seems it doesn't have full access to the internet.
An email was sent to the DSI to request unfiltered access.

Apr 6 2021, 3:19 PM · System administration
vsellier changed the status of T3211: staging: git loader can't load repositories hosted by github.com from Open to Work in Progress.
Apr 6 2021, 2:01 PM · System administration

Apr 2 2021

vsellier triaged T3203: docs: Document the firewall installation and procedures as Normal priority.
Apr 2 2021, 10:44 AM · Documentation, System administration
vsellier renamed T3194: Upgrade opnsense firewalls from 20.7.4 to 21.1.4 from Upgrade opensense firewalls from 20.7.4 to 21.1.4 to Upgrade opnsense firewalls from 20.7.4 to 21.1.4.
Apr 2 2021, 10:41 AM · System administration
vsellier closed T3194: Upgrade opnsense firewalls from 20.7.4 to 21.1.4 as Resolved.
Apr 2 2021, 10:34 AM · System administration