
Apr 10 2021

vsellier committed rDENV75135ca327f4: webapp use counters to display the history count graph (authored by vsellier).
webapp use counters to display the history count graph
Apr 10 2021, 8:27 PM
vsellier committed rDENV16ef24f70775: monitoring: use consistent names for exporter job (authored by vsellier).
monitoring: use consistent names for exporter job
Apr 10 2021, 8:27 PM
vsellier committed rDENVb09a2a45ab03: use a local storage for the registry (authored by vsellier).
use a local storage for the registry
Apr 10 2021, 8:27 PM
vsellier committed rDENV0c220f3eecfd: try to perform a warm shutdown of the lister and loaders (authored by vsellier).
try to perform a warm shutdown of the lister and loaders
Apr 10 2021, 8:27 PM
vsellier committed rDENV1ca15f499b05: loaders: use a dynamic hostname (authored by vsellier).
loaders: use a dynamic hostname
Apr 10 2021, 8:27 PM

Apr 9 2021

vsellier triaged T3232: remove hardcoded historical values from the webapp as Normal priority.
Apr 9 2021, 7:33 PM · Monitoring, Web app
vsellier triaged T3231: Make the source of the object counts configurable as Normal priority.
Apr 9 2021, 7:22 PM · System administration, Monitoring, Web app
vsellier closed T3165: Generate historical data from the new counters series, a subtask of T2912: Next generation archive counters, as Resolved.
Apr 9 2021, 7:02 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3165: Generate historical data from the new counters series as Resolved.

Everything is released correctly and deployed on staging

Apr 9 2021, 7:02 PM · System administration, Monitoring
vsellier closed T3215: Deploy the new counters in staging, a subtask of T2912: Next generation archive counters, as Resolved.
Apr 9 2021, 6:56 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier closed T3215: Deploy the new counters in staging as Resolved.

I finally found why the graphs look weird: https://forge.softwareheritage.org/source/swh-web/browse/master/swh/web/misc/urls.py$31
With a dirty patch on the server, it's way better:

Apr 9 2021, 6:56 PM · System administration, Monitoring, Web app
vsellier added a project to T3228: Free 4 of 5 remaining ips still used on vlan210: System administration.
Apr 9 2021, 6:31 PM · System administration
vsellier retitled D5479: network: Remove network interface on deprecated VLAN210 network from nerwtork: Remove network interface on deprecated VLAN210 network to network: Remove network interface on deprecated VLAN210 network.
Apr 9 2021, 3:34 PM
vsellier updated the diff for D5479: network: Remove network interface on deprecated VLAN210 network.

Fix a typo in the commit message

Apr 9 2021, 3:34 PM
vsellier requested review of D5479: network: Remove network interface on deprecated VLAN210 network.
Apr 9 2021, 3:14 PM
vsellier added a revision to T3228: Free 4 of 5 remaining ips still used on vlan210: D5479: network: Remove network interface on deprecated VLAN210 network.
Apr 9 2021, 3:14 PM · System administration
vsellier committed rSPSITE011681733315: fix wrong usage of alias/lookup (authored by vsellier).
fix wrong usage of alias/lookup
Apr 9 2021, 2:39 PM
vsellier committed rSPSITE4f9371e81f3d: staging: Fix the counters history url (authored by vsellier).
staging: Fix the counters history url
Apr 9 2021, 2:28 PM
vsellier changed the status of T3228: Free 4 of 5 remaining ips still used on vlan210 from Open to Work in Progress.
Apr 9 2021, 2:16 PM · System administration
vsellier added a comment to T3215: Deploy the new counters in staging.

The pipeline is deployed in staging.
It's working, but it seems the graphs need some initial values in staging to render correctly:

Apr 9 2021, 12:48 PM · System administration, Monitoring, Web app
vsellier closed D5470: staging: configure counters history pipeline.
Apr 9 2021, 12:21 PM
vsellier committed rSPSITE52709f7a0330: staging: configure counters history pipeline (authored by vsellier).
staging: configure counters history pipeline
Apr 9 2021, 12:20 PM
vsellier updated the diff for D5470: staging: configure counters history pipeline.

Add a filter to limit the metrics to the current environment

Apr 9 2021, 12:19 PM
vsellier renamed T3228: Free 4 of 5 remaining ips still used on vlan210 from Free 3 of 4 remaing ip still used on vlan210 to Free 4 of 5 remaing ips still used on vlan210.
Apr 9 2021, 10:50 AM · System administration
vsellier triaged T3228: Free 4 of 5 remaining ips still used on vlan210 as Normal priority.
Apr 9 2021, 10:41 AM · System administration
vsellier added a revision to T3215: Deploy the new counters in staging: D5470: staging: configure counters history pipeline.
Apr 9 2021, 9:47 AM · System administration, Monitoring, Web app
vsellier requested review of D5470: staging: configure counters history pipeline.
Apr 9 2021, 9:47 AM

Apr 8 2021

vsellier committed rDCNT46d1d61bb92c: Fix history endpoint path (authored by vsellier).
Fix history endpoint path
Apr 8 2021, 11:52 PM
vsellier closed D5468: Let flask manage json response by itself.
Apr 8 2021, 7:27 PM
vsellier committed rDCNT9958f4035e49: Let flask manage json response by itself (authored by vsellier).
Let flask manage json response by itself
Apr 8 2021, 7:27 PM
vsellier requested review of D5468: Let flask manage json response by itself.
Apr 8 2021, 7:25 PM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5468: Let flask manage json response by itself.
Apr 8 2021, 7:24 PM · System administration, Monitoring
vsellier closed T3219: No logs are ingested on elasticsearch since 2021-03-26 as Resolved.
Apr 8 2021, 4:36 PM · System administrators
vsellier triaged T3223: Elasticsearch: Monitor the max opened shards on a cluster as Normal priority.
Apr 8 2021, 4:35 PM · System administrators
vsellier triaged T3222: Monitor daily indexes are present on the log cluster and logs are correctly ingested as Normal priority.
Apr 8 2021, 4:32 PM · System administration
vsellier triaged T3221: elk: automatically limit log retention as Normal priority.
Apr 8 2021, 4:30 PM · System administration
vsellier added a comment to T3219: No logs are ingested on elasticsearch since 2021-03-26.

The cluster is configured with the default value for cluster.max_shards_per_node [1], so it can have at most 3000 open shards (1000 per node × 3 nodes).
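
The arithmetic behind that limit can be sketched as follows. This is only an illustration: the function names are hypothetical, the node count of 3 is inferred from the 1000*3 figure, and the comparison is an assumption about how the limit is enforced, based on the "[3000]/[3000]" validation error reported on this task.

```python
def max_open_shards(max_shards_per_node: int, data_nodes: int) -> int:
    """Cluster-wide cap on open shards: per-node limit times number of data nodes."""
    return max_shards_per_node * data_nodes


def can_create_index(open_shards: int, new_shards: int, limit: int) -> bool:
    """True if adding new_shards keeps the cluster within the limit (assumed check)."""
    return open_shards + new_shards <= limit


# Values taken from the comment: default of 1000 shards per node, 3 nodes.
limit = max_open_shards(1000, 3)
print(limit)
# A daily index that needs 2 shards cannot be created once 3000 shards are open.
print(can_create_index(3000, 2, limit))
```

Under these assumptions, closing old indexes lowers the open shard count and lets new daily indexes be created again.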

Apr 8 2021, 4:07 PM · System administrators
vsellier added a comment to T3219: No logs are ingested on elasticsearch since 2021-03-26.

I temporarily unblocked the ingestion by closing the systemlogs indexes created before 2020-07-01:

# Close the open systemlogs indexes of 2020.05, then those of 2020.06
curl -s "http://${ES_NODE}/_cat/indices?s=index" | grep -v close | grep systemlogs | awk '{print $3}' | grep -F 2020.05 | xargs -t -I{} curl -XPOST "http://${ES_NODE}/{}/_close"
curl -s "http://${ES_NODE}/_cat/indices?s=index" | grep -v close | grep systemlogs | awk '{print $3}' | grep -F 2020.06 | xargs -t -I{} curl -XPOST "http://${ES_NODE}/{}/_close"
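
The selection step of those pipelines can also be expressed by date instead of by month pattern. The sketch below is hypothetical and not what was run on the cluster: it assumes the listing was fetched with `_cat/indices?h=status,index` (two whitespace-separated columns), and the function name `indexes_to_close` is made up for illustration. It only selects names; issuing the `_close` requests is left out.

```python
from datetime import date
from typing import List


def indexes_to_close(cat_output: str, prefix: str, cutoff: date) -> List[str]:
    """Return open indexes named <prefix>-YYYY.MM.DD dated strictly before cutoff."""
    selected = []
    for line in cat_output.splitlines():
        fields = line.split()
        if len(fields) != 2:
            continue  # skip blank or unexpected lines
        status, name = fields
        if status != "open" or not name.startswith(prefix + "-"):
            continue  # already closed, or another index family
        try:
            year, month, day = (int(p) for p in name[len(prefix) + 1:].split("."))
            index_date = date(year, month, day)
        except ValueError:
            continue  # suffix is not a YYYY.MM.DD date
        if index_date < cutoff:
            selected.append(name)
    return selected


# Made-up sample listing in the assumed "status index" format.
sample = """\
open systemlogs-2020.05.31
close systemlogs-2020.04.30
open systemlogs-2020.07.01
open apache_logs-2020.05.31
"""
print(indexes_to_close(sample, "systemlogs", date(2020, 7, 1)))
```

Comparing real dates avoids having to list each month pattern by hand when the cutoff moves.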
Apr 8 2021, 12:43 PM · System administrators
vsellier added a comment to T3219: No logs are ingested on elasticsearch since 2021-03-26.

Interesting reading: https://www.elastic.co/blog/how-many-shards-should-i-have-in-my-elasticsearch-cluster

Apr 8 2021, 12:34 PM · System administrators
vsellier added a comment to T3219: No logs are ingested on elasticsearch since 2021-03-26.

It seems we have reached a limit on the cluster (from the logstash logs):

Apr 08 10:30:24 logstash0 logstash[1605158]: [2021-04-08T10:30:24,052][WARN ][logstash.outputs.elasticsearch][main][62d11c4234b8981da77a97955da92ac9de92b9a6dcd4582f407face31fd5c664] Could not index event to Elasticsearch. {:status=>400, :action=>["index", {:_id=>nil, :_index=>"systemlogs-2021.04.08", :routing=>nil, :_type=>"_doc"}, #<LogStash::Event:0x2ec8df34>], :response=>{"index"=>{"_index"=>"systemlogs-2021.04.08", "_type"=>"_doc", "_id"=>nil, "status"=>400, "error"=>{"type"=>"validation_exception", "reason"=>"Validation Failed: 1: this action would add [2] total shards, but this cluster currently has [3000]/[3000] maximum shards open;"}}}}
Apr 08 10:30:24 logstash0 logstash[1605158]: [2021-04-08T10:30:24,052][WARN ][logstash.outputs.elasticsearch][main][62d11c4234b8981da77a97955da92ac9de92b9a6dcd4582f407face31fd5c664] Could not index event to Elasticsearch. {:status=>400, :action=>["index", {:_id=>nil, :_index=>"systemlogs-2021.04.08", :routing=>nil, :_type=>"_doc"}, #<LogStash::Event:0x26cfcf58>], :response=>{"index"=>{"_index"=>"systemlogs-2021.04.08", "_type"=>"_doc", "_id"=>nil, "status"=>400, "error"=>{"type"=>"validation_exception", "reason"=>"Validation Failed: 1: this action would add [2] total shards, but this cluster currently has [3000]/[3000] maximum shards open;"}}}}
Apr 08 10:30:24 logstash0 logstash[1605158]: [2021-04-08T10:30:24,053][WARN ][logstash.outputs.elasticsearch][main][62d11c4234b8981da77a97955da92ac9de92b9a6dcd4582f407face31fd5c664] Could not index event to Elasticsearch. {:status=>400, :action=>["index", {:_id=>nil, :_index=>"systemlogs-2021.04.08", :routing=>nil, :_type=>"_doc"}, #<LogStash::Event:0x1ddd876d>], :response=>{"index"=>{"_index"=>"systemlogs-2021.04.08", "_type"=>"_doc", "_id"=>nil, "status"=>400, "error"=>{"type"=>"validation_exception", "reason"=>"Validation Failed: 1: this action would add [2] total shards, but this cluster currently has [3000]/[3000] maximum shards open;"}}}}
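
For monitoring purposes, the shard counts can be pulled out of such a message with a regular expression. This is a minimal sketch: the pattern is derived only from the "reason" text in the log lines above, and the helper name `parse_shard_limit` is hypothetical.

```python
import re
from typing import Optional, Tuple

# Pattern built from the validation_exception reason seen in the logstash output.
SHARD_LIMIT_RE = re.compile(
    r"would add \[(\d+)\] total shards, "
    r"but this cluster currently has \[(\d+)\]/\[(\d+)\] maximum shards open"
)


def parse_shard_limit(line: str) -> Optional[Tuple[int, ...]]:
    """Return (requested, currently_open, maximum), or None if the line does not match."""
    match = SHARD_LIMIT_RE.search(line)
    if match is None:
        return None
    return tuple(int(group) for group in match.groups())


reason = (
    "Validation Failed: 1: this action would add [2] total shards, "
    "but this cluster currently has [3000]/[3000] maximum shards open;"
)
print(parse_shard_limit(reason))
```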
Apr 8 2021, 12:31 PM · System administrators
vsellier changed the status of T3219: No logs are ingested on elasticsearch since 2021-03-26 from Open to Work in Progress.
Apr 8 2021, 12:29 PM · System administrators
vsellier closed D5447: attempt to fix the stable debian build.
Apr 8 2021, 12:19 PM
vsellier committed rDCNTdd8ad46f5f38: attempt to fix the stable debian build (authored by vsellier).
attempt to fix the stable debian build
Apr 8 2021, 12:19 PM
vsellier added a comment to D5447: attempt to fix the stable debian build.

Thanks for the tip to add the option --build-failed-commands %SBUILD_SHELL to the gbp command; it opens a shell in the chroot environment when a build fails.

Apr 8 2021, 11:32 AM
vsellier requested review of D5447: attempt to fix the stable debian build.
Apr 8 2021, 9:08 AM
vsellier added a revision to T3165: Generate historical data from the new counters series: D5447: attempt to fix the stable debian build.
Apr 8 2021, 9:07 AM · System administration, Monitoring

Apr 7 2021

vsellier committed rDENV0df57e50f498: declare the counters stack (authored by vsellier).
declare the counters stack
Apr 7 2021, 8:05 PM
vsellier committed rDENV95507e1dbaca: adapt limits (authored by vsellier).
adapt limits
Apr 7 2021, 8:05 PM
vsellier committed rDENVe5027034a498: add missing parts for counters (authored by vsellier).
add missing parts for counters
Apr 7 2021, 8:05 PM
vsellier committed rDENVfd44cd7937cf: add scheduler journal client (authored by vsellier).
add scheduler journal client
Apr 7 2021, 8:05 PM
vsellier committed rDENV54a19c6f982c: Add search journal client (authored by vsellier).
Add search journal client
Apr 7 2021, 8:05 PM
vsellier committed rDENV2d99f729009f: Add missing dockerfile for the lister's image (authored by vsellier).
Add missing dockerfile for the lister's image
Apr 7 2021, 8:05 PM
vsellier committed rDENV09777c5212e9: add listers (authored by vsellier).
add listers
Apr 7 2021, 8:05 PM
vsellier committed rDENV2e864a6edb82: reorder skaffold configuration (authored by vsellier).
reorder skaffold configuration
Apr 7 2021, 8:05 PM
vsellier committed rDENV75160ff7fa6a: Increase storage limit to avoid OOM (authored by vsellier).
Increase storage limit to avoid OOM
Apr 7 2021, 8:05 PM
vsellier committed rDENVa90a68405482: add missing point on the documentation (authored by vsellier).
add missing point on the documentation
Apr 7 2021, 8:05 PM
vsellier committed rDENVe168a21730b9: configure the webapp to use postgresql (authored by vsellier).
configure the webapp to use postgresql
Apr 7 2021, 8:05 PM
vsellier committed rDENV947a640807c9: Improve elasticsearch startup (authored by vsellier).
Improve elasticsearch startup
Apr 7 2021, 8:05 PM
vsellier committed rDENV114597b9d02d: Adjust limits (authored by vsellier).
Adjust limits
Apr 7 2021, 8:05 PM
vsellier committed rDENV973aff5c4cfa: Drop scheduler-runner verbosity to INFO (authored by ardumont).
Drop scheduler-runner verbosity to INFO
Apr 7 2021, 8:05 PM
vsellier committed rDENVe1c35bc787ce: rabbitmq: Expose the rabbitmq manager port (authored by ardumont).
rabbitmq: Expose the rabbitmq manager port
Apr 7 2021, 8:05 PM
vsellier committed rDENVae39d7cd26df: Add swh-scheduler-listener service (authored by ardumont).
Add swh-scheduler-listener service
Apr 7 2021, 8:05 PM
vsellier committed rDENVc47f116df85d: Add swh search service (authored by ardumont).
Add swh search service
Apr 7 2021, 8:05 PM
vsellier committed rDENV703ffb8aa631: fix postgresql persistence (authored by vsellier).
fix postgresql persistence
Apr 7 2021, 8:05 PM
vsellier committed rDENVa608596b0bfd: Add prometheus and grafana (authored by vsellier).
Add prometheus and grafana
Apr 7 2021, 8:05 PM
vsellier committed rDENV664fe6567c65: Add elasticsearch service (authored by ardumont).
Add elasticsearch service
Apr 7 2021, 8:05 PM
vsellier committed rDENV11aac37e27dd: Add scheduler-runner service (authored by ardumont).
Add scheduler-runner service
Apr 7 2021, 8:05 PM
vsellier committed rDENV78389fd9dab3: add directory initialisation task (authored by vsellier).
add directory initialisation task
Apr 7 2021, 8:05 PM
vsellier committed rDENVd0d5bb82e592: allow kafka to start without being killed by an oom (authored by vsellier).
allow kafka to start without being killed by an oom
Apr 7 2021, 8:05 PM
vsellier committed rDENVd7a132d1954a: webapp: configure rolling upgrade (authored by vsellier).
webapp: configure rolling upgrade
Apr 7 2021, 8:05 PM
vsellier committed rDENV56efa6737c5c: persist zookeeper data to fix kafka restart (authored by vsellier).
persist zookeeper data to fix kafka restart
Apr 7 2021, 8:05 PM
vsellier committed rDENV760233376e19: webapp: correctly install source via pip (authored by vsellier).
webapp: correctly install source via pip
Apr 7 2021, 8:05 PM
vsellier committed rDENVf99c98b69fcd: Make webapp use an entrypoint and update to use postgres backend (authored by ardumont).
Make webapp use an entrypoint and update to use postgres backend
Apr 7 2021, 8:05 PM
vsellier committed rDENV958fac8529c6: objstorage: correctly install source via pip (authored by vsellier).
objstorage: correctly install source via pip
Apr 7 2021, 8:05 PM
vsellier committed rDENVcff6232837ff: Dockerfile: Clean up whitespaces (authored by ardumont).
Dockerfile: Clean up whitespaces
Apr 7 2021, 8:05 PM
vsellier committed rDENVbc7cbcdd5584: loaders: Decrease loaders verbosity (authored by ardumont).
loaders: Decrease loaders verbosity
Apr 7 2021, 8:05 PM
vsellier committed rDENVf5502cdeba20: add service declaration on the /etc/hosts (authored by vsellier).
add service declaration on the /etc/hosts
Apr 7 2021, 8:05 PM
vsellier committed rDENVab93ade7349f: declare the webapp (authored by vsellier).
declare the webapp
Apr 7 2021, 8:05 PM
vsellier committed rDENV4052c66c0c98: add util scripts on the base image (authored by vsellier).
add util scripts on the base image
Apr 7 2021, 8:04 PM
vsellier committed rDENV719c7130f512: cleanup base image (authored by vsellier).
cleanup base image
Apr 7 2021, 8:04 PM
vsellier committed rDENV3ae31f4ae4d9: deploy kafka (authored by vsellier).
deploy kafka
Apr 7 2021, 8:04 PM
vsellier committed rDENV32d3998c231f: Add rabbitmq service (authored by ardumont).
Add rabbitmq service
Apr 7 2021, 8:04 PM
vsellier committed rDENV146ea3db4074: README: Fix and unify host mount points creation (authored by ardumont).
README: Fix and unify host mount points creation
Apr 7 2021, 8:04 PM
vsellier committed rDENV472b6401eb87: Reference how to build the objstorage (authored by ardumont).
Reference how to build the objstorage
Apr 7 2021, 8:04 PM
vsellier committed rDENVfb8056da5d91: Add scheduler service (authored by ardumont).
Add scheduler service
Apr 7 2021, 8:04 PM
vsellier committed rDENVe2ac9999ba12: Readme: Update reference names (authored by ardumont).
Readme: Update reference names
Apr 7 2021, 8:04 PM
vsellier committed rDENVbe0ed90b044e: Cleanup whitespace noises (authored by ardumont).
Cleanup whitespace noises
Apr 7 2021, 8:04 PM
vsellier committed rDENV38b16b21c11c: docker: stop ignoring .git directory to allow pip to perform local install (authored by vsellier).
docker: stop ignoring .git directory to allow pip to perform local install
Apr 7 2021, 8:04 PM
vsellier committed rDENV4302860f21fa: restore missing python dependencies (authored by vsellier).
restore missing python dependencies
Apr 7 2021, 8:04 PM
vsellier committed rDENV969402e076e9: Specify the righ patch creation on the doc (authored by vsellier).
Specify the righ patch creation on the doc
Apr 7 2021, 8:04 PM
vsellier committed rDENV8f129c74ad87: fix zookeeper configuration (authored by vsellier).
fix zookeeper configuration
Apr 7 2021, 8:04 PM
vsellier committed rDENV01e921459c95: storage: Initialize database schema on startup (authored by vsellier).
storage: Initialize database schema on startup
Apr 7 2021, 8:04 PM
vsellier committed rDENVeb7ccfb85e08: add limit on containers (authored by vsellier).
add limit on containers
Apr 7 2021, 8:04 PM
vsellier committed rDENVd6f6342d48f8: fix database startup (authored by vsellier).
fix database startup
Apr 7 2021, 8:04 PM
vsellier committed rDENVc45eb07841bc: make modules discoverable for swh command line (authored by vsellier).
make modules discoverable for swh command line
Apr 7 2021, 8:04 PM
vsellier committed rDENV33a8e0935f6a: wip try journal (authored by vsellier).
wip try journal
Apr 7 2021, 8:04 PM
vsellier committed rDENV5155b4d64e03: add skaffold section on the readme (authored by vsellier).
add skaffold section on the readme
Apr 7 2021, 8:04 PM
vsellier committed rDENV8062a1f2e4cb: split image to efficiently manage dependencies (authored by vsellier).
split image to efficiently manage dependencies
Apr 7 2021, 8:04 PM
vsellier committed rDENV97bbf2724d8b: wip - kubernetes and skaffold initialization (authored by vsellier).
wip - kubernetes and skaffold initialization
Apr 7 2021, 8:04 PM
vsellier reopened T3165: Generate historical data from the new counters series, a subtask of T2912: Next generation archive counters, as Work in Progress.
Apr 7 2021, 5:42 PM · Roadmap 2021, System administration, Monitoring, Web app