Page MenuHomeSoftware Heritage
Feed Advanced Search

Jan 27 2021

vsellier closed D4954: cgit: Don't stop the listing when a repository page is not available.
Jan 27 2021, 3:06 PM
vsellier committed rDLSf6f9f1ca28a9: cgit: Don't stop the listing when a repository page is not available (authored by vsellier).
cgit: Don't stop the listing when a repository page is not available
Jan 27 2021, 3:06 PM
vsellier added a comment to D4954: cgit: Don't stop the listing when a repository page is not available.

Thanks :)

Jan 27 2021, 3:05 PM
vsellier updated the diff for D4954: cgit: Don't stop the listing when a repository page is not available.

Use an exception to validate a repo page can be accessed

Jan 27 2021, 2:54 PM
vsellier closed D4953: cgit: Add support for last_update information during listing.
Jan 27 2021, 2:24 PM
vsellier committed rDLS91fcde83410d: cgit: Add support for last_update information during listing (authored by vsellier).
cgit: Add support for last_update information during listing
Jan 27 2021, 2:24 PM
vsellier updated the diff for D4954: cgit: Don't stop the listing when a repository page is not available.

rebase

Jan 27 2021, 2:19 PM
vsellier updated the diff for D4953: cgit: Add support for last_update information during listing.

Restore missing log when the date can't be parsed

Jan 27 2021, 2:18 PM
vsellier updated the diff for D4954: cgit: Don't stop the listing when a repository page is not available.

rebase

Jan 27 2021, 2:04 PM
vsellier updated the diff for D4953: cgit: Add support for last_update information during listing.

Remove useless variable

Jan 27 2021, 2:03 PM
vsellier requested review of D4954: cgit: Don't stop the listing when a repository page is not available.
Jan 27 2021, 12:47 PM
vsellier moved T2944: Deploy swh-search v0.4.1 from in-progress to deployed/landed/monitoring on the System administration board.
Jan 27 2021, 12:44 PM · System administration, Journal, Archive search
vsellier added a revision to T2988: Improve cgit lister to add last modification date of the repos: D4954: cgit: Don't stop the listing when a repository page is not available.
Jan 27 2021, 12:42 PM · CGit lister, Lister
vsellier updated the diff for D4953: cgit: Add support for last_update information during listing.
  • Reorder methods
  • Adapt date parsing according the review
Jan 27 2021, 12:41 PM
vsellier added a comment to T2944: Deploy swh-search v0.4.1.

To decrease the time to recover the lag, several journal client were launched in // with :

/usr/bin/swh search --config-file /etc/softwareheritage/search/journal_client_objects.yml journal-client objects
Jan 27 2021, 10:00 AM · System administration, Journal, Archive search
vsellier committed rSPREa7c9c625d98c: Allocate more memory to search1 (authored by vsellier).
Allocate more memory to search1
Jan 27 2021, 9:40 AM

Jan 26 2021

vsellier updated the diff for D4953: cgit: Add support for last_update information during listing.

Inline unecessary indirection

Jan 26 2021, 6:52 PM
vsellier updated the diff for D4953: cgit: Add support for last_update information during listing.

Add missing test coverage

Jan 26 2021, 6:49 PM
vsellier requested review of D4953: cgit: Add support for last_update information during listing.
Jan 26 2021, 6:36 PM
vsellier added a revision to T2988: Improve cgit lister to add last modification date of the repos: D4953: cgit: Add support for last_update information during listing.
Jan 26 2021, 6:33 PM · CGit lister, Lister
vsellier changed the status of T2988: Improve cgit lister to add last modification date of the repos from Open to Work in Progress.
Jan 26 2021, 6:04 PM · CGit lister, Lister
vsellier accepted D4946: Install scheduler journal client to saatchi.

LGTM

Jan 26 2021, 12:45 PM
vsellier added a comment to T2944: Deploy swh-search v0.4.1.

Upgrading the index configuration to speedup the indexation :

% cat >/tmp/config.json <<EOF
{
  "index" : {
"translog.sync_interval" : "60s",
"translog.durability": "async",
"refresh_interval": "60s"
  }
}
EOF
% export ES_SERVER=192.168.100.81:9200
% export INDEX=origin            
% curl -s -H "Content-Type: application/json" -XPUT http://${ES_SERVER}/${INDEX}/_settings -d @/tmp/config.json 
{"acknowledged":true}%
Jan 26 2021, 10:31 AM · System administration, Journal, Archive search
vsellier added a comment to T2944: Deploy swh-search v0.4.1.

Production

  • puppet disabled
  • Services stopped :
root@search1:~# systemctl stop swh-search-journal-client@objects.service 
root@search1:~# systemctl stop gunicorn-swh-search
  • Index deleted and recreated
% export ES_SERVER=search-esnode1.internal.softwareheritage.org:9200
% curl -s http://$ES_SERVER/_cat/indices\?v 
health status index  uuid                   pri rep docs.count docs.deleted store.size pri.store.size
green  open   origin Mq8dnlpuRXO4yYoC6CTuQw  90   1  151716299     38861934    260.8gb          131gb
% curl -XDELETE http://$ES_SERVER/origin
{"acknowledged":true}%    
% swh search --config-file /etc/softwareheritage/search/server.yml  initialize
INFO:elasticsearch:PUT http://search-esnode1.internal.softwareheritage.org:9200/origin [status:200 request:2.216s]
INFO:elasticsearch:PUT http://search-esnode3.internal.softwareheritage.org:9200/origin/_mapping [status:200 request:0.151s]
Done.
% curl -s http://$ES_SERVER/_cat/indices\?v                                        
health status index  uuid                   pri rep docs.count docs.deleted store.size pri.store.size
green  open   origin yFaqPPCnRFCnc5AA6Ah8lw  90   1          0            0     36.5kb         18.2kb
  • journal client's consumer group delete:
% export SERVER=kafka1.internal.softwareheritage.org:9092  
% ./kafka-consumer-groups.sh --bootstrap-server ${SERVER} --delete --group swh.search.journal_client
Deletion of requested consumer groups ('swh.search.journal_client') was successful.
  • journal client restarted
  • puppet enabled
Jan 26 2021, 9:39 AM · System administration, Journal, Archive search
vsellier added a comment to T2944: Deploy swh-search v0.4.1.

The filter on visited origins is working correctly on staging. The has_visit flag looks good.
For example for the https://www.npmjs.com/package/@ehmicky/dev-tasks origin

{
  "_index" : "origin",
  "_type" : "_doc",
  "_id" : "019bd314416108304165e82dd92e00bc9ea85a53",
  "_score" : 60.56421,
  "_source" : {
    "url" : "https://www.npmjs.com/package/@ehmicky/dev-tasks",
    "sha1" : "019bd314416108304165e82dd92e00bc9ea85a53"
  },
  "sort" : [
    60.56421,
    "019bd314416108304165e82dd92e00bc9ea85a53"
  ]
}
swh=> select * from origin join origin_visit_status on id=origin where id=469380;
   id   |                       url                        | origin | visit |             date              | status  | metadata |                  snapshot                  | type 
--------+--------------------------------------------------+--------+-------+-------------------------------+---------+----------+--------------------------------------------+------
 469380 | https://www.npmjs.com/package/@ehmicky/dev-tasks | 469380 |     1 | 2021-01-25 13:30:47.221937+00 | created |          |                                            | npm
 469380 | https://www.npmjs.com/package/@ehmicky/dev-tasks | 469380 |     1 | 2021-01-25 13:41:59.435579+00 | partial |          | \xe3f24413d81fd3e9c309686fcfb6c8f5eb549acf | npm
Jan 26 2021, 9:16 AM · System administration, Journal, Archive search

Jan 25 2021

vsellier closed D4943: cgit lister: Add missing types on the init method.
Jan 25 2021, 6:59 PM
vsellier committed rDLSd62e77c1b495: cgit lister: Add missing types on the init method (authored by vsellier).
cgit lister: Add missing types on the init method
Jan 25 2021, 6:59 PM
vsellier requested review of D4943: cgit lister: Add missing types on the init method.
Jan 25 2021, 6:58 PM
vsellier added a revision to T2984: Port cgit lister to the new Lister API: D4943: cgit lister: Add missing types on the init method.
Jan 25 2021, 6:33 PM · Lister, CGit lister, Sprint 2021 01
vsellier added a comment to T2944: Deploy swh-search v0.4.1.

Staging

We are proceeding to a complete index rebuilding

Jan 25 2021, 5:44 PM · System administration, Journal, Archive search
vsellier added a comment to T2944: Deploy swh-search v0.4.1.

Regarding the index rebuilding process, using a naive approach with aliases with the old and the new index[1] returns duplicated results when the search is done.
Using an alias with only the old index, rebuilding a new index and switching the alias to the new index[2] can be a first approach with the default the old index will not be updated until the alias is switched to the new index.
It also requires the swh-search code is able to use different names for the read and write operations.

Jan 25 2021, 4:07 PM · System administration, Journal, Archive search
vsellier changed the status of T2920: Document staging infrastructure from Open to Work in Progress.
Jan 25 2021, 3:32 PM · Documentation, System administration, Staging environment
vsellier changed the status of T2944: Deploy swh-search v0.4.1, a subtask of T2936: Update the swh-search journal client to only set "has_visit" on "full" status of the visit, from Open to Work in Progress.
Jan 25 2021, 3:32 PM · Journal, Archive search
vsellier changed the status of T2944: Deploy swh-search v0.4.1 from Open to Work in Progress.
Jan 25 2021, 3:32 PM · System administration, Journal, Archive search
vsellier renamed T2944: Deploy swh-search v0.4.1 from Deploy swh-search v0.4.1 in staging to Deploy swh-search v0.4.1.
Jan 25 2021, 3:32 PM · System administration, Journal, Archive search
vsellier accepted D4939: gitlab: Adapt celery task implementations to the new lister api.

LGTM

Jan 25 2021, 3:13 PM
vsellier closed D4926: Port cgit lister to the new lister api.
Jan 25 2021, 3:06 PM · Sprint 2021 01, Lister
vsellier committed rDLSe4a590fc7f3e: Port cgit lister to the new lister api (authored by vsellier).
Port cgit lister to the new lister api
Jan 25 2021, 3:06 PM
vsellier updated the diff for D4926: Port cgit lister to the new lister api.
  • rebase
  • update tests according to the review feedbacks
Jan 25 2021, 2:58 PM · Sprint 2021 01, Lister
vsellier committed rSPSITE8e78d5b0edd4: Increase the deposit end-to-end monitoring timeouts (authored by vsellier).
Increase the deposit end-to-end monitoring timeouts
Jan 25 2021, 2:10 PM
vsellier added a comment to T2912: Next generation archive counters.

It seems redis has a Hyperloglog functionnality[1] that can match with the requirements (bloom filter / limited deviation / small memory footprint / efficiency).

Jan 25 2021, 12:52 PM · Roadmap 2021, System administration, Monitoring, Web app
vsellier committed rDSNIPc5aa5b9d15e5: POC for counters with redis hyperloglog (authored by vsellier).
POC for counters with redis hyperloglog
Jan 25 2021, 12:42 PM
vsellier added inline comments to D4926: Port cgit lister to the new lister api.
Jan 25 2021, 12:17 PM · Sprint 2021 01, Lister
vsellier added a revision to T2976: Deposit tests end-to-end are failing in icinga: Unknown Object (Differential Revision).
Jan 25 2021, 11:48 AM · System administration, SWORD deposit
vsellier triaged T2988: Improve cgit lister to add last modification date of the repos as Normal priority.
Jan 25 2021, 11:40 AM · CGit lister, Lister
vsellier updated the task description for T2967: Write journal client subcribed to origin_visit_status topics .
Jan 25 2021, 10:56 AM · Sprint 2021 01, Scheduling utilities

Jan 23 2021

vsellier committed rDSCH3d13cda421b9: Solve uneventful/eventful with unordered messages with snapshots (authored by vsellier).
Solve uneventful/eventful with unordered messages with snapshots
Jan 23 2021, 8:00 PM
vsellier committed rDSCHd528998df596: Do not consider duplicated messages as uneventful event (authored by vsellier).
Do not consider duplicated messages as uneventful event
Jan 23 2021, 8:00 PM

Jan 22 2021

vsellier requested review of D4926: Port cgit lister to the new lister api.
Jan 22 2021, 4:24 PM · Sprint 2021 01, Lister
vsellier added a revision to T2984: Port cgit lister to the new Lister API: D4926: Port cgit lister to the new lister api.
Jan 22 2021, 3:53 PM · Lister, CGit lister, Sprint 2021 01
vsellier closed T2978: Deploy visit-stats journal client on staging as Resolved.
Jan 22 2021, 11:12 AM · Sprint 2021 01, Scheduling utilities
vsellier closed T2978: Deploy visit-stats journal client on staging, a subtask of T2967: Write journal client subcribed to origin_visit_status topics , as Resolved.
Jan 22 2021, 11:12 AM · Sprint 2021 01, Scheduling utilities
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 22 2021, 11:12 AM · Sprint 2021 01, Scheduling utilities
vsellier moved T2984: Port cgit lister to the new Lister API from Backlog to in-progress on the Sprint 2021 01 board.
Jan 22 2021, 11:09 AM · Lister, CGit lister, Sprint 2021 01
vsellier changed the status of T2984: Port cgit lister to the new Lister API from Open to Work in Progress.
Jan 22 2021, 11:09 AM · Lister, CGit lister, Sprint 2021 01

Jan 21 2021

vsellier closed D4908: Do not consider duplicated messages as uneventful event.
Jan 21 2021, 7:28 PM
vsellier closed D4918: Solve uneventful/eventful with unordered messages with snapshots.
Jan 21 2021, 7:28 PM
vsellier committed rDSCH25d036ef4503: Do not consider duplicated messages as uneventful event (authored by vsellier).
Do not consider duplicated messages as uneventful event
Jan 21 2021, 7:28 PM
vsellier committed rDSCH82b7a8a42367: Solve uneventful/eventful with unordered messages with snapshots (authored by vsellier).
Solve uneventful/eventful with unordered messages with snapshots
Jan 21 2021, 7:28 PM
vsellier updated the diff for D4918: Solve uneventful/eventful with unordered messages with snapshots.

rebase

Jan 21 2021, 7:17 PM
vsellier updated the diff for D4908: Do not consider duplicated messages as uneventful event.

rebase

Jan 21 2021, 7:16 PM
vsellier requested review of D4918: Solve uneventful/eventful with unordered messages with snapshots.
Jan 21 2021, 7:10 PM
vsellier added a revision to T2978: Deploy visit-stats journal client on staging: D4918: Solve uneventful/eventful with unordered messages with snapshots.
Jan 21 2021, 7:06 PM · Sprint 2021 01, Scheduling utilities
vsellier updated the diff for D4908: Do not consider duplicated messages as uneventful event.

Test with "several" database upserts as it's more realistic

Jan 21 2021, 7:06 PM
vsellier requested review of D4908: Do not consider duplicated messages as uneventful event.
Jan 21 2021, 5:19 PM
vsellier added a revision to T2978: Deploy visit-stats journal client on staging: D4908: Do not consider duplicated messages as uneventful event.
Jan 21 2021, 5:17 PM · Sprint 2021 01, Scheduling utilities
vsellier closed D4884: Deploy the scheduler's journal client.
Jan 21 2021, 2:42 PM
vsellier committed rSPSITE9a67834c5b94: Deploy the scheduler's journal client (authored by vsellier).
Deploy the scheduler's journal client
Jan 21 2021, 2:42 PM
vsellier closed D4901: scheduler: deploy journal client service.
Jan 21 2021, 12:16 PM
vsellier committed rDENV6687fd022604: scheduler: deploy journal client service (authored by vsellier).
scheduler: deploy journal client service
Jan 21 2021, 12:16 PM
vsellier updated the diff for D4901: scheduler: deploy journal client service.

split the long entrypoint's command line

Jan 21 2021, 12:14 PM
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 21 2021, 11:48 AM · Sprint 2021 01, Scheduling utilities
vsellier requested review of D4901: scheduler: deploy journal client service.
Jan 21 2021, 11:47 AM
vsellier added a revision to T2978: Deploy visit-stats journal client on staging: D4901: scheduler: deploy journal client service.
Jan 21 2021, 11:47 AM · Sprint 2021 01, Scheduling utilities
vsellier updated the diff for D4884: Deploy the scheduler's journal client.

rebase

Jan 21 2021, 9:57 AM
vsellier moved T2919: Decomission webapp0 node (azure) from deployed/landed/monitoring to done on the System administration board.
Jan 21 2021, 9:47 AM · System administration
vsellier moved T2958: Use all the disks on esnode2 and esnode3 from in-progress to deployed/landed/monitoring on the System administration board.
Jan 21 2021, 9:46 AM · System administration
vsellier moved T2905: Deploy swh-search for production from in-progress to deployed/landed/monitoring on the System administration board.
Jan 21 2021, 9:46 AM · System administration, Journal, Archive search
vsellier moved T2903: Test different disk configuration on esnode1 from in-progress to deployed/landed/monitoring on the System administration board.
Jan 21 2021, 9:46 AM · System administration
vsellier moved T2888: Elasticsearch cluster failure during a rolling restart from in-progress to deployed/landed/monitoring on the System administration board.
Jan 21 2021, 9:46 AM · System administration
vsellier moved T2976: Deposit tests end-to-end are failing in icinga from in-progress to deployed/landed/monitoring on the System administration board.
Jan 21 2021, 9:46 AM · System administration, SWORD deposit
vsellier closed T2976: Deposit tests end-to-end are failing in icinga as Resolved.

With the longer warning threshold, the monitoring is now green.

Jan 21 2021, 9:45 AM · System administration, SWORD deposit
vsellier closed D4900: deposit monitoring: Increase warning threshold.
Jan 21 2021, 9:36 AM
vsellier committed rSPSITE2aeb1821b9ab: deposit monitoring: Increase warning threshold (authored by vsellier).
deposit monitoring: Increase warning threshold
Jan 21 2021, 9:36 AM
vsellier requested review of D4900: deposit monitoring: Increase warning threshold.
Jan 21 2021, 9:30 AM
vsellier added a revision to T2976: Deposit tests end-to-end are failing in icinga: D4900: deposit monitoring: Increase warning threshold.
Jan 21 2021, 9:30 AM · System administration, SWORD deposit
vsellier changed the status of T2976: Deposit tests end-to-end are failing in icinga from Open to Work in Progress.
Jan 21 2021, 9:21 AM · System administration, SWORD deposit

Jan 20 2021

vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 20 2021, 7:08 PM · Sprint 2021 01, Scheduling utilities
vsellier added a comment to T2978: Deploy visit-stats journal client on staging.

Backfill launched from storage1 with this script : P927 (10 ranges in //) and finished in ~15mn

Jan 20 2021, 6:46 PM · Sprint 2021 01, Scheduling utilities
vsellier created P927 backfill script with the right backfill.yml and logging.yml configuration files.
Jan 20 2021, 6:45 PM
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 20 2021, 6:44 PM · Sprint 2021 01, Scheduling utilities
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 20 2021, 5:34 PM · Sprint 2021 01, Scheduling utilities
vsellier added a comment to T2978: Deploy visit-stats journal client on staging.

All staging worker stopped:

root@pergamon:~# sudo clush -b -w @staging-workers 'puppet agent --disable "Deploy new storage version"; cd /etc/systemd/system/multi-user.target.wants; for unit in swh-worker@*; do systemctl disable $unit; done; systemctl stop swh-worker@*'
Jan 20 2021, 5:32 PM · Sprint 2021 01, Scheduling utilities
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 20 2021, 5:23 PM · Sprint 2021 01, Scheduling utilities
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 20 2021, 5:06 PM · Sprint 2021 01, Scheduling utilities
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 20 2021, 4:37 PM · Sprint 2021 01, Scheduling utilities
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 20 2021, 3:49 PM · Sprint 2021 01, Scheduling utilities
vsellier updated the task description for T2978: Deploy visit-stats journal client on staging.
Jan 20 2021, 3:49 PM · Sprint 2021 01, Scheduling utilities
vsellier added a comment to T2976: Deposit tests end-to-end are failing in icinga.

it seems it's the scheduler running that is taking time to scheduler the deposit task :
08:37:53 -> task is created
08:43:05 -> the runner is scheduling the task
08:43:24 -> the worker acknowledge the task

Jan 20 2021, 9:50 AM · System administration, SWORD deposit

Jan 19 2021

vsellier closed T2866: Integrate former Uffizi server to the proxmox cluster as Resolved.
Jan 19 2021, 7:51 PM · System administration