Page MenuHomeSoftware Heritage
Feed Advanced Search

Jan 10 2018

ftigeot added a comment to T883: set up a replica of the main DB on azure.
  • Created a 6TB /srv volume with 4x 1536GB drives in RAID0
  • 1536GB drives have the same theoritical performance limits as 2TB ones
  • Used lvm2 for the software raid layer.
  • mdadm doesn't allow to grow a RAID0 volume whereas lvm2 could (by concatenating another storage provider at the end)
Jan 10 2018, 2:58 PM · Restricted Project, System administration
ftigeot created T922: Internal servers send mails from invalid hostnames.
Jan 10 2018, 11:41 AM · System administration

Jan 9 2018

ftigeot added a comment to T883: set up a replica of the main DB on azure.

Azure performance appears to be very far from reliable. Linux keeps complaining about issues similar to this one:

[Tue Jan  9 15:47:04 2018] INFO: task kworker/1:3:148 blocked for more than 120 seconds.
[Tue Jan  9 15:47:04 2018]       Not tainted 4.9.0-5-amd64 #1 Debian 4.9.65-3+deb9u2
[Tue Jan  9 15:47:04 2018] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jan 9 2018, 4:50 PM · Restricted Project, System administration
ftigeot added a comment to T883: set up a replica of the main DB on azure.

The sweet spot for Azure drives seems to be 2TB : 7500 IOPS and 250MB/s provisioned.
Smaller ones have less provisioned IOPS and bandwidth per second.
Bigger ones don't get faster.

Jan 9 2018, 2:46 PM · Restricted Project, System administration
ftigeot added a comment to T883: set up a replica of the main DB on azure.

Current database size: ~= 3880 GB
Maximum Azure drive size: 4095 GB

  • We will need to use more than one drive to create a bigger volume
  • We will also most likely need to grow this volume at some point in the future
Jan 9 2018, 2:43 PM · Restricted Project, System administration
ftigeot created T920: Revisit user account management.
Jan 9 2018, 11:32 AM · System administration

Jan 8 2018

ftigeot committed rSPSITE5a9a1a1cdcd7: data/defaults: Add dbreplica0.euwest.azure (authored by ftigeot).
data/defaults: Add dbreplica0.euwest.azure
Jan 8 2018, 5:21 PM
ftigeot committed rSPSITE4cf1438593ec: data/defaults: Send root mails to ftigeot (authored by ftigeot).
data/defaults: Send root mails to ftigeot
Jan 8 2018, 10:36 AM

Jan 2 2018

ftigeot committed rSPSITEc136b96ca888: data/defaults: Do not try to directly backup MySQL databases (authored by ftigeot).
data/defaults: Do not try to directly backup MySQL databases
Jan 2 2018, 2:12 PM

Dec 21 2017

ftigeot added a comment to T910: Do not generate critical monitoring alerts for non-critical hosts .

Not a silver bullet but some of the most recurring critical alerts were silenced by cdec099ecfb5 .

Dec 21 2017, 3:47 PM · System administration
ftigeot committed rSPSITE8d7bfe515245: giverny-specific data: Fix mistyped path (authored by ftigeot).
giverny-specific data: Fix mistyped path
Dec 21 2017, 2:57 PM
ftigeot committed rSPSITEcdec099ecfb5: Add giverny-specific deployment data (authored by ftigeot).
Add giverny-specific deployment data
Dec 21 2017, 2:18 PM

Dec 19 2017

ftigeot created T910: Do not generate critical monitoring alerts for non-critical hosts .
Dec 19 2017, 2:16 PM · System administration

Dec 13 2017

ftigeot added a project to T895: Limit size of most common log files: Easy hack.
Dec 13 2017, 4:25 PM · Easy hack, System administration
ftigeot added a comment to T895: Limit size of most common log files.

Proposed change, already applied manually on uffizi and a few other hosts:

Dec 13 2017, 11:28 AM · Easy hack, System administration
ftigeot created T895: Limit size of most common log files.
Dec 13 2017, 11:27 AM · Easy hack, System administration

Dec 12 2017

ftigeot added a comment to T866: uffizi disk's full makes some workers fail.

Base disk image size was increased from 10 to 20GB.

Dec 12 2017, 4:42 PM · System administration
ftigeot added a comment to T866: uffizi disk's full makes some workers fail.

Resolved with help from @olasd .

Dec 12 2017, 4:41 PM · System administration
ftigeot closed T866: uffizi disk's full makes some workers fail as Resolved.
Dec 12 2017, 4:32 PM · System administration