Page MenuHomeSoftware Heritage
Feed Advanced Search

Oct 23 2020

vsellier added a comment to D4340: Change staging network.

The terraform apply works, the staging gw was removed and apparently without side effects on other servers

Oct 23 2020, 12:09 PM
vsellier added a comment to D4340: Change staging network.

Thanks, i will try my first terraform apply 😬

Oct 23 2020, 11:58 AM
vsellier updated the test plan for D4340: Change staging network.
Oct 23 2020, 11:31 AM
vsellier created D4340: Change staging network.
Oct 23 2020, 11:30 AM
vsellier added a revision to T2721: Install and configure a firewall for the staging environment: D4340: Change staging network.
Oct 23 2020, 11:30 AM · System administration

Oct 22 2020

vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

List of the rules created :

  • icinga : Floating rule: icinga server -> *:icinga port (5665)
  • prometheus: Floating rule: prometheurs server -> *:prometheus ports (9100/9102/9237/7071/9419)
  • logstash/journal: VLAN440 rule: * -> logstash server:logstash_port (5044)
Oct 22 2020, 12:57 PM · System administration
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

worker0 is migrated and reachable. the dns and icinga rules are well updated after puppet ran on worker0 and pergamon.
To update the server, I had to manually change the ip configuration and reboot it because puppet was failing as it was not able to determine the right ip in 192.168.130.0 network as the server was still associated to an ip in 192.168.128.0 :

root@worker0:~# puppet agent --test
Info: Using configured environment 'staging'
Info: Retrieving pluginfacts
Info: Retrieving plugin
Info: Retrieving locales
Info: Loading facts
Error: Could not retrieve catalog from remote server: Error 500 on SERVER: Server Error: Evaluation Error: Error while evaluating a Function Call, pick(): must receive at least one non empty value (file: /etc/puppet/code/environments/staging/site-modules/profile/manifests/prometheus/node.pp, line: 31, column: 28) on node worker0.internal.staging.swh.network
Warning: Not using cache on failed catalog
Error: Could not retrieve catalog; skipping run
Oct 22 2020, 11:42 AM · System administration
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

The new rules have to be also manually declared on pergamon to reach the new networks.
Puppet declared them on the configuration but didn't reload the network :

root@pergamon:~# puppet agent --test
Info: Using configured environment 'production'
Info: Retrieving pluginfacts
Info: Retrieving plugin
Info: Retrieving locales
Info: Loading facts
Info: Caching catalog for pergamon.softwareheritage.org
Info: Applying configuration version '1603355074'
Notice: /Stage[main]/Profile::Network/Debnet::Iface[eth0]/Concat[/etc/network/interfaces]/File[/etc/network/interfaces]/content: 
--- /etc/network/interfaces	2020-09-15 16:10:15.235917411 +0000
+++ /tmp/puppet-file20201022-2531741-3gl773	2020-10-22 08:25:16.977289874 +0000
@@ -18,6 +18,8 @@
   up ip route add 192.168.101.0/24 via 192.168.100.1
   up ip route add 192.168.200.0/21 via 192.168.100.1
   up ip route add 192.168.128.0/24 via 192.168.100.125
+  up ip route add 192.168.130.0/24 via 192.168.100.130
+  up ip route add 192.168.50.0/24 via 192.168.100.130
   up ip rule add from 192.168.100.29 table private
   up ip route add 192.168.100.0/24 src 192.168.100.29 dev eth1 table private
   up ip route add default via 192.168.100.1 dev eth1 table private
@@ -25,6 +27,8 @@
   down ip route del default via 192.168.100.1 dev eth1 table private
   down ip route del 192.168.100.0/24 src 192.168.100.29 dev eth1 table private
   down ip rule del from 192.168.100.29 table private
+  down ip route del 192.168.50.0/24 via 192.168.100.130
+  down ip route del 192.168.130.0/24 via 192.168.100.130
   down ip route del 192.168.128.0/24 via 192.168.100.125
   down ip route del 192.168.200.0/21 via 192.168.100.1
   down ip route del 192.168.101.0/24 via 192.168.100.1
Oct 22 2020, 10:31 AM · System administration
vsellier committed rSPSITEf463bcb9ebbe: Update the network configuration of the staging nodes to use the new network… (authored by vsellier).
Update the network configuration of the staging nodes to use the new network…
Oct 22 2020, 10:22 AM
vsellier closed D4321: Update the network configuration of the staging nodes to use the new network range behind the firewall.
Oct 22 2020, 10:22 AM
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

The first staging node will be migrated one by one to avoid too much noise in the monitoring and make the detection of the mission rules in the firewall easier. Puppet is disabled on all the staging node to avoid a massive migration :

Oct 22 2020, 10:18 AM · System administration
vsellier committed rSPSITE01aafc53af9e: Declare the opnsense firewall and the new networks it manages (authored by vsellier).
Declare the opnsense firewall and the new networks it manages
Oct 22 2020, 10:09 AM
vsellier closed D4317: Declare the opnsense firewall and the new networks it manages.
Oct 22 2020, 10:09 AM
vsellier closed T2707: The VMs cloned from the debian10 templates considered as physical servers by facter as Resolved.

Resolved by D4293 and manual installation of the packages with a standard priority on all the servers by @olasd :

aptitude -y install "?priority(standard)!~i?archive(stable)"
Oct 22 2020, 9:59 AM · System administration
vsellier committed rSPREc90a689c85e9: Build the proxmox templates with packer (authored by vsellier).
Build the proxmox templates with packer
Oct 22 2020, 9:56 AM
vsellier closed D4293: Build the proxmox templates with packer.
Oct 22 2020, 9:56 AM

Oct 21 2020

vsellier added a revision to T2721: Install and configure a firewall for the staging environment: D4321: Update the network configuration of the staging nodes to use the new network range behind the firewall.
Oct 21 2020, 7:32 PM · System administration
vsellier created D4321: Update the network configuration of the staging nodes to use the new network range behind the firewall.
Oct 21 2020, 7:32 PM
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

good news! thanks for the confirmation

Oct 21 2020, 7:27 PM · System administration
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

After having some hard time to configure the initial firewall rules correctly due to the inter-vlan traffic seen as coming from the gateway address and not filtered, the fw rules allow the following facts :

Oct 21 2020, 5:48 PM · System administration
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.
  • Route manually declared on louvre:
root@louvre:~# ip route add 192.168.130.0/24 via 192.168.100.130 dev ens18
root@louvre:~# ip route add 192.168.50.0/24 via 192.168.100.130 dev ens18
root@louvre:~# ip route
default via 128.93.193.254 dev ens19 onlink 
128.93.193.0/24 dev ens19 proto kernel scope link src 128.93.193.5 
192.168.50.0/24 via 192.168.100.130 dev ens18 
192.168.100.0/24 dev ens18 proto kernel scope link src 192.168.100.1 
192.168.101.0/24 via 192.168.101.2 dev tun0 
192.168.101.2 dev tun0 proto kernel scope link src 192.168.101.1 
192.168.128.0/24 via 192.168.100.125 dev ens18 
192.168.130.0/24 via 192.168.100.130 dev ens18

The route command is not installed on louvre as it's now replaced by ip.

Oct 21 2020, 3:28 PM · System administration
vsellier added a comment to D4317: Declare the opnsense firewall and the new networks it manages.

the staging and staging_new properties were changed to staging_legacy and staging as you suggested. it's better this way.

Oct 21 2020, 2:41 PM
vsellier updated the diff for D4317: Declare the opnsense firewall and the new networks it manages.

Update after the review's feedbacks

Oct 21 2020, 2:38 PM
vsellier accepted D4318: README: Add pin information on vagrant for vagrant/libvirt use.
Oct 21 2020, 2:30 PM
vsellier added a revision to T2721: Install and configure a firewall for the staging environment: D4317: Declare the opnsense firewall and the new networks it manages.
Oct 21 2020, 11:10 AM · System administration
vsellier created D4317: Declare the opnsense firewall and the new networks it manages.
Oct 21 2020, 11:10 AM
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

Some rules needs to be declared to be able to reach the new networks through the firewall.

Oct 21 2020, 10:00 AM · System administration
vsellier created P829 D4315 test.
Oct 21 2020, 9:43 AM

Oct 20 2020

vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

Netbox updated accordingly : https://inventory.internal.softwareheritage.org/virtualization/virtual-machines/75/

Oct 20 2020, 6:25 PM · System administration
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

VIPs configuration

On the FW UI, go to Interfaces / Virtual IPs / Settings
Add the following Virtual IPs :

  1. Mode CARP / interface VLAN440 / Address: 192.168.100.130/24 / Virtual IP Password: not significant / VHID Group : 1 / Description: VLAN440 gw wip
  2. Mode CARP / interface VLAN442 / Address: 192.168.50.1/24 / Virtual IP Password: not significant / VHID Group: 2 / Description: VLAN442 fw wip
  3. Mode CARP / interface: VLAN443 / Address: 192.168.130.1/24 / Virtual IP Password: not significant / VHID Group: 3/ Description: VLAN443 fw wip
  4. Mode CARP / interface: VLAN1300 / Address: 128.93.166.2/26 / Virtual IP Password: not significant / VHID Group: 4 / Description: VLAN1300 fw wip
Oct 20 2020, 4:57 PM · System administration
vsellier added a comment to T2721: Install and configure a firewall for the staging environment.

The firewall was installed with an iso image OPNsense-20.7-OpenSSL-dvd-amd64.iso uploaded on the ceph-proxmox storage

Oct 20 2020, 3:55 PM · System administration
vsellier updated the task description for T2721: Install and configure a firewall for the staging environment.
Oct 20 2020, 12:50 PM · System administration
vsellier updated the task description for T2721: Install and configure a firewall for the staging environment.
Oct 20 2020, 12:29 PM · System administration

Oct 19 2020

vsellier changed the status of T2721: Install and configure a firewall for the staging environment from Open to Work in Progress.
Oct 19 2020, 7:23 PM · System administration
vsellier planned changes to D4308: wip - poc network configuration in markdown.
Oct 19 2020, 6:35 PM
vsellier closed T2691: Test and select a software router, a subtask of T2650: Network refactoring - step 1, as Resolved.
Oct 19 2020, 5:57 PM · System administration
vsellier closed T2691: Test and select a software router as Resolved.

The test phase is achieved. OPNSense seems to have a consensus with no blocking points.
Let's start the real implementation now.

Oct 19 2020, 5:57 PM · System administration
vsellier updated the diff for D4308: wip - poc network configuration in markdown.

formating (fat finger)

Oct 19 2020, 5:29 PM
vsellier updated the diff for D4308: wip - poc network configuration in markdown.

formating

Oct 19 2020, 5:27 PM
vsellier updated the diff for D4308: wip - poc network configuration in markdown.

formating

Oct 19 2020, 5:26 PM
vsellier created D4308: wip - poc network configuration in markdown.
Oct 19 2020, 5:24 PM
vsellier updated the diff for D4293: Build the proxmox templates with packer.

rollback the network configuration commit (should be a new diff)

Oct 19 2020, 5:23 PM
vsellier updated the diff for D4293: Build the proxmox templates with packer.

poc network configuration in markdown

Oct 19 2020, 5:20 PM

Oct 16 2020

vsellier created D4293: Build the proxmox templates with packer.
Oct 16 2020, 5:57 PM
vsellier added a revision to T2707: The VMs cloned from the debian10 templates considered as physical servers by facter: D4293: Build the proxmox templates with packer.
Oct 16 2020, 5:57 PM · System administration

Oct 15 2020

vsellier added a comment to T2707: The VMs cloned from the debian10 templates considered as physical servers by facter.

There is a proxmox builder [1] for packer, I will give it a try to check if we can benefit of the work done for vagrant on puppet and have a common base between the real vms and the local vms used to test.

Oct 15 2020, 6:27 PM · System administration
vsellier updated the task description for T2707: The VMs cloned from the debian10 templates considered as physical servers by facter.
Oct 15 2020, 5:21 PM · System administration
vsellier updated the task description for T2707: The VMs cloned from the debian10 templates considered as physical servers by facter.
Oct 15 2020, 5:20 PM · System administration
vsellier changed the status of T2707: The VMs cloned from the debian10 templates considered as physical servers by facter from Open to Work in Progress.
Oct 15 2020, 5:19 PM · System administration
vsellier accepted D4197: proxmox/terraform: Align production nodes declaration with current module version.

👍 it looks synchronized

Oct 15 2020, 9:34 AM

Oct 14 2020

vsellier reopened T2691: Test and select a software router, a subtask of T2650: Network refactoring - step 1, as Work in Progress.
Oct 14 2020, 5:41 PM · System administration
vsellier reopened T2691: Test and select a software router as "Work in Progress".

fix the wrong status change embedded with the previous comment

Oct 14 2020, 5:41 PM · System administration
vsellier closed T2691: Test and select a software router, a subtask of T2650: Network refactoring - step 1, as Resolved.
Oct 14 2020, 5:40 PM · System administration
vsellier closed T2691: Test and select a software router as Resolved.

@olasd / @ardumont , IMO the tests seem to confirm OPNsense can be a viable solution or at worst, it deserves to be tested with the staging infrastructure.

Oct 14 2020, 5:40 PM · System administration
vsellier updated the task description for T2691: Test and select a software router.
Oct 14 2020, 5:20 PM · System administration
vsellier added a comment to T2691: Test and select a software router.

A prometheus exporter is available as an additional plugin.

Oct 14 2020, 5:20 PM · System administration
vsellier added a comment to T2691: Test and select a software router.

The open vpn configuration support a certificat authority and csr stuff currently manually managed on louvre.

Oct 14 2020, 4:57 PM · System administration
vsellier updated the task description for T2691: Test and select a software router.
Oct 14 2020, 3:40 PM · System administration
vsellier added a comment to T2691: Test and select a software router.
  1. IPSec / Azure configuration
Oct 14 2020, 3:40 PM · System administration
vsellier updated the task description for T2691: Test and select a software router.
Oct 14 2020, 3:00 PM · System administration
vsellier added a comment to T2691: Test and select a software router.

I was not able to test the git backup plugin as it seems it's not yet released and it doesn't appear on the installable plugin list.
The commit for the version 1.0 was done 6 days ago : https://github.com/opnsense/plugins/commit/87c4c96fe1d1dc881f72f91ee67b6a84c9dea42a
I have also tested with the development version of pfsense but it also does not appear.

Oct 14 2020, 2:57 PM · System administration
vsellier updated the task description for T2691: Test and select a software router.
Oct 14 2020, 11:44 AM · System administration
vsellier added a comment to T2691: Test and select a software router.

The HA was quite simple to configure with the documentation [1] and an additional blog post which helps with the nat section not very explicit in the official documentation [2]
It's recommended to have a dedicated network link between the 2 firewalls used to the synchronization. In the tests I have done, I configured the sync on the admin network (VLAN442). It works but it's not the optimal configuration.

Oct 14 2020, 11:41 AM · System administration

Oct 13 2020

vsellier added a comment to T2691: Test and select a software router.

Well, I let this problem aside for the moment as there is nothing special configured for the interface on the VLAN1300 and I have no idea of what can be the source of the problem. Perhaps the "illumination" will come later...

Oct 13 2020, 4:57 PM · System administration
vsellier added a comment to T2691: Test and select a software router.

Having the WAN gateway declared on the VLAN1330 is working well.
Changing the default gateway to 128.93.166.62 force to declare an additional route for the vpn connections (192.168.101.0/24 => gw 192.168.100.1).

Oct 13 2020, 4:12 PM · System administration
vsellier added inline comments to D4197: proxmox/terraform: Align production nodes declaration with current module version.
Oct 13 2020, 12:15 PM
vsellier added a comment to T2691: Test and select a software router.

PFSense and OPNsense were tested.

Oct 13 2020, 11:24 AM · System administration
vsellier changed the status of T2650: Network refactoring - step 1 from Open to Work in Progress.
Oct 13 2020, 9:52 AM · System administration
vsellier changed the status of T2691: Test and select a software router from Open to Work in Progress.
Oct 13 2020, 9:52 AM · System administration
vsellier accepted D4236: Vagrant/Packer: Increase the template disk size to 20Gb.
Oct 13 2020, 9:43 AM

Oct 12 2020

vsellier added a comment to T2650: Network refactoring - step 1.

@olasd I looked at the swh-docs repository to store the sources of the diagrams as you have suggested but I'm not sure this is the better place to store them as the goal is not to display them on the doc site.

Oct 12 2020, 6:56 PM · System administration
vsellier accepted D4238: Vagrantfile: Migrate vms to use libvirt.

LGTM (not tested)

Oct 12 2020, 6:03 PM
vsellier accepted D4239: README: Focus the vagrant setup around the libvirt provider.
Oct 12 2020, 5:52 PM
vsellier renamed T2689: swh-fuse tests are failing on the DENV/tests jenkins job from swh-fuse tests are failing on the DENV/tests jobs to swh-fuse tests are failing on the DENV/tests jenkins job.
Oct 12 2020, 4:07 PM · Software Heritage filesystem, System administration
vsellier triaged T2689: swh-fuse tests are failing on the DENV/tests jenkins job as High priority.
Oct 12 2020, 4:06 PM · Software Heritage filesystem, System administration
vsellier accepted D4226: Vagrantfile: Reference the remote qcow2 box url.
Oct 12 2020, 3:37 PM
vsellier accepted D4138: Vagrantfile: Allow libvirt vm build.
Oct 12 2020, 3:37 PM
vsellier accepted D4137: packer: Add template to build qemu/libvirt images.

Thanks, it's really great.
I have tested locally the qemutest vm and converted the staging-webapp and staging-deposit vms, everything looks good.
The virtualbox and libvirt networks (with the same ip range) can't cohabit together but after a cleanup on the virtualbox side, everything works as expected.

Oct 12 2020, 3:36 PM
vsellier added inline comments to D4138: Vagrantfile: Allow libvirt vm build.
Oct 12 2020, 12:35 PM
vsellier claimed T2650: Network refactoring - step 1.
Oct 12 2020, 11:38 AM · System administration
vsellier committed rSPSITE6c48816e6d06: netbox: fix the monitoring by avoiding the redirection to add the trailing / (authored by vsellier).
netbox: fix the monitoring by avoiding the redirection to add the trailing /
Oct 12 2020, 8:43 AM
vsellier closed D4229: netbox: fix the monitoring by avoiding the redirection to add the trailing /.
Oct 12 2020, 8:43 AM
vsellier created D4229: netbox: fix the monitoring by avoiding the redirection to add the trailing /.
Oct 12 2020, 8:21 AM

Oct 8 2020

vsellier accepted D4196: node/module: Allow more provisioning steps and align staging gateway.
Oct 8 2020, 5:27 PM
vsellier accepted D4167: Disable network profile for vagrant..
Oct 8 2020, 3:50 PM
vsellier accepted D4195: Allow multiple networks declaration.
Oct 8 2020, 2:00 PM
vsellier accepted D4192: terraform/proxmox/staging: Actually use the right storage type.

Thanks, no changes are detected by terraform after this diff

Oct 8 2020, 9:49 AM
vsellier accepted D4191: terraform/proxmox: Centralize versions into one versions.tf file.
Oct 8 2020, 9:39 AM

Oct 7 2020

vsellier committed rSPREb3c6d45f6add: Allow to pass several storage declarations to the node template (authored by vsellier).
Allow to pass several storage declarations to the node template
Oct 7 2020, 5:06 PM
vsellier closed D4185: Allow to pass several storage declarations to the node template.
Oct 7 2020, 5:06 PM
vsellier updated the diff for D4185: Allow to pass several storage declarations to the node template.

rebase

Oct 7 2020, 5:05 PM
vsellier committed rSPREd6cc10b98a12: Fix terraform 0.13 initialisation (authored by vsellier).
Fix terraform 0.13 initialisation
Oct 7 2020, 5:02 PM
vsellier closed D4175: Fix terraform 0.13 initialisation.
Oct 7 2020, 5:02 PM
vsellier updated the diff for D4175: Fix terraform 0.13 initialisation.

rebase

Oct 7 2020, 5:02 PM
vsellier updated the diff for D4185: Allow to pass several storage declarations to the node template.

Link to a diff, not a task

Oct 7 2020, 4:44 PM
vsellier updated the diff for D4185: Allow to pass several storage declarations to the node template.

fix a typo on the commit message

Oct 7 2020, 4:42 PM
vsellier retitled D4185: Allow to pass several storage declarations to the node template from Allow to pass several storage declaration to the node template to Allow to pass several storage declarations to the node template.
Oct 7 2020, 4:41 PM
vsellier created D4185: Allow to pass several storage declarations to the node template.
Oct 7 2020, 4:40 PM
vsellier created P808 terraform for_each .
Oct 7 2020, 2:55 PM
vsellier accepted D4071: terraform/staging: Align proxmox and staging declaration.

lgtm, with this, we will be able to update the staging environment without impacting the rest of the infra

Oct 7 2020, 2:02 PM