
SRE (Group)
Active · Public

Recent Activity

Today

ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1260.eqiad.wmnet with OS bullseye executed with errors:

  • wikikube-worker1260 (FAIL)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • The reimage failed, see the cookbook logs for the details. You can also try typing "install-console" wikikube-worker1260.eqiad.wmnet to get a root shell, but depending on the failure this may not work.
Sat, Aug 3, 3:09 AM · SRE, serviceops, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.
Sat, Aug 3, 2:22 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1269.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1269 (PASS)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030205_jclark_4062815_wikikube-worker1269.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 2:22 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1266.eqiad.wmnet with OS bullseye executed with errors:

  • wikikube-worker1266 (FAIL)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • The reimage failed, see the cookbook logs for the details. You can also try typing "install-console" wikikube-worker1266.eqiad.wmnet to get a root shell, but depending on the failure this may not work.
Sat, Aug 3, 2:15 AM · SRE, serviceops, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.
Sat, Aug 3, 2:01 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1260.eqiad.wmnet with OS bullseye

Sat, Aug 3, 1:49 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1268.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1268 (PASS)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030130_jclark_4055036_wikikube-worker1268.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 1:48 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1269.eqiad.wmnet with OS bullseye

Sat, Aug 3, 1:46 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1267.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1267 (PASS)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030128_jclark_4054700_wikikube-worker1267.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 1:45 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1260.eqiad.wmnet with OS bullseye executed with errors:

  • wikikube-worker1260 (FAIL)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • The reimage failed, see the cookbook logs for the details. You can also try typing "install-console" wikikube-worker1260.eqiad.wmnet to get a root shell, but depending on the failure this may not work.
Sat, Aug 3, 1:37 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1268.eqiad.wmnet with OS bullseye

Sat, Aug 3, 1:12 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1265.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1265 (PASS)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030053_jclark_4041648_wikikube-worker1265.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 1:11 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1267.eqiad.wmnet with OS bullseye

Sat, Aug 3, 1:09 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1264.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1264 (PASS)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030050_jclark_4041233_wikikube-worker1264.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 1:08 AM · SRE, serviceops, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.
Sat, Aug 3, 1:06 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1263.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1263 (PASS)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030047_jclark_4038673_wikikube-worker1263.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 1:05 AM · SRE, serviceops, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.
Sat, Aug 3, 12:59 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1262.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1262 (PASS)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030040_jclark_4034810_wikikube-worker1262.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 12:57 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1266.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:55 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1261.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1261 (PASS)
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • Host up (Debian installer)
    • Add puppet_version metadata to Debian installer
    • Checked BIOS boot parameters are back to normal
    • Host up (new fresh bullseye OS)
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030037_jclark_4034779_wikikube-worker1261.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 12:55 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1265.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:33 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1258.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1258 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030022_jclark_4034271_wikikube-worker1258.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 12:33 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1264.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:30 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1259.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1259 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030019_jclark_4034288_wikikube-worker1259.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 12:29 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1263.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:27 AM · SRE, serviceops, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.
Sat, Aug 3, 12:22 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1262.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:18 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1261.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:18 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1256.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1256 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030007_jclark_4025538_wikikube-worker1256.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 12:17 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1260.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:17 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1259.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:14 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1258.eqiad.wmnet with OS bullseye

Sat, Aug 3, 12:14 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1254.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1254 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408030002_jclark_4025526_wikikube-worker1254.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 12:13 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1255.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1255 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408022356_jclark_4025678_wikikube-worker1255.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 12:07 AM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1257.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1257 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408022354_jclark_4025565_wikikube-worker1257.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Sat, Aug 3, 12:05 AM · SRE, serviceops, ops-eqiad, DC-Ops
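
The entries above follow a regular shape: each completed reimage carries a result line of the form "• <host> (PASS)" or "• <host> (FAIL)", followed by the steps that ran. As a rough illustration only, the sketch below tallies those result lines from a plain-text copy of this feed. The function names (tally_results, hosts_needing_attention) and the regular expression are assumptions made for this example and are not part of the reimage cookbook or of Phabricator. It also assumes the feed is listed newest-first, as it is here.

```python
import re
from collections import defaultdict

# Matches result lines such as "  • wikikube-worker1260 (FAIL)".
# Step lines like "    • Host up (Debian installer)" do not match, because
# the parenthesised part must directly follow a single host token.
RESULT_LINE = re.compile(r"^\s*•\s+(?P<host>\S+)\s+\((?P<status>PASS|FAIL)\)\s*$")


def tally_results(feed_text):
    """Return {host: [status, ...]} in the order the statuses appear in the feed."""
    results = defaultdict(list)
    for line in feed_text.splitlines():
        match = RESULT_LINE.match(line)
        if match:
            results[match.group("host")].append(match.group("status"))
    return dict(results)


def hosts_needing_attention(results):
    """Hosts whose most recent status is FAIL (newest-first feed, so index 0)."""
    return sorted(host for host, statuses in results.items() if statuses[0] == "FAIL")


# Hypothetical excerpt in the same shape as the feed entries above.
sample = """\
  • wikikube-worker1260 (FAIL)
    • Host rebooted via IPMI
  • wikikube-worker1269 (PASS)
    • Host up (Debian installer)
    • Rebooted
  • wikikube-worker1260 (FAIL)
"""

summary = tally_results(sample)
print(summary)
print(hosts_needing_attention(summary))
```

Keeping every status per host, rather than only the latest, makes repeated failures visible, such as the two FAIL entries for wikikube-worker1260 in this feed.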

Yesterday

ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1255.eqiad.wmnet with OS bullseye

Fri, Aug 2, 11:49 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1256.eqiad.wmnet with OS bullseye

Fri, Aug 2, 11:48 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1254.eqiad.wmnet with OS bullseye

Fri, Aug 2, 11:48 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1257.eqiad.wmnet with OS bullseye

Fri, Aug 2, 11:48 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1253.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1253 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408022335_jclark_4019134_wikikube-worker1253.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Fri, Aug 2, 11:46 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1252.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1252 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408022333_jclark_4017219_wikikube-worker1252.out
    • configmaster.wikimedia.org updated with the host new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Fri, Aug 2, 11:44 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1251.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1251 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408022329_jclark_4017039_wikikube-worker1251.out
    • configmaster.wikimedia.org updated with the host's new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Fri, Aug 2, 11:41 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage started by jclark@cumin1002 for host wikikube-worker1250.eqiad.wmnet with OS bullseye completed:

  • wikikube-worker1250 (PASS)
    • Downtimed on Icinga/Alertmanager
    • Removed from Puppet and PuppetDB if present and deleted any certificates
    • Removed from Debmonitor if present
    • Generated Puppet certificate
    • Signed new Puppet certificate
    • Run Puppet in NOOP mode to populate exported resources in PuppetDB
    • Found Nagios_host resource for this host in PuppetDB
    • Downtimed the new host on Icinga/Alertmanager
    • Removed previous downtime on Alertmanager (old OS)
    • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202408022326_jclark_4016830_wikikube-worker1250.out
    • configmaster.wikimedia.org updated with the host's new SSH public key for wmf-update-known-hosts-production
    • Rebooted
    • Automatic Puppet run was successful
    • Forced a re-check of all Icinga services for the host
    • Icinga status is optimal
    • Icinga downtime removed
    • Updated Netbox data from PuppetDB
    • Updated Netbox status planned -> active
    • The sre.puppet.sync-netbox-hiera cookbook was run successfully
Fri, Aug 2, 11:36 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1253.eqiad.wmnet with OS bullseye

Fri, Aug 2, 11:30 PM · SRE, serviceops, ops-eqiad, DC-Ops
Maintenance_bot added a project to T371741: PDU sensor over limit: SRE.
Fri, Aug 2, 11:29 PM · SRE, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1252.eqiad.wmnet with OS bullseye

Fri, Aug 2, 11:26 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1251.eqiad.wmnet with OS bullseye

Fri, Aug 2, 11:24 PM · SRE, serviceops, ops-eqiad, DC-Ops
ops-monitoring-bot added a comment to T369743: Q1:rack/setup/install wikikube-worker1240 to wikikube-worker1304.

Cookbook cookbooks.sre.hosts.reimage was started by jclark@cumin1002 for host wikikube-worker1250.eqiad.wmnet with OS bullseye

Fri, Aug 2, 11:21 PM · SRE, serviceops, ops-eqiad, DC-Ops
Maintenance_bot removed a project from T348734: Port defs_from_etcd logic to nftables: Patch-For-Review.
Fri, Aug 2, 9:30 PM · Infrastructure-Foundations, SRE
Maintenance_bot removed a project from T356296: confd setup left without configuration doesn't stop confd: Patch-For-Review.
Fri, Aug 2, 9:30 PM · serviceops, SRE