Page MenuHomePhabricator

Add more logging to determine what happens to jobs in the wild
Closed, ResolvedPublic

Description

Motivation

We have had a few bugs where the Tally job went MIA and we would like to know what happened to it. This task is to add more logging for us to be able to see that.

Event Timeline

Change 711256 had a related patch set uploaded (by STran; author: STran):

[mediawiki/extensions/SecurePoll@master] [WIP] Add ad-hoc logging to tally process

https://gerrit.wikimedia.org/r/711256

Change 711256 merged by jenkins-bot:

[mediawiki/extensions/SecurePoll@master] Add ad-hoc logging to tally process

https://gerrit.wikimedia.org/r/711256

Change 710720 had a related patch set uploaded (by Phuedx; author: STran):

[mediawiki/extensions/SecurePoll@wmf/1.37.0-wmf.18] Add ad-hoc logging to tally process

https://gerrit.wikimedia.org/r/710720

Change 710720 merged by jenkins-bot:

[mediawiki/extensions/SecurePoll@wmf/1.37.0-wmf.18] Add ad-hoc logging to tally process

https://gerrit.wikimedia.org/r/710720

Mentioned in SAL (#wikimedia-operations) [2021-08-11T11:17:03Z] <lucaswerkmeister-wmde@deploy1002> Synchronized php-1.37.0-wmf.18/extensions/SecurePoll/: Backport: [[gerrit:710720|Add ad-hoc logging to tally process (T288366)]] (duration: 01m 09s)

☝️ As discussed during today's AHT Standup - RTL meeting, the log lines that @STran added are appearing in Logstash, e.g. see https://logstash.wikimedia.org/goto/e76de3a5f5ec01db310a88641ea6f4b2.