
thcipriani (Tyler Cipriani)
Engineering Manager, Release Engineering · Administrator

Projects (34)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Monday

  • Clear sailing ahead.

User Details

User Since
Feb 9 2015, 10:04 PM (494 w, 4 d)
Roles
Administrator
Availability
Available
IRC Nick
thcipriani
LDAP User
Thcipriani
MediaWiki User
TCipriani (WMF) [ Global Accounts ]

Recent Activity

Yesterday

thcipriani added a comment to T371354: startupregistrystats-testwiki maintenance job is failing.

Did this actually end up being caused by the train? Digging in the server admin log/reading random backscroll in random places, I couldn't tell if something on the train needed to be fixed or if this was caused by a different issue.

Fri, Aug 2, 10:57 PM · Regression, MW-1.43-notes (1.43.0-wmf.16; 2024-07-30), Wikimedia-production-error
thcipriani added a comment to T371650: Requesting access to deployment shell access for toyofuku.

Hi @NBaca-WMF @thcipriani, could you confirm this request please?

Fri, Aug 2, 3:54 PM · SRE, SRE-Access-Requests
thcipriani updated the task description for T371650: Requesting access to deployment shell access for toyofuku.
Fri, Aug 2, 3:52 PM · SRE, SRE-Access-Requests
thcipriani awarded Deployment Training Graduate to recipient: SToyofuku-WMF.
Fri, Aug 2, 3:52 PM

Wed, Jul 31

thcipriani edited projects for T367403: Validate CI integration so that Ci can release Maven artifacts on user's demand, added: Release-Engineering-Team (Radar); removed Release-Engineering-Team.
Wed, Jul 31, 4:55 PM · Release-Engineering-Team (Radar), Patch-For-Review, Data-Engineering (Q1 2024 July 1st - September 30th), Java-Scala-Standardization, Discovery-Search, Data-Platform-SRE
thcipriani triaged T364199: Mirroring from one Wikimedia GitLab repository to another one no longer works as Low priority.

@LucasWerkmeister is this mirroring a workaround to keep your fork up-to-date with the canonical repo?

Wed, Jul 31, 4:54 PM · User-brennen, Release-Engineering-Team, gitlab-settings, GitLab (Administration, Settings & Policy), Tools
thcipriani triaged T367302: Confusing scap backport behavior on gate-and-submit failure as Medium priority.
Wed, Jul 31, 4:50 PM · Release-Engineering-Team (Priority Backlog 📥), Scap
thcipriani edited projects for T364656: replace production buster deployment servers, added: Release-Engineering-Team (Radar); removed Release-Engineering-Team.
Wed, Jul 31, 4:50 PM · Release-Engineering-Team (Radar), collaboration-services, serviceops, SRE
thcipriani edited projects for T371131: Do not autolink shard names (e.g. s1) as Phabricator Spaces, added: Release-Engineering-Team (Priority Backlog 📥); removed Release-Engineering-Team.
Wed, Jul 31, 4:47 PM · Release-Engineering-Team (Priority Backlog 📥), Patch-For-Review, Phabricator
thcipriani edited projects for T369112: Group -1 pre-train QTE validation environment, added: Release-Engineering-Team (Priority Backlog 📥); removed Release-Engineering-Team.
Wed, Jul 31, 4:44 PM · Release-Engineering-Team (Priority Backlog 📥), Quality-and-Test-Engineering-Team (Test Infrastructure), Epic
thcipriani edited projects for T370400: Java projects hosted on Gerrit should publish artifacts to Gitlab, added: Release-Engineering-Team (Radar); removed Release-Engineering-Team.
Wed, Jul 31, 4:44 PM · Release-Engineering-Team (Radar), Java-Scala-Standardization, Discovery-Search, Data-Engineering
thcipriani edited projects for T369115: [WE6.2.1] Publish pre-train single version containers, added: Release-Engineering-Team (Priority Backlog 📥); removed Release-Engineering-Team.
Wed, Jul 31, 4:44 PM · Release-Engineering-Team (Priority Backlog 📥), OKR-Work, Epic
thcipriani triaged T369884: Fix/remove deployment-charts update_version.py as Low priority.
Wed, Jul 31, 4:43 PM · Release-Engineering-Team (Priority Backlog 📥)

Tue, Jul 30

thcipriani triaged T371427: Transient httpbb errors from on mwdebug boxes as Low priority.
Tue, Jul 30, 7:20 PM · serviceops-radar, Release-Engineering-Team (Seen), Scap
thcipriani created T371427: Transient httpbb errors from on mwdebug boxes.
Tue, Jul 30, 7:20 PM · serviceops-radar, Release-Engineering-Team (Seen), Scap

Mon, Jul 29

thcipriani assigned T366965: 1.43.0-wmf.20 deployment blockers to hashar.
Mon, Jul 29, 11:03 PM · Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments
thcipriani assigned T366964: 1.43.0-wmf.19 deployment blockers to Aklapper.
Mon, Jul 29, 11:02 PM · Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments
thcipriani assigned T366963: 1.43.0-wmf.18 deployment blockers to jeena.
Mon, Jul 29, 11:02 PM · Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments
thcipriani assigned T366962: 1.43.0-wmf.17 deployment blockers to jnuche.
Mon, Jul 29, 11:01 PM · Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments
thcipriani updated the task description for T371010: Requesting access to `restricted` group for Michael Große/migr.
Mon, Jul 29, 7:44 PM · SRE, SRE-Access-Requests
thcipriani added a comment to T371010: Requesting access to `restricted` group for Michael Große/migr.

Reason for access makes sense. Approved from my side.

Mon, Jul 29, 7:43 PM · SRE, SRE-Access-Requests
thcipriani moved T370039: BDD tests: environments from Backlog to Ready on the Catalyst (whole 'nother) board.
Mon, Jul 29, 4:54 PM · Catalyst (whole 'nother)
thcipriani assigned T370039: BDD tests: environments to jnuche.
Mon, Jul 29, 4:54 PM · Catalyst (whole 'nother)
thcipriani set the point value for T370039: BDD tests: environments to 3.
Mon, Jul 29, 4:53 PM · Catalyst (whole 'nother)
thcipriani moved T370040: BDD tests: apiTokens from Backlog to Ready on the Catalyst (whole 'nother) board.
Mon, Jul 29, 4:51 PM · Catalyst (whole 'nother)
thcipriani edited projects for T370040: BDD tests: apiTokens, added: Catalyst (whole 'nother); removed Catalyst.
Mon, Jul 29, 4:51 PM · Catalyst (whole 'nother)
thcipriani set the point value for T370040: BDD tests: apiTokens to 3.
Mon, Jul 29, 4:50 PM · Catalyst (whole 'nother)
thcipriani updated the task description for T370664: Enable SSO for Patchdemo.
Mon, Jul 29, 4:36 PM · Catalyst
thcipriani triaged T371056: Evaluate feasibility/desirability to send logs to Beta-Logs as Low priority.

For now, Catalyst folks are focused on the switchover of patchdemo to use the catalyst back end. For that, we have T370078: Add logging for catalyst WMCS k8s that should cover immediate logging needs. We may want to go this route in the future; setting as Low since we won't be doing this in the very near future.

Mon, Jul 29, 4:27 PM · Catalyst
thcipriani moved T369673: Wiki links for catalyst generated wikis from In progress to Ready on the Catalyst (whole 'nother) board.
Mon, Jul 29, 4:17 PM · Catalyst (whole 'nother)
thcipriani moved T369535: Explore a production Kubernetes PatchDemo host from Waiting for review to In progress on the Catalyst (whole 'nother) board.
Mon, Jul 29, 4:16 PM · Catalyst (whole 'nother)
thcipriani added a comment to T371255: scap backport broken on deploy1003 (bullseye, Git 2.30).

I think the update from git v2.20.1 to v2.30.2 explains this. This commit may be what changed to make @jnuche's fix needed.

Mon, Jul 29, 1:57 PM · Scap

Sat, Jul 27

thcipriani closed T340062: GitLab merge request pages show an error when logged out as Resolved.

Haven't seen this in a while, but the linked task fix rolled out in 16.3.

Sat, Jul 27, 6:57 PM · Release-Engineering-Team (Seen), Upstream, GitLab (Upstream pit of despair 🕳️)

Fri, Jul 26

thcipriani closed T264231: Investigate whether issues, operations, wikis, etc. can be disabled globally on GitLab as Resolved.

Now that we have https://gitlab.wikimedia.org/repos/releng/gitlab-settings running on a schedule, this mostly solves globally disabling things. Talked to KDE folks about this, too. Seems like they have a similar project using the API to disable things on a per-repo basis.

Fri, Jul 26, 9:42 PM · GitLab (Upstream pit of despair 🕳️), Upstream, Release-Engineering-Team (Radar), User-brennen
thcipriani edited projects for T371069: Add helm rollback functionality to scap, added: Release-Engineering-Team (Priority Backlog 📥); removed Release-Engineering-Team.
Fri, Jul 26, 3:57 PM · Release-Engineering-Team (Priority Backlog 📥), MW-on-K8s, Scap

Wed, Jul 24

thcipriani awarded Blog Post: Iterative Improvements a Party Time token.
Wed, Jul 24, 3:20 PM

Tue, Jul 23

thcipriani added a comment to T365449: Upgrade Airflow to 2.9.3.

This is an interesting problem. It seems that the trusted runners should have picked up the job.

Tue, Jul 23, 5:05 PM · Data-Platform-SRE (2024.07.29 - 2024.08.16), collaboration-services, Release-Engineering-Team (Seen), Patch-For-Review, Data Pipelines, Data-Engineering

Mon, Jul 22

thcipriani assigned T367784: Run BDD tests charts and environments endpoints on every commit in CI to SDunlap.
Mon, Jul 22, 4:48 PM · Catalyst (whole 'nother)
thcipriani edited projects for T367784: Run BDD tests charts and environments endpoints on every commit in CI, added: Catalyst (whole 'nother); removed Catalyst.
Mon, Jul 22, 4:47 PM · Catalyst (whole 'nother)
thcipriani added a parent task for T367782: Add BDD tests charts and environments endpoints and scaffolding: T367784: Run BDD tests charts and environments endpoints on every commit in CI.
Mon, Jul 22, 4:47 PM · Catalyst (whole 'nother)
thcipriani added a subtask for T367784: Run BDD tests charts and environments endpoints on every commit in CI: T367782: Add BDD tests charts and environments endpoints and scaffolding.
Mon, Jul 22, 4:47 PM · Catalyst (whole 'nother)
thcipriani removed a subtask for T367782: Add BDD tests charts and environments endpoints and scaffolding: T367784: Run BDD tests charts and environments endpoints on every commit in CI.
Mon, Jul 22, 4:47 PM · Catalyst (whole 'nother)
thcipriani removed a parent task for T367784: Run BDD tests charts and environments endpoints on every commit in CI: T367782: Add BDD tests charts and environments endpoints and scaffolding.
Mon, Jul 22, 4:47 PM · Catalyst (whole 'nother)
thcipriani set the point value for T367784: Run BDD tests charts and environments endpoints on every commit in CI to 2.
Mon, Jul 22, 4:46 PM · Catalyst (whole 'nother)
thcipriani added a comment to T370664: Enable SSO for Patchdemo.

This is for the instance at https://patchdemo.catalyst.wmcloud.org, right?

Mon, Jul 22, 4:42 PM · Catalyst
thcipriani updated the task description for T370078: Add logging for catalyst WMCS k8s.
Mon, Jul 22, 4:19 PM · Patch-For-Review, Catalyst (whole 'nother)
thcipriani added a comment to T369393: [Refactor] catalyst's wikimedia helm chart uses provisioning scripts from patchdemo.

We had some discussion via chat about this, in particular a couple concerns with templates:

  • Escaping input within the templates
  • syntax highlighting within the script (naming things script.sh.tpl works with vscode, this varies by editor tho)
Mon, Jul 22, 4:14 PM · Patch-For-Review, Catalyst (whole 'nother)
thcipriani added a comment to T369535: Explore a production Kubernetes PatchDemo host.

Thanks for that tweak @matmarex

Mon, Jul 22, 4:09 PM · Catalyst (whole 'nother)
thcipriani added a comment to T355882: Temp accounts deployment and the release train.

Most of these questions are in our Risky Change Template. The week something risky is going out, it's a nice template to use to give us a heads up; most of the time that suffices. This change seems pretty fundamental, so we might need more. But let's start there and see.

That sounds good.

Overall, I think the important thing for Release Engineering and specifically train drivers to know would be, after testwiki rollout:

  • testwiki would reflect an editing paradigm (temporary accounts) that is *not* the same as all other wikis. So, during that time it would probably make sense to review logs from other group0 wikis more closely. And WMF teams should probably do any anonymous editing related QA on test2.wikipedia.org

@thcipriani https://www.mediawiki.org/wiki/Help:Temporary_accounts/How_it_works is also a useful reference that RelEng folks should probably read before we enable this on testwiki.

Mon, Jul 22, 1:27 PM · Quality-and-Test-Engineering-Team, Temporary accounts, Trust and Safety Product Team, Release-Engineering-Team

Thu, Jul 18

thcipriani updated subscribers of T366959: 1.43.0-wmf.14 deployment blockers.

That's not how train deployments work. If there's a specific patch you don't want to roll out to all wikis you could potentially revert and backport that one patch. But we won't keep one wiki on a legacy version because unspecified bots are slow (that was whoever wrote the patch's responsibility before putting it on the train).

Can today's changes in the Arabic translation be included in the version?

Thu, Jul 18, 4:21 PM · Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments
thcipriani added a project to T369602: UpdateTranslatablePageJob Error: Call to a member function clearCaches() on null: Language-Team.
Thu, Jul 18, 3:42 PM · Patch-For-Review, LPL Essential (LPL Essential 2024 Jul-Sep), MediaWiki-extensions-Translate, Wikimedia-production-error
thcipriani renamed T369602: UpdateTranslatablePageJob Error: Call to a member function clearCaches() on null from Error: Call to a member function clearCaches() on null to UpdateTranslatablePageJob Error: Call to a member function clearCaches() on null.
Thu, Jul 18, 3:40 PM · Patch-For-Review, LPL Essential (LPL Essential 2024 Jul-Sep), MediaWiki-extensions-Translate, Wikimedia-production-error
thcipriani renamed T370431: [C-DIS] TypeError: Argument 2 passed to Wikibase\Repo\ChangeModification\DispatchChangeVisibilityNotificationJob::createJobSpecification() must be of the type int, string given from TypeError: Argument 2 passed to Wikibase\Repo\ChangeModification\DispatchChangeVisibilityNotificationJob::createJobSpecification() must be of the type int, string given, called in /srv/mediawiki/php-1.43.0-wmf.14/extensions/Wik to TypeError: Argument 2 passed to Wikibase\Repo\ChangeModification\DispatchChangeVisibilityNotificationJob::createJobSpecification() must be of the type int, string given.
Thu, Jul 18, 3:37 PM · Wikidata, Wikidata Dev Team, Wikimedia-production-error, wmde-wikidata-tech
thcipriani created T370431: [C-DIS] TypeError: Argument 2 passed to Wikibase\Repo\ChangeModification\DispatchChangeVisibilityNotificationJob::createJobSpecification() must be of the type int, string given.
Thu, Jul 18, 3:37 PM · Wikidata, Wikidata Dev Team, Wikimedia-production-error, wmde-wikidata-tech
thcipriani added a project to T366742: PHP Deprecated: Use of MediaWiki\Output\OutputPage::setPageTitle with Message argument was deprecated in MediaWiki 1.41. [Called from SpecialCentralNoticeBanners::execute]: Fundraising Tech - Chaos Crew.

Actually, it looks like this merge includes the fix, so IIUC that will roll out with this week’s train.

Thu, Jul 18, 3:30 PM · Fundraising-Backlog, Fundraising Tech - Chaos Crew, User-brennen, MediaWiki-extensions-CentralNotice, Wikimedia-production-error
thcipriani added a comment to T363488: Wikibase\Repo\RestApi\Infrastructure\DataAccess\Exceptions\EntityUpdateFailed: <Error, collected 1 message(s) on the way, array value set>.

I still see these happening in production on wikidatawiki. Seems to be happening on PATCH requests to the wikibase rest API.

Thu, Jul 18, 3:26 PM · Wikibase REST API (WPP), Wikibase Product Platform Team WPP, wmde-wikidata-tech, Wikidata, User-brennen, Wikimedia-production-error
thcipriani created T370428: PHP Warning: Invalid argument supplied for foreach() in EventBus.php.
Thu, Jul 18, 3:16 PM · Data-Engineering (Q1 2024 July 1st - September 30th), MW-1.43-notes (1.43.0-wmf.15; 2024-07-23), Data-Platform, Event-Platform, Wikimedia-production-error
thcipriani added a project to T369186: PHP Warning: Cannot modify header information - headers already sent by (output started at /srv/mediawiki/php-1.43.0-wmf.11/includes/libs/http/MultiHttpClient.php:477): Editing-team.
Thu, Jul 18, 3:08 PM · MW-Interfaces-Team, MediaWiki-libs-HTTP, SRE-swift-storage, Wikimedia-production-error

Wed, Jul 17

thcipriani added projects to T370352: Missing some email notifications from Phabricator (2024-07-17): Release-Engineering-Team (Priority Backlog 📥), collaboration-services.
Wed, Jul 17, 7:13 PM · Infrastructure-Foundations, Mail, collaboration-services, Release-Engineering-Team (Priority Backlog 📥), Phabricator
thcipriani added a comment to T370110: Apache 2.4.61 throws a 403 Forbidden for links containing %3F.

Using the UnsafeAllow3F flag on the existing /index.php?... rule seems like a reasonable fix to me:

RewriteRule ^(.*)$          /index.php?__path__=$1  [B,L,QSA,UnsafeAllow3F]

We can see in the logging from T370110#9983690 that the URL-encoded %3F values are passing through as expected, and the ? that is triggering the error is actually the one we are explicitly adding to support the front router setup of Phorge's index.php entry point.

Wed, Jul 17, 4:01 PM · Vuln-VulnComponent, SecTeam-Processed, collaboration-services, Release-Engineering-Team (Priority Backlog 📥), Wikimedia-Apache-configuration, Phabricator, User-brennen, Security

Tue, Jul 16

thcipriani renamed T370076: [Go Live] Move patchdemo.wmcloud.org to Kubernetes in the catalyst WMCS project from Move patchdemo.wmflabs.org to the catalyst WMCS project to Move patchdemo.wmflabs.org to Kubernetes in the catalyst WMCS project.
Tue, Jul 16, 5:06 PM · Catalyst
thcipriani added a comment to T370110: Apache 2.4.61 throws a 403 Forbidden for links containing %3F.

@bd808: The RewriteRule ^(.*)$ /index.php?__path__=$1 [B,L,QSA] rule at https://gerrit.wikimedia.org/r/plugins/gitiles/operations/puppet/+/refs/heads/production/modules/phabricator/templates/phabricator-default.conf.erb#75 is going to trigger on this. Not sure what else.

This actually might not be the trigger point? The B flag here should keep the %3F in the inbound URL from being expanded. As I understand the bug and fix, Apache is now checking to see if there was a %3F in the inbound URL and a ? in the redirect target, and refusing to continue when this is the case.

Tue, Jul 16, 2:34 AM · Vuln-VulnComponent, SecTeam-Processed, collaboration-services, Release-Engineering-Team (Priority Backlog 📥), Wikimedia-Apache-configuration, Phabricator, User-brennen, Security
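
The check described in the T370110 comment above can be sketched as a toy model. This is only an illustration of the behaviour as the comment understands it, not Apache's actual implementation; the function name `would_reject` and the `allow_3f` parameter (standing in for the UnsafeAllow3F flag) are hypothetical.

```python
def would_reject(inbound_url: str, substitution: str, allow_3f: bool = False) -> bool:
    """Toy model (an assumption, not Apache source code).

    Reject when the inbound URL contained an encoded question mark (%3F)
    and the rewrite substitution contains a literal '?', unless the rule
    opts out via a flag modelled on UnsafeAllow3F.
    """
    had_encoded_qmark = "%3f" in inbound_url.lower()
    target_has_query = "?" in substitution
    return had_encoded_qmark and target_has_query and not allow_3f


# The substitution from the rule quoted in the task.
RULE_TARGET = "/index.php?__path__=$1"

print(would_reject("/T123%3Ffoo", RULE_TARGET))                 # inbound has %3F
print(would_reject("/T123", RULE_TARGET))                       # no %3F inbound
print(would_reject("/T123%3Ffoo", RULE_TARGET, allow_3f=True))  # flag set
```

Under this model the literal ? in the substitution, not the inbound %3F alone, is what makes the rule trip, which may be why setting the flag on that one rule was proposed.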
thcipriani added a comment to P66571 (An Untitled Masterwork).
["config"]=>
array(1) {
  ["config"]=>
  array(1) {
    ["eqiad/ReadOnly"]=>
    bool(false)
  }
}
Tue, Jul 16, 2:07 AM
thcipriani created P66571 (An Untitled Masterwork).
Tue, Jul 16, 1:52 AM

Mon, Jul 15

thcipriani added a comment to T370110: Apache 2.4.61 throws a 403 Forbidden for links containing %3F.

If we turn up the LogLevel on a Phab node to trace2 I think we should see that before/after and maybe figure out which rule should really have a B flag added.

Could use devtools' https://phabricator.wmcloud.org/ for testing here.

Mon, Jul 15, 11:38 PM · Vuln-VulnComponent, SecTeam-Processed, collaboration-services, Release-Engineering-Team (Priority Backlog 📥), Wikimedia-Apache-configuration, Phabricator, User-brennen, Security
thcipriani added a comment to T360784: Rebuild integration-cumin to get rid of Debian Buster.

Hello, @hashar! The deadline for this rebuild is today :)

Likely you can just replace this VM with an identically-puppetized Bullseye host. I'm doing the same in deployment-prep, although in that case I'm blocked by not having the keyholder passphrase.

Mon, Jul 15, 11:07 PM · Release-Engineering-Team, Continuous-Integration-Infrastructure
thcipriani edited projects for T370089: Create account for Levi Ferreira on wikimedia.biterg.io, added: Release-Engineering-Team (Priority Backlog 📥); removed Release-Engineering-Team.
Mon, Jul 15, 5:48 PM · Release-Engineering-Team (Priority Backlog 📥), wikimedia.biterg.io
thcipriani created T370089: Create account for Levi Ferreira on wikimedia.biterg.io.
Mon, Jul 15, 5:48 PM · Release-Engineering-Team (Priority Backlog 📥), wikimedia.biterg.io
thcipriani assigned T369393: [Refactor] catalyst's wikimedia helm chart uses provisioning scripts from patchdemo to jnuche.
Mon, Jul 15, 4:53 PM · Patch-For-Review, Catalyst (whole 'nother)
thcipriani moved T370078: Add logging for catalyst WMCS k8s from Backlog to Ready on the Catalyst (whole 'nother) board.
Mon, Jul 15, 4:52 PM · Patch-For-Review, Catalyst (whole 'nother)
thcipriani placed T367086: Patchdemo e2e tests validate that patchdemo loads and displays correctly up for grabs.
Mon, Jul 15, 4:52 PM · Catalyst
thcipriani added a parent task for T366697: Container Logs: retrieve only pods for environment: T366971: Catalyst API user can stream logs from a specific container.
Mon, Jul 15, 4:51 PM · Catalyst (whole 'nother)
thcipriani added a subtask for T366971: Catalyst API user can stream logs from a specific container: T366697: Container Logs: retrieve only pods for environment.
Mon, Jul 15, 4:51 PM · Catalyst (whole 'nother)
thcipriani moved T370080: Moving proxies across wmcs projects for patchdemo.wmflabs.org from Backlog to radar on the Catalyst board.
Mon, Jul 15, 4:49 PM · Cloud-Services, Catalyst
thcipriani created T370080: Moving proxies across wmcs projects for patchdemo.wmflabs.org.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Mon, Jul 15, 4:48 PM · Cloud-Services, Catalyst
thcipriani updated the task description for T370076: [Go Live] Move patchdemo.wmcloud.org to Kubernetes in the catalyst WMCS project.
Mon, Jul 15, 4:45 PM · Catalyst
thcipriani renamed T370076: [Go Live] Move patchdemo.wmcloud.org to Kubernetes in the catalyst WMCS project from Move patchdemo.wikimedia.org to the catalyst WMCS project to Move patchdemo.wmflabs.org to the catalyst WMCS project.
Mon, Jul 15, 4:44 PM · Catalyst
thcipriani edited projects for T370078: Add logging for catalyst WMCS k8s, added: Catalyst (whole 'nother); removed Catalyst.
Mon, Jul 15, 4:43 PM · Patch-For-Review, Catalyst (whole 'nother)
thcipriani assigned T370078: Add logging for catalyst WMCS k8s to SDunlap.
Mon, Jul 15, 4:43 PM · Patch-For-Review, Catalyst (whole 'nother)
thcipriani set the point value for T370078: Add logging for catalyst WMCS k8s to 3.
Mon, Jul 15, 4:43 PM · Patch-For-Review, Catalyst (whole 'nother)
thcipriani created T370078: Add logging for catalyst WMCS k8s.
Mon, Jul 15, 4:42 PM · Patch-For-Review, Catalyst (whole 'nother)
thcipriani closed T368280: [Explore] logging from the API into logstash as Resolved.

Nice work digging :)

Mon, Jul 15, 4:36 PM · Catalyst (whole 'nother)
thcipriani set Final Story Points to 1 on T368280: [Explore] logging from the API into logstash.
Mon, Jul 15, 4:36 PM · Catalyst (whole 'nother)
thcipriani updated the task description for T368280: [Explore] logging from the API into logstash.
Mon, Jul 15, 4:36 PM · Catalyst (whole 'nother)
thcipriani added a comment to T368280: [Explore] logging from the API into logstash.

Suggestions (in order of my recommendations):

  1. host an ELK stack on our cluster
  2. write logs to files on a persisted volume
  3. log on the Wikimedia Observability experimental CloudVPS logstash
  4. continue to rely on Kubernetes logs only (Ostrich Algorithm)
Mon, Jul 15, 4:34 PM · Catalyst (whole 'nother)
thcipriani added a subtask for T370076: [Go Live] Move patchdemo.wmcloud.org to Kubernetes in the catalyst WMCS project: T368943: Archive the Catalyst PatchDemo fork.
Mon, Jul 15, 4:27 PM · Catalyst
thcipriani added a parent task for T368943: Archive the Catalyst PatchDemo fork: T370076: [Go Live] Move patchdemo.wmcloud.org to Kubernetes in the catalyst WMCS project.
Mon, Jul 15, 4:27 PM · Catalyst
thcipriani created T370076: [Go Live] Move patchdemo.wmcloud.org to Kubernetes in the catalyst WMCS project.
Mon, Jul 15, 4:27 PM · Catalyst
thcipriani updated the task description for T369535: Explore a production Kubernetes PatchDemo host.
Mon, Jul 15, 4:25 PM · Catalyst (whole 'nother)
thcipriani updated the task description for T369673: Wiki links for catalyst generated wikis.
Mon, Jul 15, 4:18 PM · Catalyst (whole 'nother)
thcipriani added a comment to T369535: Explore a production Kubernetes PatchDemo host.

Getting database free disk space in k8s may require more effort than I think it's worth at this point. Let's just drop that information from the front end and if we miss it, we can always explore options after that. As long as we have disk space monitoring in some other form, I don't think we'll miss it in the front-end of the application.

Mon, Jul 15, 4:09 PM · Catalyst (whole 'nother)

Fri, Jul 12

thcipriani added a project to T369962: New beta deployment server unable to connect to logging logstash server in WMCS: Beta-Cluster-Infrastructure.
Fri, Jul 12, 11:09 PM · User-bd808, Beta-Cluster-Infrastructure, Release-Engineering-Team (Radar), SRE Observability
thcipriani created T369962: New beta deployment server unable to connect to logging logstash server in WMCS.
Fri, Jul 12, 11:09 PM · User-bd808, Beta-Cluster-Infrastructure, Release-Engineering-Team (Radar), SRE Observability
thcipriani created T369954: scap prep auto fails on new deployment host.
Fri, Jul 12, 9:28 PM · Scap
thcipriani added a comment to T363957: deployment_server bullseye - mw-cgroup.service: Failed .

Thanks for documenting this, ran into the same thing in deployment prep (T327742), reboot also fixed it there.

Fri, Jul 12, 9:00 PM · serviceops, SRE
thcipriani assigned T366961: 1.43.0-wmf.16 deployment blockers to brennen.
Fri, Jul 12, 4:59 PM · User-brennen, Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments
thcipriani assigned T366960: 1.43.0-wmf.15 deployment blockers to dduvall.
Fri, Jul 12, 4:58 PM · Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments
thcipriani assigned T366959: 1.43.0-wmf.14 deployment blockers to dancy.
Fri, Jul 12, 4:57 PM · Release-Engineering-Team (Priority Backlog 📥), Release, Train Deployments

Thu, Jul 11

thcipriani added a member for acl*phabricator: Lferreira.
Thu, Jul 11, 4:25 PM
thcipriani added a member for acl*Project-Admins: Lferreira.
Thu, Jul 11, 4:24 PM