fix(api,task-processor): Close shutdown drain gap and name container ports by germangarces · Pull Request #533 · Flagsmith/flagsmith-charts

germangarces · 2026-05-13T09:21:12Z

Two small, independent fixes to the api and task-processor deployment templates.

1. Close shutdown drain gap

Default terminationGracePeriodSeconds to 75 and add preStop sleep 20 on the API container. Lets the load balancer finish deregistering the pod before gunicorn stops accepting connections, so rolling deploys and HPA scale-downs no longer cause a brief 5xx spike.

2. Name container ports

The api and task-processor container ports were unnamed. PodMonitoring resources that reference them by name (port: http or port: prom) silently scraped nothing as a result. Adding names fixes that. Existing Service and ServiceMonitor resources are unaffected — they reference ports numerically or via the Service name.

Contributes to Flagsmith/infrastructure#317

Signed-off-by: germangarces <german.garces@flagsmith.com>

The api and task-processor container ports were unnamed, so any PodMonitoring (or other) resource referencing them by name (e.g. `port: http`) could not resolve them and silently scraped nothing. Name the existing container port `http`, and declare the Prometheus port 9100 as `prom` when `prometheus.enabled` is true. Service and ServiceMonitor resources are unaffected: both reference ports by numeric value or by the Service's own port name.

matthewelwell · 2026-05-20T10:45:07Z

+  # Container lifecycle hooks. Default preStop delays SIGTERM so the
+  # LB / endpoints controller has time to deregister the pod before
+  # gunicorn closes its listen socket. Without this, rolling deploys
+  # and HPA scale-down can cause a short 5xx spike on traffic that
+  # the LB routes to the pod after it has stopped accepting connections.
+  lifecycle:
+    preStop:
+      exec:
+        command: ["sleep", "20"]


I don't quite understand if we're describing the default or the custom exec command we've added here?

We're talking about the custom command we've added. But I have rephrased it so is more understandable: 0210430

matthewelwell · 2026-05-20T10:45:44Z

+  # Pod termination grace period in seconds. Must exceed the LB's
+  # connection-draining timeout so the kubelet does not SIGKILL
+  # the pod while the LB is still draining in-flight connections.
+  terminationGracePeriodSeconds: 75


Why did we land on 75?

20s preStop + 30s gunicorn default graceful worker shutdown + 25s for possible in-flight requests

Signed-off-by: germangarces <german.garces@flagsmith.com>

germangarces added 2 commits May 13, 2026 11:19

fix(api): close graceful-shutdown gap behind LB

9fd621b

Signed-off-by: germangarces <german.garces@flagsmith.com>

germangarces changed the title ~~fix(api): close graceful-shutdown gap behind LB~~ fix(api,task-processor): Close shutdown drain gap and name container ports May 13, 2026

germangarces requested a review from khvn26 May 19, 2026 08:12

matthewelwell reviewed May 20, 2026

View reviewed changes

docs: clarify lifecyle comment

0210430

Signed-off-by: germangarces <german.garces@flagsmith.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(api,task-processor): Close shutdown drain gap and name container ports#533

fix(api,task-processor): Close shutdown drain gap and name container ports#533
germangarces wants to merge 3 commits into
mainfrom
fix/drain-timers

germangarces commented May 13, 2026 •

edited

Loading

Uh oh!

matthewelwell May 20, 2026

Uh oh!

germangarces May 20, 2026

Uh oh!

matthewelwell May 20, 2026

Uh oh!

germangarces May 20, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

germangarces commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

1. Close shutdown drain gap

2. Name container ports

Uh oh!

matthewelwell May 20, 2026

Choose a reason for hiding this comment

Uh oh!

germangarces May 20, 2026

Choose a reason for hiding this comment

Uh oh!

matthewelwell May 20, 2026

Choose a reason for hiding this comment

Uh oh!

germangarces May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

germangarces commented May 13, 2026 •

edited

Loading

germangarces May 20, 2026 •

edited

Loading