[refactor] Split Services into watcher/handler under felix by sknat · Pull Request #776 · projectcalico/vpp-dataplane

sknat · 2025-09-02T15:04:25Z

This patch splits services into two components:

- a watcher that handles the informer fetching services and
  endpoints from the k8s API.
- a handler that takes care of programming VPP with the NAT
  rules, within the context of the felix server's single goroutine.

The intent is to move away from a model with multiple servers
replicating state and communicating over a pubsub. That model is
prone to race conditions and deadlocks, and provides few
benefits, since scale and asynchronicity will not be a constraint
on nodes with a relatively small number of pods (~100), as is the
k8s default.
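
The watcher/handler split described above can be sketched in Go as follows. This is a minimal illustration and not the code in this PR: `ServiceEvent`, `watch`, and `handle` are hypothetical names, the informer is replaced by a fixed slice of updates, and programming VPP is replaced by a map update.

```go
package main

import "fmt"

// ServiceEvent is a hypothetical event type used only for this sketch.
type ServiceEvent struct {
	Kind     string   // "add" or "delete"
	Name     string   // service name
	Backends []string // endpoint addresses
}

// watch stands in for the informer-driven watcher. It only forwards
// events into the queue and never reads or writes handler state.
func watch(updates []ServiceEvent, queue chan<- ServiceEvent) {
	for _, ev := range updates {
		queue <- ev
	}
	close(queue)
}

// handle drains the queue in a single goroutine, so the state map is
// accessed without a mutex. A real handler would program NAT rules
// in VPP here; this sketch only updates the map.
func handle(queue <-chan ServiceEvent) map[string][]string {
	state := make(map[string][]string)
	for ev := range queue {
		switch ev.Kind {
		case "add":
			state[ev.Name] = ev.Backends
		case "delete":
			delete(state, ev.Name)
		}
	}
	return state
}

func main() {
	queue := make(chan ServiceEvent, 16)
	go watch([]ServiceEvent{
		{Kind: "add", Name: "dns", Backends: []string{"10.0.0.10:53"}},
		{Kind: "add", Name: "web", Backends: []string{"10.0.0.20:80"}},
		{Kind: "delete", Name: "web"},
	}, queue)
	state := handle(queue)
	fmt.Println(len(state), state["dns"])
}
```

Because only the goroutine running `handle` touches `state`, no lock is needed, which is the property the single-goroutine model relies on.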

This patch splits the felix server in two pieces:

- a felix watcher placed under `agent/watchers/felix`
- a felix server placed under `agent/felix`

The former is only responsible for watching and submitting events into a single event queue. The latter receives the events in a single goroutine and programs VPP as a single thread.

The intent is to move away from a model with multiple servers replicating state and communicating over a pubsub. That model is prone to race conditions and deadlocks, and provides few benefits, since scale and asynchronicity will not be a constraint on nodes with a relatively small number of pods (~100), as is the k8s default.

Signed-off-by: Nathan Skrzypczak <nathan.skrzypczak@gmail.com>

This patch splits the CNI watcher and handlers in two pieces. The handling is done in the main 'felix' goroutine, while the watching / grpc server lives under watchers/ and does not store or access agent state.

The intent is to move away from a model with multiple servers replicating state and communicating over a pubsub. That model is prone to race conditions and deadlocks, and provides few benefits, since scale and asynchronicity will not be a constraint on nodes with a relatively small number of pods (~100), as is the k8s default.

Signed-off-by: Nathan Skrzypczak <nathan.skrzypczak@gmail.com>

This patch moves the Connectivity handlers into the main felix loop to allow lockless access to the cache.

The intent is to move away from a model with multiple servers replicating state and communicating over a pubsub. That model is prone to race conditions and deadlocks, and provides few benefits, since scale and asynchronicity will not be a constraint on nodes with a relatively small number of pods (~100), as is the k8s default.

Signed-off-by: Nathan Skrzypczak <nathan.skrzypczak@gmail.com>

This patch splits services into two components:

- a watcher that handles the informer fetching services and endpoints from the k8s API.
- a handler that takes care of programming VPP with the NAT rules, within the context of the felix server's single goroutine.

The intent is to move away from a model with multiple servers replicating state and communicating over a pubsub. That model is prone to race conditions and deadlocks, and provides few benefits, since scale and asynchronicity will not be a constraint on nodes with a relatively small number of pods (~100), as is the k8s default.

Also cleaned up unused code from the single-thread agent refactor.

Signed-off-by: Nathan Skrzypczak <nathan.skrzypczak@gmail.com>

aritrbas · 2026-05-19T08:02:42Z

Rebased on latest master to resolve merge conflicts and applied some fixes to comply with the latest Felix API updates in release/v3.31.0 and release/v3.32.0 as well as the NPOL and CNAT changes in VPP.

sknat requested review from aritrbas, hedibouattour and onong September 2, 2025 15:04

sknat self-assigned this Sep 2, 2025

sknat added this to the agent refactoring single thread milestone Nov 17, 2025

sknat changed the title ~~Split Services into watcher/handler under felix~~ [refactor] Split Services into watcher/handler under felix Jan 7, 2026

sknat added 4 commits May 18, 2026 18:53

aritrbas force-pushed the nsk-split-svc branch from 05e1278 to 012b664 Compare May 19, 2026 08:01

sknat wants to merge 4 commits into master from nsk-split-svc
