GEP-3792: Out-of-Cluster Gateways

Issue: #3792
Status: Provisional

User Story

Chihiro and Ian want a way for out-of-cluster Gateways to be able to usefully participate in a GAMMA-compliant in-cluster service mesh.

Historically, API gateways and ingress controllers have often been implemented using a Service of type LoadBalancer fronting a Kubernetes pod running a proxy. This is simple to reason about, easy to manage for sidecar meshes, and will presumably be an important implementation mechanism for the foreseeable future. Some cloud providers, though, are moving the proxy outside of the cluster, for various reasons that are outside the scope of this GEP. Chihiro and Ian want to be able to use these out-of-cluster proxies effectively and safely, though they recognize that this may require additional configuration.

Nomenclature and Background

In this GEP:

We will use out-of-cluster Gateway (OCG) to refer to a conformant implementation of Gateway API's GATEWAY profile that's running outside of the cluster. This would most commonly be a managed implementation from a cloud provider, but of course there are many other possibilities -- and in fact it's worth noting that anything we define here to support OCGs could also be used by workloads that run in-cluster but which, for whatever reason, can't be brought into the mesh in the mesh's usual way.
We'll also distinguish between mTLS meshes, which rely on standard mTLS for secure communication (authentication, encryption, and integrity checking) between workloads, and non-mTLS meshes, which do anything else. We'll focus on mTLS meshes in this GEP; this isn't because of a desire to exclude non-mTLS meshes, but because we'll have enough trouble just wrangling the mTLS meshes! Supporting non-mTLS meshes will be a separate GEP.

Note: It's important to separate mTLS and HTTPS here. Saying that the mTLS meshes use mTLS for secure communication does not preclude them from using custom protocols on top of mTLS, and certainly does not mean that they must use only HTTPS.
Authentication is the act of verifying the identity of some principal; what the principal actually is depends on context. For this GEP we will primarily be concerned with workload authentication, in which the principal is a workload, as opposed to user authentication, in which the principal is the human on whose behalf a piece of technology is acting. We expect that the OCG will handle user auth, but of course meshed workloads can't trust what the OCG says about the user unless the OCG successfully authenticates itself as a workload.

Note: A single workload will have only one identity, but in practice we often see a single identity being used for multiple workloads (both because multiple replicas of a single workload need to share the same identity, and because some low-security workloads may be grouped together under a single identity).
Finally, we'll distinguish between inbound and outbound behaviors.

Inbound behaviors are those that are applied to a request arriving at a given workload. Authorization and rate limiting are canonical examples of inbound behaviors.

Outbound behaviors are those that are applied to a request leaving a given workload. Load balancing, retries, and circuit breakers are canonical examples of outbound behaviors.

Goals

Allow Chihiro and Ian to configure an OCG and a mesh such that the OCG can usefully participate in the mesh, including:
- The OCG must be able to securely communicate with meshed workloads in the cluster, where "securely communicate" includes encryption, authentication, and integrity checking.
- The OCG must have a proper identity within the mesh, so that the mesh can apply authorization policy to requests from the OCG.
- Whatever credentials the OCG and the mesh use to authenticate each other must be able to be properly maintained over time (for example, if they use mTLS, certificates will need rotation over time).
- The OCG must be able to distinguish meshed workloads from non-meshed workloads, so that it can communicate appropriately with each.
Allow Ana to develop and operate meshed applications without needing to know whether the Gateway she's using is an OCG or an in-cluster Gateway.
Define a basic set of requirements for OCGs and meshes that want to interoperate with each other (for example, the OCG and the mesh will likely need to agree on how workload authentication principals are represented).
Define how responsibility is shared between the OCG and the mesh for outbound behaviors applied to requests leaving the OCG. (Note that "the OCG has complete responsibility and authority over outbound behaviors for requests leaving the OCG" is very much a valid definition.)

Non-Goals

Support multicluster operations. It may be the case that functional multicluster (with, e.g., a single OCG fronting multiple clusters) ends up falling out of this GEP, but it is not a goal.
Support meshes interoperating with each other. It's possible that this GEP will lay a lot of groundwork in that direction, but it is not a goal.
Support non-mTLS meshes in Gateway API 1.4. We'll make every effort not to rule out non-mTLS meshes, but since starting with the mTLS meshes should tackle a large chunk of the industry with a single solution, that will be the initial focus.
Solve the problem of extending a mesh to cover non-Kubernetes workloads (AKA mesh expansion). In many ways, mesh expansion is adjacent to the OCG situation, but where the OCG is aware of the cluster and mesh, mesh expansion deals with a non-Kubernetes workload that is largely not aware of either.
Solve the problem of how to support an OCG doing mTLS directly to a non-meshed workload (AKA the backend TLS problem). Backend TLS to non-meshed workloads is also adjacent to the OCG situation, but its configuration has different needs: backends terminating TLS on their own are likely to need per-workload configuration of certificates, cipher suites, etc., where the mesh as a whole should share a single configuration.
Prevent the OCG API from being used by an in-cluster workload. We're not going to make in-cluster workloads a primary use case for this GEP, but neither are we disallowing them.

Overview

Making an OCG work with an in-cluster mesh at the most basic level doesn't really require any special effort. As long as the OCG has IP connectivity to pods in the cluster, and the mesh is configured with permissive security, the OCG can simply forward traffic from clients directly to meshed pods, and things will "function" in that requests from clients, through the OCG, can be handled by workloads in the cluster.

Of course, this sort of non-integration has obvious and terrible security implications, since the traffic between the OCG and the application pods in the cluster will be cleartext in the scenario above. The lack of encryption is awful in its own right, but the fact that any mTLS mesh uses mTLS for authentication also means that the mesh loses any way to enforce authorization policy around the OCG. Combined, these items amount to a major problem.

An additional concern is that the OCG needs to be able to implement features (e.g. sticky sessions) which require it to speak directly to endpoint IPs, which can limit what the mesh will be able to do. This is likely a more minor concern since a conformant OCG should itself be able to provide advanced functionality; however, at minimum it can create some friction in configuration.

The Problems

To allow the OCG to usefully participate in the mesh, we need to solve at least four significant problems. Thankfully, these are mostly problems for Chihiro -- if we do our jobs correctly, Ana will never need to know.

1. The Trust Problem

The trust problem is fairly straightforward to articulate: the OCG and the mesh both need access to whatever information will allow each of them to trust the other.

In the case of mTLS meshes, we are helped by the fact that basically every OCG candidate already speaks mTLS, so the trust problem becomes "only" one of setting things up for the OCG and the mesh to each include the other's CA certificate in their trust bundle. (They may be using the same CA certificate, but we shouldn't rely on that.)
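
As a minimal sketch of what "include the other's CA certificate in their trust bundle" amounts to, the Python below models a trust bundle as an ordered, de-duplicated list of CA certificates. The `TrustBundle` class, its methods, and `exchange_trust` are hypothetical names used only for illustration; they are not part of Gateway API, and the strings in the usage note stand in for real PEM data.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class TrustBundle:
    """Hypothetical model: an ordered, de-duplicated list of CA certificates."""
    ca_certs: List[str] = field(default_factory=list)

    def add(self, pem: str) -> None:
        # Skip duplicates so that a CA shared by both sides is listed once.
        if pem not in self.ca_certs:
            self.ca_certs.append(pem)

    def trusts(self, issuer_pem: str) -> bool:
        # Real verification walks a certificate chain; this only checks membership.
        return issuer_pem in self.ca_certs


def exchange_trust(ocg: TrustBundle, ocg_ca: str,
                   mesh: TrustBundle, mesh_ca: str) -> None:
    """Give each side its own CA plus the other side's CA."""
    for bundle in (ocg, mesh):
        bundle.add(ocg_ca)
        bundle.add(mesh_ca)
```

After `exchange_trust(ocg, "OCG-CA", mesh, "MESH-CA")`, both `ocg.trusts("MESH-CA")` and `mesh.trusts("OCG-CA")` hold. If the two sides happen to share one CA, the bundle still contains it only once, which is why the sketch does not assume the CAs differ.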

In the case of non-mTLS meshes, the trust problem is more complex; this is the major reason that this GEP is focused on mTLS meshes.

2. The Protocol Problem

The protocol problem is that the data-plane elements of the mesh may assume that they'll always be talking only to other mesh data-plane elements, which the OCG will not be. If the mesh data-plane elements use a specific protocol, then either the OCG will need to speak that protocol, or the mesh will need to relax its requirements (perhaps on a separate port?) to accept requests directly from the OCG.

For example, Linkerd and Istio Legacy both use standard mTLS for proxy-to-proxy communication -- however, both also use ALPN to negotiate custom (and distinct!) "application" protocols during mTLS negotiation, and depending on the negotiated protocol, both can require the sending proxy to send additional information after mTLS is established, before any client data is sent. (For example, Linkerd requires the originating proxy to send transport metadata right after the TLS handshake, and it will reject a connection which doesn't do that correctly.)
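
To make the ALPN interaction concrete, here is a small sketch of server-side protocol selection and of the "extra data before client bytes" decision that follows from it. It is an illustration under stated assumptions, not either mesh's actual logic: `example-mesh/v1` is a made-up protocol identifier, and `select_alpn` and `needs_mesh_preamble` are hypothetical helpers. Only `h2` and `http/1.1` are real registered ALPN identifiers.

```python
from typing import Optional, Sequence

# Hypothetical identifier; real meshes define their own ALPN protocol names.
MESH_PROTOCOL = "example-mesh/v1"


def select_alpn(server_prefs: Sequence[str],
                client_offer: Sequence[str]) -> Optional[str]:
    """Return the first server-preferred protocol that the client also offered."""
    offered = set(client_offer)
    for proto in server_prefs:
        if proto in offered:
            return proto
    # No overlap: the handshake cannot agree on an application protocol.
    return None


def needs_mesh_preamble(negotiated: Optional[str]) -> bool:
    """True when the sender must emit mesh-specific data before client bytes."""
    return negotiated == MESH_PROTOCOL
```

An OCG that offers only `["h2", "http/1.1"]` to a proxy configured with `[MESH_PROTOCOL, "h2"]` would negotiate `h2` and owe no preamble. Against a proxy that accepts only `MESH_PROTOCOL`, `select_alpn` returns `None`, which is the case where either the OCG must learn the mesh protocol or the mesh must relax its requirements.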

3. The Discovery Problem

When using a mesh, not every workload in the cluster is required to be meshed (for example, it's fairly common to have some namespaces meshed and other namespaces not meshed, especially during migrations). The discovery problem here is that the OCG needs to know which workloads are meshed, so that it can choose appropriate communication methods for them.
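
The decision the OCG has to make can be sketched as follows. This GEP does not yet define how meshed workloads are signaled, so the sketch assumes, purely for illustration, that the OCG is handed a set of meshed namespace names; `Workload`, `Transport`, and `choose_transport` are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Set


class Transport(Enum):
    MESH_MTLS = "mesh-mtls"  # authenticate and encrypt using the mesh's trust
    DIRECT = "direct"        # whatever the OCG would do if no mesh were present


@dataclass(frozen=True)
class Workload:
    name: str
    namespace: str


def choose_transport(workload: Workload, meshed_namespaces: Set[str]) -> Transport:
    """Pick a communication method based on an assumed per-namespace signal."""
    if workload.namespace in meshed_namespaces:
        return Transport.MESH_MTLS
    return Transport.DIRECT
```

With `meshed_namespaces = {"shop"}`, a workload in `shop` is reached over mesh mTLS while a workload in any other namespace is reached directly. A per-workload signal would change only the membership test, not the shape of the decision.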

4. The Outbound Behavior Problem

The OCG will need to speak directly to endpoints in the cluster, as described above. This will prevent most meshes from being able to tell which service was originally requested, which makes it impossible for the mesh to apply outbound behaviors. This is the outbound behavior problem: it implies that either the OCG must be responsible for outbound behaviors for requests leaving the OCG for a meshed workload, or that the OCG must supply the mesh with enough information about the targeted service to allow the mesh to apply those outbound behaviors (if that's even possible: sidecar meshes may very well simply not be able to do this).

This is listed last because it shouldn't be a functional problem to simply declare the OCG solely responsible for outbound behaviors for requests leaving the OCG. It is a UX problem: if a given workload needs to be used by both the OCG and other meshed workloads, you'll need to either provide two Routes with the same configuration, or you'll need to provide a single Route with multiple parentRefs.
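
For reference, the "single Route with multiple parentRefs" option might look like the manifest below, written as a Python dictionary so that it can be inspected programmatically. The field names follow the existing HTTPRoute schema, including the GAMMA convention of a Service parentRef with an empty `group`. The resource names (`ocg-gateway`, `orders`, `shop`), the port, and the `parent_kinds` helper are hypothetical, and representing the OCG as an ordinary Gateway parentRef is an assumption of this sketch.

```python
# Hypothetical names throughout: "ocg-gateway", "orders", "shop", port 8080.
http_route = {
    "apiVersion": "gateway.networking.k8s.io/v1",
    "kind": "HTTPRoute",
    "metadata": {"name": "orders", "namespace": "shop"},
    "spec": {
        "parentRefs": [
            # North/south: requests arriving through the out-of-cluster Gateway.
            {
                "group": "gateway.networking.k8s.io",
                "kind": "Gateway",
                "name": "ocg-gateway",
            },
            # East/west: requests from meshed workloads addressed to the Service.
            {"group": "", "kind": "Service", "name": "orders", "port": 8080},
        ],
        # One shared set of rules, instead of two Routes kept in sync by hand.
        "rules": [
            {"backendRefs": [{"name": "orders", "port": 8080}]},
        ],
    },
}


def parent_kinds(route: dict) -> list:
    """List the kinds of parents that a Route attaches to, in order."""
    return [ref["kind"] for ref in route["spec"]["parentRefs"]]
```

Here `parent_kinds(http_route)` returns `["Gateway", "Service"]`, showing that one Route carries the configuration for both traffic directions.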

API

Most of the API work for this GEP is TBD at this point, but there are two important points to note:

First, Gateway API has never defined a Mesh resource because, to date, it's never been clear what would go into it. This may be the first configuration item that causes us to need a Mesh resource.

Second, since the API should affect only Gateway API resources, it is not a good candidate for policy attachment. It is likely to be much more reasonable to simply provide whatever extra configuration we need inline in the Gateway or Mesh resources.

Graduation Criteria

In addition to the general graduation criteria, this GEP must also guarantee that all four of the problems listed above have resolutions, and must have implementations from at least two different Gateways and two different meshes.

Gateway for Ingress (North/South)

Gateway for Mesh (East/West)

Conformance Details

Feature Names

This GEP will use the feature name MeshOffClusterGateway, under the assumption that we will indeed need a Mesh resource.