Protocol ossification

From HandWiki
Short description: Reduction in the flexibility of network protocol design due to middleboxes

Protocol ossification is the loss of flexibility, extensibility and evolvability of network protocols. This is largely due to middleboxes that are sensitive to the wire image of the protocol, and which can interrupt or interfere with messages that are valid but which the middlebox does not correctly recognise. This is a violation of the end-to-end principle. Secondary causes include inflexibility in endpoint implementations of protocols.

Ossification is a major issue in Internet protocol design and deployment, as it can prevent new protocols or extensions from being deployed on the Internet, or place strictures on the design of new protocols; new protocols may have to be encapsulated in an already-deployed protocol or mimic the wire image of another protocol. Because of ossification, the Transmission Control Protocol (TCP) and User Datagram Protocol (UDP) are the only practical choices for transport protocols on the Internet, and TCP itself has significantly ossified, making extension or modification of the protocol difficult.

Recommended methods of preventing ossification include encrypting protocol metadata, and ensuring that extension points are exercised and wire image variability is exhibited as fully as possible; remedying existing ossification requires coordination across protocol participants. QUIC is the first IETF transport protocol to have been designed with deliberate anti-ossification properties.

History

Significant ossification had set in on the Internet by 2005, with analyses of the problem also being published in that year;[1] (Ammar 2018) suggests that ossification was a consequence of the Internet attaining global scale and becoming the primary communication network.[2]

Multipath TCP was the first extension to a core Internet protocol to deeply confront protocol ossification during its design.[3]

The IETF created the Transport Services (taps) working group in 2014.[4] It has a mandate to mitigate ossification at the transport protocol layer.[5]

QUIC is the first IETF transport protocol to deliberately minimise its wire image to avoid ossification.[6]

The Internet Architecture Board identified design considerations around the exposure of protocol information to network elements as a "developing field" in 2023.[7]

Causes

The primary cause of protocol ossification is middlebox interference,[8] invalidating the end-to-end principle.[9] Middleboxes may entirely block unknown protocols or unrecognised extensions to known protocols, interfere with extension or feature negotiation, or perform more invasive modification of protocol metadata.[10] Not all middlebox modifications are necessarily ossifying; of those which are potentially harmful, they are disproportionately towards the network edge.[11] Middleboxes are deployed by network operators unilaterally to solve specific problems,[12] including performance optimisation, security requirements (e.g., firewalls), network address translation or enhancing control of networks.[13] These middlebox deployments provide localised short-term utility but degrade the global long-term evolvability of the Internet in a manifestation of the tragedy of the commons.[12]

Changes to a protocol must be tolerated by all on-path intermediaries; if wide Internet deployment of the change is desired, then this extends to a large portion of intermediaries on the Internet. A middlebox must tolerate widely-used protocols as they were being used at the time of its deployment, but is liable not to tolerate new protocols or changes to extant ones, effectively creating a vicious cycle as novel wire images cannot gain wide enough deployment to make middleboxes tolerate the new wire image across the entire Internet.[9] Even all participants tolerating the protocol is no guarantee of use: in the absence of a negotiation or discovery mechanism, the endpoints may default to a protocol that is considered more reliable.[14]

Beyond middleboxes, ossification can also be caused by insufficient flexibility within the endpoint's implementation. Operating system kernels are slow to change and deploy,[14] and protocols implemented in hardware can also inappropriately fix protocol details.[15] A widely-used API that makes assumptions about the operation of underlying protocols can hinder the deployment of protocols that do not share those assumptions.[9]

Prevention and remediation

The Internet Architecture Board recommended in 2019 that implicit signals to observers should be replaced with signals deliberately intended for the consumption of those observers, and signals not intended for their consumption should not be available to them (e.g., by encryption); and also that the protocol metadata should be integrity protected so that it cannot be modified by middleboxes.[16] However, even fully encrypted metadata may not entirely prevent ossification in the network, as the wire image of a protocol can still show patterns that come to be relied upon.[17] Network operators use metadata for a variety of benign management purposes,[18] and Internet research is also informed by data gathered from protocol metadata;[19] a protocol's designer must balance ossification resistance against observability for operational or research needs.[17] (Arkko Hardie) provides further guidance on these considerations: disclosure of information by a protocol to the network should be intentional,[20] performed with the agreement of both recipient and sender,[21] authenticated to the degree possible and necessary,[22] only acted upon to the degree of its trustworthiness,[23] and minimised and provided to a minimum number of entities.[24][25]

Active use of extension points is required if they are not to ossify.[26] Reducing the number of extension points, documenting invariants that protocol participants can rely on as opposed to incidental details that must not be relied upon, and prompt detection of issues in deployed systems can assist in ensuring active use.[27] However, even active use may only exercise a narrow portion of the protocol and ossification can still occur in the parts that remain invariant in practice despite theoretical variability.[28][29] "Greasing" an extension point, where some implementations indicate support for non-existent extensions, can ensure that actually-existent-but-unrecognised extensions are tolerated (cf. chaos engineering).[30] HTTP headers are an example of an extension point that has successfully avoided significant ossification, as participants will generally ignore unrecognised headers.[31]

A new protocol may be designed to mimic the wire image of an existing ossified protocol;[32] alternatively, a new protocol may be encapsulated within an existing, tolerated protocol. A disadvantage of encapsulation is that there is typically overhead and redundant work (e.g., outer checksums made redundant by inner integrity checks).[33]

Besides middleboxes, other sources of ossification can also be resisted. User-space implementation of protocols can lead to more rapid evolution. If the new protocol is encapsulated in UDP, then user-space implementation is possible.[34][35] Where support for protocols is uncertain, participants may simultaneously try alternative protocols, at the cost of increasing the amount of data sent.[36]

With sufficient effort and coordination, ossification can be directly reversed. A flag day, where protocol participants make changes in concert, can break the vicious cycle and establish active use. This approach was used to deploy EDNS, which had formerly not been tolerated by servers.[37]

Examples

The Transmission Control Protocol has suffered from ossification.[38] One measurement found that a third of paths across the Internet encounter at least one intermediary that modifies TCP metadata, and 6.5% of paths encounter harmful ossifying effects from intermediaries.[39] Extensions to TCP have been affected: the design of MPTCP was constrained by middlebox behaviour,[3][40] and the deployment of TCP Fast Open has been likewise hindered.[41][38]

The Stream Control Transmission Protocol has been little-deployed on the Internet due to intolerance from middleboxes,[9] and also due to the very widespread BSD sockets API ill-fitting its capabilities.[42] In practice, TCP and UDP are the only usable Internet transport protocols.[43]

Transport Layer Security (TLS) has experienced ossification. TLS was the original context for the introduction of greasing extension points. TLS 1.3, as originally designed, proved undeployable on the Internet: middleboxes had ossified the protocol's version parameter. This was discovered late in the protocol design process, during experimental deployments by web browsers. As a result, version 1.3 mimics the wire image of version 1.2.[44]

QUIC has been specifically designed to be deployable, evolvable and to have anti-ossification properties;[45] it is the first IETF transport protocol to deliberately minimise its wire image for these ends.[6] It is greased,[30] it has protocol invariants explicitly specified,[46] it is encapsulated in UDP, and its protocol metadata is encrypted.[45] Still, applications using QUIC must be prepared to fall back to other protocols, as UDP is blocked by some middleboxes.[47]

See also

References

  1. Ammar 2018, p. 57-58.
  2. Ammar 2018, p. 59.
  3. 3.0 3.1 Raiciu et al. 2012, p. 1.
  4. "Transport Services (taps) – Group history". IETF. https://datatracker.ietf.org/wg/taps/history/. 
  5. "Transport Services – charter-ietf-taps-02". IETF. https://datatracker.ietf.org/doc/charter-ietf-taps/. 
  6. 6.0 6.1 Trammell & Kuehlewind 2019, p. 2.
  7. Arkko et al. 2023, 3. Further Work.
  8. Papastergiou et al. 2017, p. 619.
  9. 9.0 9.1 9.2 9.3 Papastergiou et al. 2017, p. 620.
  10. Edeline & Donnet 2019, p. 171.
  11. Edeline & Donnet 2019, p. 173-175.
  12. 12.0 12.1 Edeline & Donnet 2019, p. 169.
  13. Honda et al. 2011, p. 1.
  14. 14.0 14.1 Papastergiou et al. 2017, p. 621.
  15. Corbet 2015.
  16. Hardie 2019, p. 7-8.
  17. 17.0 17.1 Fairhurst & Perkins 2021, 7. Conclusions.
  18. Fairhurst & Perkins 2021, 2. Current Uses of Transport Headers within the Network.
  19. Fairhurst & Perkins 2021, 3. Research, Development, and Deployment.
  20. Arkko et al. 2023, 2.1. Intentional Distribution.
  21. Arkko et al. 2023, 2.2. Control of the Distribution of Information.
  22. Arkko et al. 2023, 2.3. Protecting Information and Authentication.
  23. Arkko et al. 2023, 2.5. Limiting Impact of Information.
  24. Arkko et al. 2023, 2.4. Minimize Information.
  25. Arkko et al. 2023, 2.6. Minimum Set of Entities.
  26. Thomson & Pauly 2021, 3. Active Use.
  27. Thomson & Pauly 2021, 4. Complementary Techniques.
  28. Thomson & Pauly 2021, 3.1. Dependency Is Better.
  29. Trammell & Kuehlewind 2019, p. 7.
  30. 30.0 30.1 Thomson & Pauly 2021, 3.3. Falsifying Active Use.
  31. Thomson & Pauly 2021, 3.4. Examples of Active Use.
  32. Papastergiou et al. 2017, p. 623.
  33. Papastergiou et al. 2017, p. 623-4.
  34. Papastergiou et al. 2017, p. 630.
  35. Corbet 2016.
  36. Papastergiou et al. 2017, p. 629.
  37. Thomson & Pauly 2021, 3.5. Restoring Active Use.
  38. 38.0 38.1 Thomson & Pauly 2021, A.5. TCP.
  39. Edeline & Donnet 2019, p. 175-176.
  40. Hesmans et al. 2013, p. 1.
  41. Rybczyńska 2020.
  42. Papastergiou et al. 2017, p. 627.
  43. McQuistin, Perkins & Fayed 2016, p. 1.
  44. Sullivan 2017.
  45. 45.0 45.1 Corbet 2018.
  46. Thomson 2021, 2. Fixed Properties of All QUIC Versions.
  47. Kühlewind & Trammell 2022, 2. The Necessity of Fallback.

Bibliography

Further reading