-
Notifications
You must be signed in to change notification settings - Fork 311
add HTTP spec #508
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add HTTP spec #508
Changes from 30 commits
bc1aa59
1f075f6
12f86b8
146c09a
5398f5d
b6c1bc2
8a57943
d506145
946f516
3681472
dd5d07c
46d1857
ebe612c
7e5a077
db2b3b5
6319458
c7c9c43
454e25c
a25267b
3014b22
f96359b
1e87960
d0f0d93
8fbd64a
71415b0
4a03bb0
877899d
dc71f2c
d8850aa
78e8ca1
d30efda
8628b5a
3c0ac40
75bc635
f95e4db
e3eb9dc
95ffe6d
8f44d00
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change | ||||
|---|---|---|---|---|---|---|
| @@ -0,0 +1,133 @@ | ||||||
| # HTTP | ||||||
|
|
||||||
| | Lifecycle Stage | Maturity | Status | Latest Revision | | ||||||
| | --------------- | ------------- | ------ | --------------- | | ||||||
| | 1A | Working Draft | Active | r0, 2023-01-23 | | ||||||
|
|
||||||
| Authors: [@marten-seemann], [@MarcoPolo] | ||||||
|
|
||||||
| Interest Group: [@lidel], [@thomaseizinger] | ||||||
|
|
||||||
| [@marten-seemann]: https://github.com/marten-seemann | ||||||
| [@MarcoPolo]: https://github.com/MarcoPolo | ||||||
| [@lidel]: https://github.com/lidel | ||||||
| [@thomaseizinger]: https://github.com/thomaseizinger | ||||||
|
|
||||||
| ## Introduction | ||||||
|
|
||||||
| This document defines how libp2p nodes can offer and use an HTTP transport alongside their other transports to support application protocols with HTTP semantics. This allows a wider variety of nodes to participate in the libp2p network, for example: | ||||||
|
MarcoPolo marked this conversation as resolved.
|
||||||
|
|
||||||
| - Browsers communicating with other libp2p nodes without needing a WebSocket, WebTransport, or WebRTC connection. | ||||||
| - HTTP only edge workers can run application protocols and respond to peers on the network. | ||||||
| - `curl` from the command line can make requests to other libp2p nodes. | ||||||
|
|
||||||
| The HTTP transport will also allow application protocols to make use of HTTP intermediaries such as HTTP caching, and layer 7 proxying and load balancing. This is all in addition to the existing features that libp2p provides such as: | ||||||
|
MarcoPolo marked this conversation as resolved.
|
||||||
|
|
||||||
| - Connectivity – Work on top of WebRTC, WebTransport, QUIC, TCP, or an HTTP transport. | ||||||
| - Hole punching – Work with peers behind NATs. | ||||||
| - Peer ID Authentication – Authenticate your peer by their libp2p peer id. | ||||||
| - Peer discovery – Learn about a peer given their peer id. | ||||||
|
|
||||||
| ## HTTP Semantics vs Encodings vs Transport | ||||||
|
|
||||||
| HTTP is a bit of an overloaded term. This section aims to clarify what we’re talking about when we say “HTTP”. | ||||||
|
|
||||||
|
|
||||||
| ```mermaid | ||||||
| graph TB | ||||||
| subgraph "HTTP Semantics" | ||||||
| HTTP | ||||||
| end | ||||||
| subgraph "Encoding" | ||||||
| HTTP1.1[HTTP/1.1] | ||||||
| HTTP2[HTTP/2] | ||||||
| HTTP3[HTTP/3] | ||||||
| end | ||||||
| subgraph "Transports" | ||||||
| Libp2p[libp2p streams] | ||||||
| HTTPTransport[HTTP transport] | ||||||
| end | ||||||
| HTTP --- HTTP1.1 | ||||||
| HTTP --- HTTP1.1 | ||||||
| HTTP1.1 --- Libp2p | ||||||
| HTTP --- HTTP2 | ||||||
| HTTP --- HTTP3 | ||||||
| HTTP1.1 --- HTTPTransport | ||||||
| HTTP2 --- HTTPTransport | ||||||
| HTTP3 --- HTTPTransport | ||||||
|
Comment on lines
+37
to
+57
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. h2c to show how it can be multiplexed and negociated in many ways (header compression, binary based protocol, ...) ? |
||||||
| ``` | ||||||
|
|
||||||
| - *HTTP semantics* ([RFC 9110](https://www.rfc-editor.org/rfc/rfc9110.html)) is | ||||||
| the stateless application-level protocol that you work with when writing HTTP | ||||||
| apis (for example). | ||||||
|
|
||||||
| - *HTTP encoding* is the thing that takes your high level request/response | ||||||
| defined in terms of HTTP semantics and encodes it into a form that can be sent | ||||||
| over the wire. | ||||||
|
|
||||||
| - *HTTP transport* is the thing that takes your encoded reqeust/response and | ||||||
| sends it over the wire. For HTTP/1.1 and HTTP/2, this is a TCP+TLS connection. | ||||||
| For HTTP/3, this is a QUIC connection. | ||||||
|
|
||||||
| When this document says *HTTP* it is generally referring to *HTTP semantics*. | ||||||
|
|
||||||
| ## Interoperability with existing HTTP systems | ||||||
|
|
||||||
| A goal of this spec is to allow libp2p to be able to interoperate with existing HTTP servers and clients. Care is taken in this document to not introduce anything that would break interoperability with existing systems. | ||||||
|
Comment on lines
+72
to
+76
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. So this is a bit confusing to me. Above you are saying the you generally refer to HTTP semantics and the next sentence says that a goal is to interoperate with existing HTTP servers and clients which refers to the transport, correct?
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Refers to both actually |
||||||
|
|
||||||
| ## HTTP Transport | ||||||
|
|
||||||
| Nodes MUST use HTTPS (i.e., they MUST NOT use plaintext HTTP). It is RECOMMENDED to use HTTP/2 and HTTP/3. | ||||||
|
|
||||||
| Nodes signal support for their HTTP transport using the `/http` component in | ||||||
| their multiaddr. E.g., `/dns4/example.com/tls/http`. See the [HTTP multiaddr | ||||||
|
MarcoPolo marked this conversation as resolved.
|
||||||
| component spec](https://github.com/libp2p/specs/pull/550) for more details. | ||||||
|
|
||||||
| ## Namespace | ||||||
|
|
||||||
| libp2p does not squat the global namespace. libp2p application protocols can be | ||||||
| discovered by the [well-known resource](https://www.rfc-editor.org/rfc/rfc8615) | ||||||
| `.well-known/libp2p`. This allows server operators to dynamically change the | ||||||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We use well-known resources elsewhere, e.g.
Suggested change
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Aesthetically it's nicer if we aren't referencing HTTP multiple times. Technically this doesn't matter, but my vote is for
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm not sure I agree. Given we have other well-known resources, Though yes, from a technical perspective it just has to be a predictable string with a low chance of collision.
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @lidel can you be our tie breaker?
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Squatting I think if we want to avoid that, there is still time to can change it (this spec is still a draft), we should go with either The latter ( If we don't want 'http' twice, could be
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This is different though. This is metadata about a peer's supported protocols. The well-know webtransport URI is about where to send the HTTP CONNECT request to.
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm not sure it is, I think how you interact with the well-known resource is a detail of the protocol that's unrelated to the path it uses.
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The Any other well-known resource we would want could also fit under that suffix. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Changing this from "/.well-known/libp2p" to "/.well-known/libp2p/protocols" will break the ability to communicate with existing servers deployed without upgrading them first. Since these servers are running in places not under our control, they will not be immediately upgradable. Because there are already deployments that use the old value, I suggest keeping it as
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I understand the pain in this rollout. Especially when the full rollout is outside your control. Maybe go-libp2p can provide a transition period to help out? Since this spec itself is in draft we need flexibility in how things work. As much as I prefer the Once this is merged, things will be stable and we won't break existing users. That's the guarantee that comes with merging this. And partly the reason why I haven't merged it yet. I want to build the js-libp2p side before merging and make sure that the interop works and is reasonable. I'm pretty close on that. Hoping to get it done by the end of the month :) |
||||||
| URLs of the application protocols offered, and not hard-code any assumptions how | ||||||
| a certain resource is meant to be interpreted. | ||||||
|
|
||||||
| ```json | ||||||
|
|
||||||
| { | ||||||
| "protocols": { | ||||||
| "/kad/1.0.0": {"path": "/kademlia/"}, | ||||||
| "/ipfs/gateway": {"path": "/"}, | ||||||
| } | ||||||
| } | ||||||
| ``` | ||||||
|
|
||||||
| The resource contains a mapping of application protocols to a URL namespace. For | ||||||
| example, this configuration file would tell a client | ||||||
|
|
||||||
| 1. That the Kademlia application protocol is available with prefix `/kademlia` | ||||||
|
MarcoPolo marked this conversation as resolved.
Outdated
|
||||||
| and, | ||||||
| 2. The [IPFS Trustless Gateway API](https://specs.ipfs.tech/http-gateways/trustless-gateway/) is mounted at `/`. | ||||||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. If I understand correctly, this only specifies the path but not the method (GET / POST) to use when accessing this protocol over HTTP and that's up to the specific protocol to define how to run it over HTTP?
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Hm.. the methods will be specific to each protocol at each mount point, so not part of this spec?
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. That makes sense. As I see it there are two sections in this document:
So should we mention it in the specs that a libp2p protocol supporting http transport should specify the http method and headers to be used for the protocol. For the path they can expose it via the wellknown endpoint
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm not sure I understand. An application protocol would be built using HTTP semantics, and that protocol would then be able to run on libp2p streams or "standard" http transports like h2, h3. What do you mean by:
This spec does not define how you would take an existing libp2p protocol and map it to HTTP semantics. That is best done by the specific protocol itself. But maybe I'm misunderstanding your point? |
||||||
|
|
||||||
| It is valid to expose a service at `/`. It is RECOMMENDED that implementations facilitate the coexistence of different service endpoints by ensuring that more specific URLs are resolved before less specific ones. For example, when registering handlers, more specific paths like `/kademlia/foo` should take precedence over less specific handler, such as `/`. | ||||||
|
|
||||||
| ## Peer ID Authentication | ||||||
|
|
||||||
| When using the HTTP Transport, peer id authentication is optional. You only pay | ||||||
|
MarcoPolo marked this conversation as resolved.
Outdated
|
||||||
| for it if you need it. This benefits use cases that don’t need peer | ||||||
| authentication (e.g., fetching content addressed data) or authenticate some | ||||||
| other way (not tied to libp2p peer ids). | ||||||
|
|
||||||
| Specific authentications schemes for authenticating Peer IDs will be defined in | ||||||
|
MarcoPolo marked this conversation as resolved.
Outdated
|
||||||
| a future spec. | ||||||
|
|
||||||
| ## Using HTTP semantics over stream transports | ||||||
|
MarcoPolo marked this conversation as resolved.
Outdated
|
||||||
|
|
||||||
| Application protocols using HTTP semantics can run over any libp2p stream transport. Clients open a new stream using `/http/1.1` as the protocol identifer. Clients encode their HTTP request as an HTTP/1.1 message and send it over the stream. Clients parse the response as an HTTP/1.1 message and then close the stream. Clients SHOULD NOT pipeline requests over a single stream. Servers SHOULD set the [`Connection: close` header](https://datatracker.ietf.org/doc/html/rfc2616#section-14.10) to signal to clients that this is not a persistent connection. | ||||||
|
MarcoPolo marked this conversation as resolved.
Outdated
|
||||||
|
|
||||||
| HTTP/1.1 is chosen as the minimum bar for interoperability, but other encodings of HTTP semantics are possible as well and may be specified in a future update. | ||||||
|
Jorropo marked this conversation as resolved.
Outdated
|
||||||
|
|
||||||
| ## Using other request-response semantics (not HTTP) | ||||||
|
|
||||||
| This document has focused on using HTTP semantics, but HTTP may not be the common divisor amongst all transports (current and future). It may be desirable to use some other request-response semantics for your application-level protocol, perhaps something like rust-libp2p’s [request-response](https://docs.rs/libp2p/0.52.1/libp2p/request_response/index.html) abstraction. Nothing specified in this document prohibits mapping other semantics onto HTTP semantics to keep the benefits of using an HTTP transport. | ||||||
|
|
||||||
| To support the simple request-response semantics, for example, the request MUST be encoded within a `POST` request to the proper URL (as defined in the Namespace section). The response is read from the body of the HTTP response. The client MUST authenticate the server and itself **before** making the request. | ||||||
|
MarcoPolo marked this conversation as resolved.
Outdated
MarcoPolo marked this conversation as resolved.
Outdated
|
||||||
Uh oh!
There was an error while loading. Please reload this page.