holepunchto/hyperdht

Sending HTTP traffic over Hyperswarm. Should we consider supporting multiplexing in the protocol?

Closed this issue · 25 comments

Sending HTTP traffic over Hyperswarm sockets is a use-case I'm currently exploring. This leads to a small challenge:

HTTP semantics depend on the connection life-cycle. After a response is finished, the server is expected to close the connection. This seems like a problem for Hyperswarm, because connection setup isn't as quick as it is for traditional TCP. We don't want to open and close connections for every request.

One option is to use empty messages to indicate the end of a request or response, without closing the stream. This should work so long as responses are sent in the same order the requests were sent, but that ordering requirement is a performance penalty.
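Roughly what I have in mind - a hypothetical length-prefixed framing (illustrative only, not an existing wire format), where a zero-length frame marks the end of a request or response while the socket stays open:

```js
// Hypothetical framing over a Hyperswarm socket (any Node duplex stream):
// each frame is a 4-byte big-endian length followed by that many bytes.
// A zero-length frame means "this request/response is finished".
function writeFrame (socket, payload) {
  const header = Buffer.alloc(4)
  header.writeUInt32BE(payload.length, 0)
  socket.write(header)
  if (payload.length > 0) socket.write(payload)
}

function endMessage (socket) {
  writeFrame(socket, Buffer.alloc(0)) // end-of-message marker; stream stays open
}
```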

We can improve on that by using a multiplexing protocol such as libp2p-mplex. This would allow multiple in-flight HTTP request/response exchanges, and the "channels" could be freed for reuse after the req/res sessions on each of them end.
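The multiplexed version only needs a channel id added to each frame - again just a sketch of the idea, not libp2p-mplex's actual wire format:

```js
// Channel-tagged framing sketch: frames from several logical streams can be
// interleaved on one socket, so multiple HTTP exchanges stay in flight.
// A zero-length payload closes the channel, freeing its id for reuse.
function writeChannelFrame (socket, channelId, payload) {
  const header = Buffer.alloc(8)
  header.writeUInt32BE(channelId, 0)      // which logical stream
  header.writeUInt32BE(payload.length, 4) // 0 = close this channel
  socket.write(header)
  if (payload.length > 0) socket.write(payload)
}
```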

I'm opening this issue for two reasons:

  1. To confirm that I understand the situation correctly. (not a given)
  2. If I'm correct, I wonder whether there'd be any merit to supporting multiplexing natively in the protocol. I have two reasons for suggesting that: (A) it might be useful in other contexts, and (B) I suspect that sending HTTP over Hyperswarm sockets will be really common, and since Hyperswarm sockets don't have any "standard header" to declare the message format they're about to send, having multiplexing on all sockets would simplify things.

Another option I might look into is using HTTP/2, which supports multiplexing.

EDIT: One consideration is that HTTP/2 sadly does not support "P2P": the client/server relationship can't be inverted on an existing connection. If HTTP/2 is the solution, we'll need at least two Hyperswarm sockets (or an additional multiplexer) to do full P2P messaging.
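FWIW, Node can run an HTTP/2 session over any existing duplex stream, so each direction could look something like this (a sketch only; assumes the Hyperswarm socket behaves like a normal Node stream and that plaintext h2c is acceptable, since the swarm socket is already encrypted):

```js
const http2 = require('http2')

// Client role over an existing swarm socket: createConnection makes the
// session use our stream instead of dialing TCP. The authority string is
// only used for the :authority pseudo-header here.
function h2ClientOverSwarm (swarmSocket) {
  return http2.connect('http://peer', {
    createConnection: () => swarmSocket
  })
}

// Server role: hand incoming swarm sockets to a plaintext HTTP/2 server.
const server = http2.createServer((req, res) => res.end('hello'))
function h2ServerOverSwarm (swarmSocket) {
  server.emit('connection', swarmSocket)
}
```

`h2ClientOverSwarm`/`h2ServerOverSwarm` are made-up names; the point is just that the roles are fixed per socket, hence two sockets for full P2P.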

The hyperswarm layer will likely never support any high-level protocols. Support is coming for TCP in addition to uTP, and we are open to adding more "raw protocols". If you wanna run any multiplexing you should do that on top of your raw connection. The only exception to this might be a websocket interface for browser compat, but I'd prefer not doing that here either, for the same reasons. Likewise headers don't make much sense; it's just an encrypted raw socket.

In addition, connections are always handshaked separately through the DHT, for holepunching reasons.

I'd suggest the following:

  • Quantify what "isn't as quick" means - it's a testnet that is clustered in Europe atm. Do you mean high latency? What are the numbers? Where are you connecting to/from?
  • Contribute low level benchmarks that help us track this over time

In my experiments relaying browser communications to a Hyperswarm-relayed HTTP server, (logical) multiplexing and keep-alive were happening, presumably negotiated between the browser and the web server. This was confirmed and improved on by adding and correctly configuring two HTTP proxies in the chain: nginx for SSL termination and an npm implementation for programmability.

see https://github.com/lanmower/hyper-web-server

@mafintosh I expected that would be the case. That's fine, just wanted to ask.

Quantify what "isn't as quick" means

See https://github.com/pfrazee/hyperswarm-bench. Summarized results:

  • 10 TCP connections: 922ms
  • 10 HS connections: 7036ms

In my experiments relaying browser communications to a Hyperswarm-relayed HTTP server, (logical) multiplexing and keep-alive were happening

I need to understand how that multiplexing is happening. HTTP/1.x doesn't support it AFAIK, so either nginx is using HTTP/2 in its reverse proxy or there's some other mechanism involved that I'm not aware of.

Doesn't matter where the nodes are; I'd imagine the bulk of the current latency is speed-of-light to the testnet DHT, which is all running in northern Germany. All handshakes go through the DHT, always - that's how you holepunch.

With a more widely deployed DHT this might be better or worse. For both your localities I'd imagine it can only get better, as the testnet is really far away geographically.

@mafintosh yeah that's fine. Like I said, this is what I expected. Now I'm just trying to find the right approach to work with these properties -- even if the handshake got down to 100ms, that would still be too high for HTTP proxies.

@mafintosh @lanmower My guess is that we're going to want to establish stable connections between peers and reuse them until some amount of inactive-time passes. What I'm trying to figure out now is, how do the protocols we're going to use behave relative to the connection lifecycle?

If we have a lot of protocols like HTTP which expect to use one connection per exchange, the next step might be to find/create a wrapper protocol. That's why I'm trying to thoroughly understand HTTP's multiplexing behavior, because ideally we wouldn't create a meta protocol. We may have to, though.

Very much doubt that we'll get anywhere close to 100ms on average even when the network is deployed. DHTs require multiple roundtrips, none of which is necessarily optimised for geographic closeness. That's not saying it won't get much faster than it is, but it'll always be much slower, relatively speaking, than connecting directly to anything.

Right! You're debating me on something we agree on. The 100ms was my example of "wow somehow we magically got the DHT to be that fast." My point was, even then it would be too slow for the use-case of naive HTTP proxying.

If you make tons of connections to different keys, yea. For HTTP/1.1, keep-alive should solve it for you by default.
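For reference, on the Node client side keep-alive is just an agent option - a minimal sketch:

```js
const http = require('http')

// Keep-alive agent: sequential requests to the same peer reuse one socket
// instead of opening a new connection per request.
const agent = new http.Agent({ keepAlive: true })

http.get({ host: '127.0.0.1', port: 8080, path: '/', agent }, (res) => {
  res.resume() // drain the body so the socket is released back to the agent
})
```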

(closing for now as there isn't anything actionable for hyperswarm to do here, but happy to keep the discussion going)

Right, okay. Keep-alive was what I was missing about HTTP/1.1. It's not multiplexing, so your requests end up getting de-parallelized, but it does reuse the connection.

Given that making a wrapper protocol is complex, here's what I'll do next:

  • For Web traffic: use HTTP/2 when possible, HTTP/1.1 when necessary.
  • Open two Hyperswarm connections when server-to-server HTTP is required, since HTTP/1.1 and HTTP/2 both fix the server/client roles on a connection. (See the sketch after this list.)
  • ...but look into how gRPC does bi-directional RPC, because some writing suggests they use event-streams or server push to accomplish that.
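The two-connection idea in code, roughly (assumes Hyperswarm sockets are ordinary duplex streams; the function names are made up):

```js
const http = require('http')

const server = http.createServer((req, res) => res.end('ok'))

// Inbound direction: the peer is the HTTP client and we are the server,
// so feed their swarm socket straight into our HTTP server.
function serveHttpOverSwarm (incomingSwarmSocket) {
  server.emit('connection', incomingSwarmSocket)
}

// Outbound direction: we are the HTTP client on our own swarm socket.
// createConnection makes http.get use the existing stream instead of TCP;
// repeated requests would want keep-alive on top of this.
function requestOverSwarm (outgoingSwarmSocket, path) {
  return http.get({ path, createConnection: () => outgoingSwarmSocket }, (res) => {
    res.resume()
  })
}
```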

You should look into HTTP pipelining, it's basically multiplexing for 90% of all use cases.

Also making multiple connections to the SAME host is very optimisable (read: competitive with direct connections)

You should look into HTTP pipelining, it's basically multiplexing for 90% of all use cases.

Yeah, it seems to have pretty mixed support these days because HTTP gives no clear way to establish pipelining support, and I'm not sure you can elegantly handle a "wrong guess." I'll check into it though. HTTP/2 may just be the simplest solution.

Also making multiple connections to the SAME host is very optimisable (read: competitive with direct connections)

By Hyperswarm? That's what we're looking at here, so if Hyperswarm can optimize that to within 50-100ms, then that plus keep-alive means we're golden.

If you are making a proxy, you're implementing the pipelining yourself as the multiplex layer, so I don't think general support matters - this is just a simple plug'n'play solution (i think node core has support even, but maybe i'm remembering wrong)

Yea by Hyperswarm. Pretty easy but not high on the list, so wouldn't happen for a while.

Okay cool. Thanks for the help!

Just confirmed through some googling that node's http server HAS pipelining but the client doesn't. https://github.com/nodejs/undici does tho.
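Sketch of undici's pipelined client, per its docs (the pipelining option caps how many requests can be in flight on one connection):

```js
const { Client } = require('undici')

// HTTP/1.1 pipelining: up to 4 requests in flight on a single connection,
// recovering most of the parallelism that plain keep-alive serializes.
const client = new Client('http://127.0.0.1:8080', { pipelining: 4 })

async function main () {
  const { statusCode, body } = await client.request({ path: '/', method: 'GET' })
  console.log(statusCode)
  body.resume() // drain the body so the pipeline can advance
}

main()
```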

Okay thanks for checking that.

@lanmower I'm guessing your hyper-web-relay package is going to get different kinds of HTTP traffic from nginx than it would from a browser directly accessing it.

That being said, all browsers multiplex.

@pfrazee can you link me to where you do multiplexing? maybe I can blatantly steal your technique for hyper-web-relay :)

I ended up solving it by putting an nginx reverse proxy in front with keep-alive enabled
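For anyone landing here later, the relevant nginx bits look something like this (a sketch; assumes the proxied HTTP service listens on port 8080, cert directives omitted):

```nginx
# Reuse upstream connections instead of opening one per request.
upstream backend {
    server 127.0.0.1:8080;
    keepalive 16;                       # pool of idle upstream connections
}

server {
    listen 443 ssl;
    location / {
        proxy_pass http://backend;
        proxy_http_version 1.1;         # upstream keep-alive requires HTTP/1.1
        proxy_set_header Connection ""; # strip "Connection: close"
    }
}
```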