Hello all, today I wanted to share Mummy, my new multithreaded HTTP + WebSocket server written entirely in Nim.
I started writing Mummy because I wanted an alternative to async. There are very good historical reasons for async (refc + threadlocal heaps making multithreaded stuff extra challenging), but ORC / ARC and Nim 2.0 coming very soon have really opened the door to another option: threads.
Mummy takes what I think is a pretty modern approach to threads and IO. The general model goes like this: multiplex socket IO on one thread and dispatch ready-to-go requests to a pool of worker threads. This keeps worker threads far away from client sockets while still enabling the pleasant, boring, inline blocking Nim code that makes me a very happy programmer.
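To make that concrete, here's what a complete server looks like (adapted from the style of example in the README, so treat it as illustrative):

```nim
import mummy, mummy/routers

proc indexHandler(request: Request) =
  # Plain blocking code: this runs on a worker thread, never the IO thread.
  var headers: HttpHeaders
  headers["Content-Type"] = "text/plain"
  request.respond(200, headers, "Hello, World!")

var router: Router
router.get("/", indexHandler)

# Socket IO is multiplexed on one thread; ready requests are
# dispatched to the worker pool, which runs handlers like the above.
let server = newServer(router)
server.serve(Port(8080))
```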
I see enormous value in simple blocking code and would be willing to make some performance sacrifice to have it. It turns out, though, that I don't have to.
Mummy is able to outperform AsyncHttpServer, as well as Node and Go. Mummy even outperforms HttpBeast, though that result may be due to a bug rather than a real advantage.
You can confirm the results by looking at the benchmarks and the code behind them, and by running them yourself.
Mummy basically offers the maximum performance potential while also enabling you to write the simplest code. This is a dream combo.
Please check out the Mummy README if you want to learn more.
How would I go about using global objects with Mummy? I assume I'll have to manage atomics/locks manually?
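Something like this sketch is what I have in mind (hypothetical names, assuming handlers run on multiple worker threads so shared globals need explicit synchronization):

```nim
import std/locks

var
  counterLock: Lock
  requestCount: int  # global state shared by all worker threads

initLock(counterLock)

proc countRequest() =
  # Guard every access to the shared global with the lock.
  withLock counterLock:
    inc requestCount
```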
And since there's a maxThreads that defaults to 2x the CPU count, does that mean that if requests take a long time to complete, all other requests will just stall and have to wait for the worker threads to finish their current requests?
Today it does not, so those using Mummy would want to consider having something like Nginx or whatever do the SSL stuff and reverse-proxy to your Mummy server. Or put Cloudflare in front of the server.
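For example, a minimal Nginx server block terminating TLS and proxying to a locally listening Mummy server might look like this (hypothetical domain, port, and certificate paths):

```nginx
server {
    listen 443 ssl;
    server_name example.com;

    ssl_certificate     /etc/letsencrypt/live/example.com/fullchain.pem;
    ssl_certificate_key /etc/letsencrypt/live/example.com/privkey.pem;

    location / {
        # Mummy listens on localhost; Nginx handles the TLS.
        proxy_pass http://127.0.0.1:8080;
        proxy_set_header Host $host;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    }
}
```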
I've personally found self-hosting + Cloudflare Tunnels to be an interesting option too, very old+new. Similar things can be done with a public VPS or whatever, lots of ways to skin this cat.
Doesn't seem so, but usually those services are served behind an Nginx/Apache/Caddy instance or Cloudflare (or some other cloud service) that handles all the details.
Yes, I know. So let me rephrase my question. How much work would it be to implement SSL support?
Great work, although this is obviously more of a proof of concept and not yet ready for serious work.
Besides SSL, there doesn't appear to be any handling/parsing of URL parameters, and that parsing is also critical to performance. Jester creates a very useful Request object, but that probably doesn't explain the full performance gap.
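Absent built-in support, I'd guess handler code today would parse parameters itself with std/uri, something like this (a sketch; it assumes the request exposes its raw URI as a string):

```nim
import std/uri

proc queryParam(rawUri, key: string): string =
  # Pull the query string out of the URI and scan its key=value pairs.
  let query = parseUri(rawUri).query
  for k, v in decodeQuery(query):
    if k == key:
      return v
```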
The await bug is also interesting; I'm not sure if any of the core devs have ideas about that.
I won't claim Mummy is DoS-proof by any means (that'd be nuts without extensive evidence to back it up), but I have been mindful in my choices not to leave obvious weak points.
I did audit asynchttpserver fwiw and I will happily audit Mummy too. I'm not an expert on these things but I have some experience. Please tell me once you consider it ready for me to review.
I'll have to look through your epoll setup! I did a lot of work with epoll and I'm curious how you set it up.
What are you using to send data between the IO thread and workers? Just the threading channels or something else?
I'll have to look through your epoll setup!
I ended up having success with Nim's std/selectors package, so the main socket IO loop uses that. I'm happy this low-level library was available.
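For anyone curious about the pattern, here's a minimal sketch of a std/selectors accept loop (not Mummy's actual code):

```nim
import std/[nativesockets, net, selectors]

let selector = newSelector[int]()
let server = newSocket()
server.setSockOpt(OptReuseAddr, true)
server.bindAddr(Port(8080))
server.listen()
server.getFd().setBlocking(false)

# Watch the listening socket for read-readiness (incoming connections).
selector.registerHandle(server.getFd(), {Event.Read}, 0)

while true:
  for ready in selector.select(-1): # block until something is ready
    if ready.fd == server.getFd().int:
      var client: Socket
      server.accept(client)
      client.getFd().setBlocking(false)
      selector.registerHandle(client.getFd(), {Event.Read}, 0)
    else:
      # A client socket is readable: read and parse here, then hand the
      # completed request off to a worker thread.
      discard
```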
What are you using to send data between the IO thread and workers?
There are only two points where memory crosses threads: requests from the IO thread to worker threads, and responses from worker threads to the IO thread. The same is true for WebSocket messages.
For incoming requests I'm manually managing the memory, since I want the Request object to be safe to share across many threads to enable even further use of threading (fanning an incoming request out onto a few more worker threads to do some work in parallel before responding). That's only something I have running locally; it's not shown in the repo right now.
As for responses from workers -> IO thread, that's an ownership-transfer move into a queue guarded by an Atomic used as a lock. Same idea as a channel, but I had very simple needs so I kept it simple.
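In code, the shape is roughly this (my sketch of the idea, not Mummy's actual implementation):

```nim
import std/[atomics, deques]

type OutgoingBuffer = object
  data: string

var
  responseQueue: Deque[OutgoingBuffer] # shared by workers and the IO thread
  queueLock: Atomic[bool]              # a minimal spinlock

proc enqueueResponse(buffer: sink OutgoingBuffer) =
  while queueLock.exchange(true, moAcquire): # spin until acquired
    discard
  responseQueue.addLast(move buffer) # ownership moves into the queue
  queueLock.store(false, moRelease)

proc tryDequeueResponse(buffer: var OutgoingBuffer): bool =
  while queueLock.exchange(true, moAcquire):
    discard
  if responseQueue.len > 0:
    buffer = responseQueue.popFirst() # ownership moves to the IO thread
    result = true
  queueLock.store(false, moRelease)
```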
This is a step in the right direction (I think it's called the Reactor pattern). It covers most of the cases a server needs.
Good job! @boia01's and the other implementations are good work too :).
Thanks for trying out Mummy. I have a theory on the issue you're facing.
By default Mummy only listens on "localhost" meaning the port is not externally accessible. This is an important default for security reasons, however for a public web server this is a no-go. To address this you'll want to use something like server.serve(Port(80), "0.0.0.0") as shown here: https://github.com/treeform/nimdocs/blob/master/src/nimdocs.nim#L157
Then when you have Cloudflare proxy requests, it will hit IP_ADDRESS:80, for example, and work since binding to "0.0.0.0" makes the port publicly reachable.