nimforum mirror - Nimony async/passive procs

mhjkl (orginal) [2026-06-23T23:14:36+02:00] view original

I know it's still early to tell, but I'm trying to understand what direction Nim 3 is going for when it comes to async. Passive calls are supposed to be transparently wrapped in a trampoline when called from a non-passive proc and ran until completion, but how is it expected to complete if it does an asynchronous operation that requires waiting? If the wrapper just calls the event loop scheduler and polls then runs other events until the called proc completes you could end up with recursively nested event loops if there are function calls that alternate between passive and non-passive procs. Is it supposed to just block the thread until completion? But then it's not really a scheduler and you will need different schedulers at the bottom of the stack and when calling deeper within non-passive procs, a non-blocking one on the top level and a blocking one when nesting calls or you would need some mechanism to tell async functions or scheduler calls if they're being called recursively to switch to blocking. Since they're supposed to be called unmarked without any await marking I suspect some amount of mixing of passive and non-passive procs is to be expected and I'm wondering how it would be handled. I'm not a big fan of the whole "colored functions bad" meme, though it's not that I think colorless async is bad. But my concern here is that I'm not sure what Nim 3 is going for here and what mental model to hold of it. I remember Zig being very confused as to what it wants to do with async, though I stopped following its development a while ago and maybe it's all clear now

Araq (orginal) [2026-06-24T00:15:01+02:00] view original

If the wrapper just calls the event loop scheduler and polls then runs other events until the called proc completes you could end up with recursively nested event loops if there are function calls that alternate between passive and non-passive procs.

That is pretty much exactly what will happen. But IMHO it works beautifully:


active main()                    ← hardware stack, thread-local
  complete(readLine())           ← hardware stack loop on same thread
    readLine {.passive.}         ← stack-allocated coroVar
      ioWait → suspend()         ← worker freed to pool
      ... other tasks run ...
      resume on SAME thread      ← coroVar pointer still valid
    complete() returns
  use result

complete() may nest when active and passive alternate, but each level is just a shallow loop — coroutine state lives in CPS frames, not on the hardware stack. While one call suspends, the worker keeps doing other pool work; from main's perspective it's still a plain blocking call.

The stack frame from the active caller pins resume to the same thread, which is exactly what makes this safe. The main footgun is {.threadvar.} inside {.passive.} — that's task semantics on thread-local storage, and the compiler could reject it statically.

blackmius (orginal) [2026-06-24T13:51:29+02:00] view original

the call stack could grow until you exceed stack size limits…

Araq (orginal) [2026-06-24T14:25:19+02:00] view original

The stack doesn't grow with passive depth — it grows with active/passive alternation depth, and each crossing is a shallow complete() loop, not unbounded recursion.

blackmius (orginal) [2026-06-24T15:01:21+02:00] view original

here is alternaring example that grows stack size


import std/syncio

var RLIMIT_STACK* {.importc: "RLIMIT_STACK", header: "<sys/resource.h>".}: cint
type rlim_t {.importc: "rlim_t", header: "<sys/resource.h>".} = cint
type Rlimit {.importc: "struct rlimit", header: "<sys/resource.h>".} = object
    rlim_cur: rlim_t
    rlim_max: rlim_t
proc getrlimit(resource: cint, outptr: ptr Rlimit) {.importc: "getrlimit", header: "<sys/resource.h>".}
proc setrlimit(resource: cint, outptr: ptr Rlimit) {.importc: "setrlimit", header: "<sys/resource.h>".}

var rlimit = Rlimit(rlim_cur: 1024*128.cint, rlim_max: 1024*1024.cint)
setrlimit(RLIMIT_STACK, rlimit.addr)
getrlimit(RLIMIT_STACK, rlimit.addr)
echo "CURRENT RLIMIT_STACK ", rlimit.rlim_cur, " ", rlimit.rlim_max

var q = 0

proc a(b: int) {.passive.} =
    if b > 0:
        a(b-1)
    q += 1

a(10000)
echo q

q = 0

proc alternate2(b: int) =
    if b > 0:
        alternate(b-1)
    q += 1

proc alternate(b: int) {.passive.} =
    if b > 0:
        alternate2(b-1)
    q += 1

alternate(10000)
echo q

blackmius (orginal) [2026-06-24T15:26:03+02:00] view original

or even better

 nim
var queue = newSeq[Continuation]()
proc scheduler(c: Continuation): Continuation =
    result = c.fn(c.env)
    if result.fn == nil and queue.len > 0:
        result = queue.pop()
setScheduler(scheduler)

proc ioWait() {.passive.} =
    suspend()

proc nonpassive() =
    ioWait()

proc task() {.passive.} =
    nonpassive()

for i in 0..10000:
    queue.add delay task()

let c = queue.pop()
c.complete()

blackmius (orginal) [2026-06-24T15:35:09+02:00] view original

ok, now i see, scheduler is not a proper tool for queuing. in real code it would be a side while queue.len > 0 loop which pops coroutines and complete them

Araq (orginal) [2026-06-24T15:39:13+02:00] view original

That's just your scheduler doing poor things, it doesn't invalidate the design. Our std/threadpool.nim does not have this problem.

Araq (orginal) [2026-06-24T16:45:13+02:00] view original

ok, now i see, scheduler is not a proper tool for queuing.

Thanks for your report anyway, it makes for a great addition to the documentation!

mhjkl (orginal) [2026-06-24T19:38:44+02:00] view original

That seems fine if we assume a finite non-growing set of async tasks, but if we have something interactive like (a webserver for example, or a real-time gui app with long-running background tasks) new tasks can appear (from new requests for example). For the stack to unwind all tasks below the current level have to be completed, but if the tasks complete and spawn new tasks in an arbitrary order it is possible for the stack to grow indefinitely. For example, you may have two functions processing requests with nested event loops, and before they complete, a third comes. Now the top stack layer prevents unwinding even if the previous two complete. If before the third completes a fourth request comes and alternates the stack, it cannot unwind either. Would imagine then you would need a non-recursive scheduler for some domains using the aforementioned nesting detection but the nested scheduler looks neat if we assume a semi-fixed task set where tasks spawn in rare bursts

Araq (orginal) [2026-06-24T20:07:31+02:00] view original

Concurrent requests shouldn't nest on one hardware stack. Each connection is spawned as its own continuation and submitted to the pool — new requests go in the run queue, not inside an unreturned complete(). Stack depth is bounded per handler call chain, not per number of live connections. The queue-in-scheduler antipattern is real, but the fix isn't a fancier nested scheduler — it's enqueue at pool boundaries (which the thread pool already does). Interactive domains don't need a finite task set; they need to not model concurrency as nested blocking calls on one thread.

Assembly via stack manipulation trades the "where do I come from data" against the "what do I need in the future" -- that is worse for memory consumption.

Mirror of forum.nim-lang.org

13999 :: Nimony async/passive procs