nimforum mirror - Multithread support in a dynamic language interpreter

gcao [2022-10-29T17:28:33+02:00]

Hi,

I would like to add multithreading support to the pet language I'm working on [1]. After reading about Nim's thread model, I'm not sure what the right approach is.

In my program, there are a few things that may not be compatible. The VM and values are all ref objects [2]. The VM is a global object.

If I don't change things dramatically, I was thinking I could do this:

1. Add a global lock.

2. Add a global array to store arguments and results for the thread handler function.

3. Pass the index of the argument/result slot to the handler.

4. The handler acquires the lock before accessing the VM and the global arguments array.
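The steps above can be sketched roughly as follows. This is only an illustration, not code from the gene-new repository: `Task`, `tasks`, `vmLock` and `handler` are hypothetical names, and plain integers stand in for interpreter values so that the sketch sidesteps the separate question of sharing ref objects between threads.

```nim
# Compile with: nim c --threads:on sketch.nim
import std/locks

const MaxTasks = 8

type
  Task = object
    arg: int    # stand-in for an interpreter value passed to the thread
    res: int    # slot where the handler stores its result
    done: bool

var
  vmLock: Lock                          # step 1: the single global lock
  tasks: array[MaxTasks, Task]          # step 2: global argument/result slots
  workers: array[MaxTasks, Thread[int]]

proc handler(slot: int) {.thread.} =
  # Step 4: all access to the shared state happens while holding the lock.
  withLock vmLock:
    tasks[slot].res = tasks[slot].arg * 2
    tasks[slot].done = true

initLock(vmLock)
for i in 0 ..< MaxTasks:
  tasks[i].arg = i
  createThread(workers[i], handler, i)  # step 3: pass only the slot index
joinThreads(workers)
deinitLock(vmLock)
```

Because every handler holds the lock for its whole body, the threads in this sketch effectively run one at a time, which is exactly the concern raised in the questions that follow.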

A few questions / issues with this approach:

I need to update every place where I access the VM. This will make the code look clumsy, and I guess it'll have a big impact on performance as well.

How do I allow user code to do something without holding the lock? Without this, there is no benefit to supporting multithreading.

I'm not sure whether I'm on the right path. Please shed some light on this. Any comments or suggestions are welcome.

[1] https://github.com/gcao/gene-new/

[2] https://github.com/gcao/gene-new/blob/master/src/gene/types.nim

Araq [2022-10-29T18:23:44+02:00]

The important question is "how does threading work in your language?". What are the primitives it supports? How can data be shared between threads? Does it only support message passing?

Once you have answered these questions, you can think about how your implementation needs to do it. Your ref-based implementation might not survive this, and you may need to replace ref with ptr and a custom atomic reference counting mechanism. But it should be manageable.
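As a rough illustration of what "ptr plus a custom atomic reference counting mechanism" could look like, here is a minimal sketch. The names `SharedValue`, `newShared`, `retain` and `release` are hypothetical, the payload is a plain int rather than a real interpreter value, and a production version would also have to release any child values before freeing.

```nim
# Compile with: nim c --threads:on sketch.nim
import std/atomics

type
  SharedValue = object
    rc: Atomic[int]   # reference count, only ever updated atomically
    payload: int      # stand-in for an interpreter value
  SharedPtr = ptr SharedValue

proc newShared(payload: int): SharedPtr =
  # allocShared0 returns zeroed memory that is not owned by any one thread's GC.
  result = cast[SharedPtr](allocShared0(sizeof(SharedValue)))
  result.rc.store(1)
  result.payload = payload

proc retain(p: SharedPtr): SharedPtr =
  discard p.rc.fetchAdd(1)
  p

proc release(p: SharedPtr): bool =
  ## Drops one reference. Frees the value and returns true if it was the last.
  result = p.rc.fetchSub(1) == 1
  if result:
    deallocShared(p)

var freeCount: Atomic[int]   # how many release calls actually freed the value

proc worker(p: SharedPtr) {.thread.} =
  doAssert p.payload == 42   # safe: this thread still holds a reference
  if release(p):
    discard freeCount.fetchAdd(1)

var threads: array[4, Thread[SharedPtr]]
let v = newShared(42)
for i in 0 ..< threads.len:
  createThread(threads[i], worker, retain(v))  # one extra reference per thread
if release(v):                                 # main drops its own reference
  discard freeCount.fetchAdd(1)
joinThreads(threads)
```

Whichever thread drops the last reference frees the memory, so exactly one of the five `release` calls returns true regardless of scheduling.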

mratsim [2022-10-30T13:15:06+01:00]

Regarding the primitives you support, it's important to know whether you want multithreading for IO, for compute, or for both. So tell us what your multithreading use cases are, and read this blog post (disclaimer: self-promotion): https://nim-lang.org/blog/2021/02/26/multithreading-flavors.html

In particular, it's unclear whether you want to parallelize your VM (why? when?) or offer parallel primitives for programs built on your VM (in that case, you don't need to change anything).

If you want multithreading for IO, an important question is how you will handle async.

Also consider whether you want unified handling (say async/await) for async, parallel IO, and parallel compute tasks. "Simple" unified handling will have performance implications for compute tasks (due to kernel context switches) but will significantly streamline ergonomics and implementation.

If we ignore the std/threadpool module, which is being phased out, Nim offers you a blank slate on top of the Windows/Unix thread creation and destruction primitives. The only thing you cannot do is preemptive (as opposed to cooperative) multithreading, where you pause a thread and run another one without the paused thread's cooperation. Preemptive multithreading can only be done by the OS (well, maybe you can cheat with signals ...).
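To make the "blank slate" concrete, here is a small sketch of handing work to a thread using only `createThread` and the built-in `Channel` type, i.e. the message-passing style mentioned earlier in the thread. The channel names, the squaring "task" and the negative stop value are all assumptions made for illustration, not a recommended design.

```nim
# Compile with: nim c --threads:on sketch.nim

var
  requests: Channel[int]   # main thread -> worker
  replies: Channel[int]    # worker -> main thread

proc worker() {.thread.} =
  while true:
    let n = requests.recv()   # blocks until a message arrives
    if n < 0:                 # hypothetical convention: negative means "stop"
      break
    replies.send(n * n)       # stand-in for a real compute task

var t: Thread[void]
requests.open()
replies.open()
createThread(t, worker)

var results: seq[int]
for n in [1, 2, 3]:
  requests.send(n)
for _ in 1 .. 3:
  results.add replies.recv()

requests.send(-1)   # ask the worker to exit
joinThread(t)
requests.close()
replies.close()
```

No state is shared apart from the two channels, so no lock appears in user-visible code. The trade-off is that channel messages are copied, which matters if interpreter values are large.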

gcao [2022-10-30T14:28:15+01:00]

Thanks a lot for the suggestions.

What I would like to support is what you'd expect from any language with multithreading support:

allow the user to create threads

avoid deadlocks and race conditions; if I have to put some burden on the user, that's OK.

I'll see what is possible with my current design first, and then see if I need to change some part of my design in order to provide better multithread support.

@mratsim I will read through your blog post. Thanks for the pointer.

gcao [2022-10-30T15:28:11+01:00]

After re-reading your post, here are some of my answers:

I would like to add multithreading primitives that allow users to hand some tasks to threads. They should support both IO and compute tasks. The goal is to make better use of multiple cores.

So either spawn or thread creation or both are ok.

Parallelizing the VM is not currently a goal.

Regarding preemptive vs. non-preemptive, whatever Nim supports is what you get, since the language is built in Nim.
