Of course, there are a few no-brainers that are still missing and that I'd really like to see at some point:
This is excellent news. I hoped it would support ARC/ORC, given that most of the implementation is done as an AST transformation. But it's not a big issue; maybe you should skip 1.4 altogether. 1.6 has many internal changes already in preparation for IC, and IC could change what nlvm's code base looks like in quite fundamental ways.
My vision is that the frontend produces a set of .rod files (inspired by Java's and .NET's bytecode files) and backends turn the set of .rod files into binary code (via C, C++, LLVM or whatever seems fit).
I hoped it would support ARC/ORC
It's probably not too much work - the biggest issue is the new runtime string: the layout changed, so everything that touches strings (and seqs, I presume) now needs a new implementation. The more string handling that can move into the standard library, the better (i.e. why is appending an element special-cased/magic in the compiler?). Most of all though, until ARC/ORC is stable enough to use for the Nim compiler itself, it's a bit too much of a moving target for me to keep up with, timewise. Generally, nlvm must match the cgen bug-for-bug, because there's code out there that depends on the bugs as well.
frontend produces a set of .rod files
Well, presumably there will be some in-memory representation. What's difficult in the compiler right now is that even the cgen does semantic and transformation work, updating the AST itself, so it's a bit hard to know which "shapes" of AST need to be supported and which AST node kinds the backend will actually see. It also makes any kind of caching or pre-calculation step hard - ideally, by the time things reach the backend, the AST would be.. immutable-ish.
The other thing is incremental compilation and the VM. Right now, the VM is tightly coupled, but for something like LLVM it would make a lot of sense to generate IR early for leaf functions and use the LLVM JIT. That would probably save a fair bit of processing and make the Nim stage of the compiler a fair bit faster, because compile-time code could then contribute to the final IR without first being compiled to VM format and then to backend IR. I imagine there would be some sort of API between the semantic passes and the VM so that either could be used during compilation.
What happens next is that LLVM has two kinds of optimizations, function-level and global, and the function-level optimizations can easily be run at any point in the compilation process. Put another way: instead of rod files, the most sensible thing for nlvm to store would be LLVM IR snippets - I'm not sure if it's possible to make that backend-specific. It doesn't really matter if it's done this way from the start, but it's a nice thought. It would complicate some other things though - in particular it might complicate the cgen, so it might not be a price worth paying. Maintaining a single backend is hard enough, since end-user code tends to start relying on both bugs and features in there; maintaining multiple middle layers might simply not be worth whatever gains there might be, unless the Nim compiler is significantly refactored.
... unless the Nim compiler is significantly refactored.
That's what we're doing, but it's a slow process and "hacks" have the annoying tendency to live long. "Nothing is more permanent than a makeshift"...