nimforum mirror - [Solved] How to properly bind a function to a compiler builtin?

Krux02 [2017-04-08T20:18:48+02:00]

I am currently improving the procedure computeSize in types.nim. I have worked on it quite a bit now, but I don't understand how system.sizeof eventually calls types.computeSize. I know that in some cases a call to system.sizeof cannot be evaluated at compile time, and I would like to know how that decision is made.

Eventually I would like to add system.alignof, analogous to system.sizeof. Then I would like to add offsetof, but here I do not know what the interface should look like. This is an idea:

proc offsetof[T](typ: typedesc[T]; member: untyped): int
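For comparison, here is a minimal sketch of how an interface of this shape can be used. It assumes a newer Nim release than the one discussed in this thread, one where system.offsetOf and system.alignof are available. The Sample type is hypothetical, and the printed values depend on the target platform.

```nim
type
  Sample = object   # hypothetical type, for illustration only
    a, b: int8
    c: int64

# Assumes a Nim version that provides system.offsetOf and system.alignof.
echo "offset of b: ", offsetOf(Sample, b)
echo "offset of c: ", offsetOf(Sample, c)   # platform dependent
echo "alignment:   ", alignof(Sample)       # platform dependent
```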

For my use case it is very important that I can get all the sizes at compile time, so I would also like to make all of this available with NimNode arguments:

# macros.nim
proc sizeof(typ: NimNode)
proc alignof(typ: NimNode)
proc offsetof(member: NimNode)

But here again, I don't understand how I can bind these procedures to the functions I defined in types.nim.

I need it for my current pull request

Varriount [2017-04-09T00:47:15+02:00]

What do you mean by 'bind'?

Araq [2017-04-09T07:52:54+02:00]

There are multiple ways to do this in the compiler, and you need to play around to see what works best for sizeof, alignof and offsetof.

Add new mAlignOf, mOffsetOf to the ast.TMagic enum.

Either add the handling to:

(1) magicsAfterOverloadResolution in semmagic.nim

Or:

(2) semMagic in semexprs.nim, and don't forget to heed the comment there:

# DON'T forget to update ast.SpecialSemMagics if you add a magic here!

With (2) you can bypass the overloading resolution mechanism, which introduces a builtin that cannot be overloaded (only shadowed), so (1) is the preferred way.

The macros API requires patching vmgen and vm. I'd reuse the opcNGetType opcode for this.

cblake [2017-04-09T13:51:03+02:00]

For my use case it is very important that I can get all the sizes at compile time

You may not be able to make all these things available at Nim-compile-time because they are not fully determined until the C (or whatever) backend compile is reached. For example,

type foo = object
  bar: char
  baz: int

will induce generation by Nim of some struct in the C backend. This leaves the C compiler a choice of how to lay out the struct: is it "1 byte packed next to 4 or 8 bytes", or is there a 3 or 7 byte gap between bar and baz? Many backends will put in those 3 or 7 bytes of padding so that the int field is aligned, which can result in faster generated assembly code on some CPUs.

Almost all C compilers allow some kind of compiler directive like GCC's __attribute__((__packed__)) or an analogue to kill all padding and let users control layout. For the C backend (but maybe not others..?) it may be possible for Nim to use those directives for all its structs to work around the ambiguities of backend struct layout and resolve these to Nim-compile-time values. It is very debatable whether that is desirable, though. It is definitely more complexity than just "binding" calls. That binding is really more delegation than resolution to a value.

What the compiler currently does seems the right thing to me: delegate the complex/compound cases to the backend and "optimize" the cases for simple/non-composite types where it can reliably know the answer. For compound types, Nim can know that these backend-layout-sensitive values are constants at both Nim and backend compile time; it just does not know what the values are, so it cannot do fully resolved calculations with them at Nim compile time.

So, a higher level question is whether this resolution to a value known at Nim compile time is really a strict requirement for your purposes, or whether you can relax that to delegated backend work so the calculation can be done at "backend compile time".

Krux02 [2017-04-10T10:48:53+02:00]

@cblake According to my research, each platform (x86, ARM, PowerPC) may have different alignment rules. But there is definitely no difference in alignment rules between compilers; otherwise it would be impossible to use structs as interfaces in C. This is what makes C header files compatible between different compilers. This compatibility was never enforced or defined, and the C standard would actually allow different behaviours, but compiler manufacturers wanted compatibility, and that is what we can rely on today.

I am aware of the packed attribute; I think Nim should have the same.

As for your last question: I think I could move the resolution of the alignment parameter to runtime, but that would not only increase the complexity of the code a lot, it would also remove the ability to do compile-time error checking.

Since you can use statements such as when defined(windows): in your code, the generated C code is already platform dependent. This means it would not be wrong at all to generate platform dependent constants in the C code. For other platforms the C code would be regenerated anyway.
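A minimal sketch of that argument: the when branch is chosen while Nim compiles, so the constant differs per target, and a static block turns a wrong assumption into a compile-time error. The name targetFamily and the checked size are illustrative assumptions, not taken from the pull request.

```nim
# Chosen at Nim compile time; only one branch ends up in the generated code.
when defined(windows):
  const targetFamily = "windows"
else:
  const targetFamily = "other"

# Compile-time check: compilation fails here if the assumption is wrong.
static:
  doAssert sizeof(int64) == 8, "unexpected int64 size"

echo "compiled for: ", targetFamily
```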

Krux02 [2017-04-10T11:21:17+02:00]

@Araq

I just looked at magicsAfterOverloadResolution, since you said it is the preferred way. I tried to find the mSizeOf handling in there for an example, but it is not handled there. I would like to know what I am actually supposed to do in this procedure. What information do the parameters provide to me? And what am I supposed to return?

Currently the procedure itself does not have a documentation string, nor do any of its parameter types have any documentation.

So my question here is: what are the parameters, and what should it return? The parameter c: PContext in particular is a bit confusing to me. Since I have already used PNode a bit, I guess it should eventually return an integer literal in the form of a PNode with the value of the offset/alignment/size.

cblake [2017-04-10T13:42:21+02:00]

You are right that C compilers already strive to ensure ABI struct compatibility, which is why I think having Nim leverage that effort is better than reproducing it at the Nim level, though using the backends themselves to pre-generate things may help. My point was that getting truly Nim-time resolved values is trickier than the discussion seemed to frame it. Another thought along those lines might be to resolve things to Nim-time values only for types using Nim's {.packed.} pragma. Maybe only {.packed.} types would get the best error reports.
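A minimal sketch of what {.packed.} changes, reusing the fields of the earlier foo example. The type names Foo and PackedFoo are hypothetical. The packed size follows directly from the field sizes, while the unpacked size depends on the padding the target ABI requires.

```nim
type
  Foo = object                    # layout left to the usual alignment rules
    bar: char
    baz: int
  PackedFoo {.packed.} = object   # no padding between fields
    bar: char
    baz: int

echo "unpacked: ", sizeof(Foo)        # platform dependent, includes padding
echo "packed:   ", sizeof(PackedFoo)  # 1 + sizeof(int)
```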

Krux02 [2017-04-10T14:15:44+02:00]

@cblake I think it should be handled at the Nim level, because it can be handled at the Nim level. The more things can be resolved on the Nim side, the more powerful the language is. On the other hand, Nim does not only have a C backend, it also has an LLVM backend. I do not know the state of this backend or how LLVM actually works, but I could imagine that there it is the job of the Nim compiler to define the alignment. It is also not too complicated to implement this behaviour in a backend-dependent way. The algorithm that runs for each backend is exactly the same; it is only the alignment value of the different types that might change between backends. So it boils down to a simple mapping from a (type, backend) pair to an alignment value. This map needs to be complete, though.
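A minimal sketch of that idea, not the code from the pull request: one layout algorithm, parameterised by a lookup from a (type, backend) pair to a size and an alignment. The enums Backend and Prim, the procs sizeAlign and computeLayout, and the table values are all hypothetical and only illustrative.

```nim
type
  Backend = enum          # hypothetical backend/ABI identifiers
    abi64, abi32
  Prim = enum             # hypothetical primitive field kinds
    pInt8, pInt32, pInt64
  Layout = object
    offsets: seq[int]     # byte offset of each field
    size, align: int      # total size and alignment of the object

proc sizeAlign(p: Prim; b: Backend): tuple[size, align: int] =
  ## The (type, backend) -> (size, alignment) map; values are illustrative.
  case p
  of pInt8: result = (size: 1, align: 1)
  of pInt32: result = (size: 4, align: 4)
  of pInt64:
    result = if b == abi64: (size: 8, align: 8) else: (size: 8, align: 4)

proc roundUp(x, a: int): int =
  ## Round x up to the next multiple of a (a > 0).
  (x + a - 1) div a * a

proc computeLayout(fields: openArray[Prim]; b: Backend): Layout =
  ## Same algorithm for every backend; only sizeAlign differs.
  result.offsets = @[]
  result.align = 1
  var offset = 0
  for f in fields:
    let (size, align) = sizeAlign(f, b)
    offset = roundUp(offset, align)      # insert padding before the field
    result.offsets.add offset
    offset += size
    result.align = max(result.align, align)
  result.size = roundUp(offset, result.align)  # trailing padding

echo computeLayout([pInt8, pInt8, pInt64], abi64)
echo computeLayout([pInt8, pInt8, pInt64], abi32)
```

Under these assumed table values, the same three fields get different offsets for the two backends, which is why the map has to be complete for every backend that is supported.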

cblake [2017-04-10T14:32:41+02:00]

Fair point about the LLVM backends, though I don't know their level of C struct compatibility or whether they can leverage internal LLVM APIs to get that. Anyway, just trying to help. Code complexity jello (squeeze one place, watch others expand) is very surely not an exact science. :-)

You've probably already considered it, and I surely don't know all the different ABIs out there for all CPUs, but I can imagine nested struct transitions being complicated for some base types at the beginnings/ends of nested structs. It also might be good for you to try to get two pretty different backend CPU archs working at first to cover all your cases/tests. Maybe ARM32 and x86_64, or whatever you have easy access to.

Krux02 [2017-04-11T16:58:12+02:00]

I have problems with the procedure magicsAfterOverloadResolution.

I added a proc to system.nim:

proc alignOf*[T](x: T): int {.magic: "AlignOf", noSideEffect.}

And an entry in magicsAfterOverloadResolution:

of mAlignOf:
    echo "evaluating mAlignOf:"
    let typ = n[1].typ
    debug(typ)
    let align = typ.getAlign
    result = newIntNode(nkIntLit, align)
    result.info = n.info
    debug(result)

The output I get is the following:


nim c  compiler/nim
CC: compiler_sem
bin/nim_temp  c -r testsizeof
evaluating mAlignOf:
tyObject SimpleAlignment(null, node: {
    "kind": "nkRecList",
    "info": ["testsizeof.nim", 12, 20],
    "flags": {},
    "sons": [
      {
        "kind": "nkSym",
        "info": ["testsizeof.nim", 13, 4],
        "flags": {},
        "sym": a_104030,
        "typ": tyInt8 int8
      },
      {
        "kind": "nkSym",
        "info": ["testsizeof.nim", 13, 6],
        "flags": {},
        "sym": b_104031,
        "typ": tyInt8 int8
      },
      {
        "kind": "nkSym",
        "info": ["testsizeof.nim", 14, 4],
        "flags": {},
        "sym": c_104032,
        "typ": tyInt64 int64
      }
    ]
  })
{
  "kind": "nkIntLit",
  "info": ["testsizeof.nim", 111, 51],
  "flags": {},
  "intVal": 8
}
testsizeof.nim(111, 52) Error: type mismatch: got (void)
but expected one of:
proc `$`(x: int): string
[...]
FAILURE

The line with the error in testsizeof.nim is the following:

echo a.type.name, ":\t", sizeof(a), "\t", alignof(a)

where a is of type SimpleAlignment:

type
  SimpleAlignment = object
    a,b: int8
    c: int64

Why do I get this error? I create an integer literal, and the compiler complains that it is of type void!

Mirror of forum.nim-lang.org
