I agree with most of your points, but I take issue with this one:
Until now, macros can only deal with Nim-parsed code and therefore the Nim parser is in the way.
How is it in the way? You can easily write compile-time parsers that work on a triple-quoted string literal and produce NimNodes, see how strformat does it.
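A minimal sketch of that approach (the keyValues macro and its tiny "name = value" language are made up here; strformat is more elaborate): the macro receives the string literal as a NimNode, parses it with ordinary compile-time Nim code and emits new NimNodes, so the Nim parser never has to understand the embedded syntax.

import std/[macros, strutils]

macro keyValues(spec: string): untyped =
  ## Turns lines of the form "name = value" into `let` definitions.
  result = newStmtList()
  for line in spec.strVal.splitLines:
    let trimmed = line.strip
    if trimmed.len == 0: continue
    let parts = trimmed.split('=')
    if parts.len != 2: continue
    result.add newLetStmt(ident(parts[0].strip), newLit(parseInt(parts[1].strip)))

keyValues"""
  answer = 42
  year = 2021
"""
echo answer + year   # the generated symbols are ordinary Nim definitions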
Other points read a bit like "Within Nim there is a much smaller and cleaner language struggling to get out"... Which I agree with, and we will get this better Nim language -- eventually. Proposals like https://github.com/nim-lang/RFCs/issues/369 are highly appreciated.
My ideal language is something like a modern C or a minimalist C++. A language with a built-in concurrency library, unicode support, basic OO constructs, generics, intuitive metaprogramming, clean syntax, easy interoperability with C, and good performance. Being memory safe is a plus, but I am fine with constructors/destructors, smart pointers, and manual memory management.
Nim is already close to this. There are no more header files or other clunky C/C++ leftovers. The problem is that Nim tries too hard to be memory safe. Nim ends up with all kinds of GCs, and each has its own limitations to ensure memory safety. We have ref and ptr, alloc and allocShared, etc. It seems that Nim is now exploring ownership à la Rust (sink, lent, move). I certainly hope Rust's lifetime stuff does not creep into Nim.
It is okay to be memory unsafe. C and C++ are not memory safe and they are still popular. It is better to hand control back to the users than to complicate the language. They know what they signed up for when they learned the language.
Note that with all that fancy memory model, Rust still needs smart pointers to cover corner cases. I believe Rust has six of them (probably more), while C++ has only three. In C++, smart pointers are not strictly necessary and are more like convenience tools, while in Rust they are required.
Even though Nim has lots of GC options, most seem like toys or are for special use cases. As @Araq mentioned here, --gc:orc changes a lot for Nim's coming evolution and additions.
Simplification of Nim can easily come in the form of better defaults (like using the orc GC over the refc GC as a default), and also in PRs like the one above which will remove an unnecessary part of the language.
Also, if I am not mistaken, @Araq, you have expressed time and again that Nim will not follow in the footsteps of Rust, when it comes to lifetime annotations.
I have seen the options for sink, lent, and move as being most helpful for those who need or desire total control of memory in Nim, whether without a GC or with their own GC; these become explicit annotations of the code's behavior, compared to the implicit copies and moves we see in C/C++.
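A minimal sketch of what these explicit annotations look like in practice (Buffer, consume and view are made-up names; whether a call actually moves depends on the compiler's last-use analysis):

type Buffer = object
  data: seq[byte]

proc consume(b: sink Buffer) =
  # the caller hands over ownership, so no copy is needed
  echo b.data.len

proc view(b: Buffer): lent seq[byte] =
  # borrow the field: the caller gets a view, not a copy
  result = b.data

var buf = Buffer(data: newSeq[byte](1024))
echo view(buf).len
consume(buf)   # last use of `buf`, so the compiler can move instead of copy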
It seems that Nim is now exploring ownership à la Rust (sink, lent, move). I certainly hope Rust's lifetime stuff does not creep into Nim.
So far there are no plans to go beyond sink/lent/move/cursor/acyclic. I know these 5 new things can be a bitter pill to swallow but they are opt-in and also address issues existing since Nim's initial design, not only "memory safety". To the best of my knowledge the design is reasonably complete and the implementation is catching up. Don't worry. :-)
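For the curious, a small sketch of the other two opt-ins mentioned (Node and count are made-up names; the hints only matter under --gc:arc/orc):

type
  Node {.acyclic.} = ref object  # promise: no reference cycles,
    next: Node                   # so the cycle collector can skip this type
    data: int

proc count(head: Node): int =
  var it {.cursor.} = head       # non-owning view: no RC traffic while walking
  while it != nil:
    inc result
    it = it.next

let lst = Node(data: 1, next: Node(data: 2))
echo count(lst)   # 2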
And please don't underestimate memory safety -- there are valid reasons why Rust is taking off and why the entire rest of the industry moved to memory safe languages years ago, when they could.
Just imagine you work in a Java/C# shop and you try to sell them on using C(++) instead: "Hey, we get more control over the code and it uses half the memory. Occasionally there will be ridiculously hard-to-track-down bugs that can take months to fix, but that's fine because we know C(++) is not memory safe." -- "Er, you know, maybe you should look for a new job..."
To the best of my knowledge the design is reasonably complete and the implementation is catching up.
Cool! Quick question about that: in the destructors manual, the mostly complete example of defining a myseq (it needs a 3-line implementation of resize) calls =destroy on all the elements of the sequence. What's supposed to happen when you have a sequence of elements that don't have an =destroy, like ints? Can value types get an implicit no-op =destroy?
And please don't underestimate memory safety -- there are valid reasons why Rust is taking off and why the entire rest of the industry moved to memory safe languages years ago, when they could.
So true. Rust got adopted by Mozilla because it handled a lot of the bugs in their C++ codebase. And other large C++ shops are looking for safer alternatives. IMO Nim could be one alternative. For C++ programmers, those 5 new things are hardly a bitter pill. You'd have to add many more things even to catch up to C++ initialization...
I do think @hankas has a point though. Ada, amongst others, is not "memory safe" but it's a lot safer than C++ in practice.
What's supposed to happen when you have a sequence of elements that don't have an =destroy, like ints? Can value types get an implicit no-op =destroy?
Every type has =destroy but it can be "trivial", like in C++.
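A small sketch of the difference, compiled with --gc:orc (Resource is a made-up type; int is the "trivial" case):

type Resource = object
  handle: int

proc `=destroy`(r: var Resource) =
  # user-defined destructor: runs when a Resource goes out of scope
  if r.handle != 0:
    echo "releasing handle ", r.handle

proc demo =
  var r = Resource(handle: 42)   # custom =destroy fires at scope exit
  var i = 123                    # int's =destroy is trivial: no code is emitted
  discard i

demo()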
Every type has =destroy but it can be "trivial", like in C++.
When I took the code from the destructors manual, added a missing resize
proc resize[T](x: var myseq[T]) =
  x.cap = max(x.len + 1, x.cap * 2)
  x.data = cast[typeof(x.data)](realloc(x.data, x.cap * sizeof(T)))
and tried to test the code by creating some myseq[int] I got
Error: type mismatch: got <int>
but expected one of:
proc `=destroy`[T](x: var T)
first type mismatch at position: 1
required type for x: var T
but expression 'x[i]' is immutable, not 'var'
proc `=destroy`[T](x: var myseq[T])
first type mismatch at position: 1
required type for x: var myseq[=destroy.T]
but expression 'x[i]' is of type: int
which strongly suggested to me that int (and string, tested separately) do not have =destroy attached.

For those who use C++: C++20 concepts fail this test as well.
Well, concepts were newly introduced (Dec 2020), the draft paper had 380+ pages, so give C++ a try, the language will mature soon.
BTW, nice example! I hope you file an issue.
IDK. Probably not. The typing world has changed considerably between ... 2006 and 2021. Type polarity is now well understood. If we obey type polarity, we obtain stable substitutions. We can bind early and reduce later. Very similar to the bare-metal lambda calculus. If we do not obey it, the compiler constantly has to check the environment and even backtrack if substitutions fail.
Concepts will add a considerable amount of boilerplate, so in some sense they try to cure a problem with a problem. They can lead to combinatorial explosion. They add an (expensive) meta-DSL to the language. However, if they worked bipolarly, they would give us a nice high-level macro language for almost everything, e.g. the construction of virtual function tables and much more.
I gave some examples above where generics did not work as expected. Something elementary is going on here. A change in the type system (precisely: the type inference system) would break existing code. So, a simple add-on needs to be done, basically at the module (= file) level. We have a clever developer who wrote a baseModule and a user programmer who writes a userModule by importing the baseModule. The pair (userModule, baseModule) should compile and work as expected. That's all there is to it.
and therefore extend Gstack with a new implementation type Ustack. The implementation of Ustack is the user's responsibility. The user might now write:
import basemodule

type
  Gstack[U] = Ustack[U] or Ustack[type unit] # {.prototype.}
  Ustack[U] = object
    data: seq[U]
    len: int

proc len*(st: Ustack): int {.inline.} = st.len

proc push*[U](st: var Ustack[U], item: U) =
  st.data.add item
  st.len = st.data.len

proc pop*[U](st: var Ustack[U]): U =
  let len = st.data.len - 1
  if len < 0: st.len = 0
  else: st.len = len
  result = st.data.pop
Really interesting, as your comments usually are. I've started diving into type theory and things you, @timothee and others have contributed are part of the reason.
A modification proposal: the minimal one-pragma approach is elegant, but let's not break import visibility semantics: the meaning of * could remain as it is and one could pass an optional parameter to prototype to denote extensibility:
{.prototype(open).}
or
{.prototype(extensible).}
These could also be valid inside the defining module.
A general thought: prototypical/exemplary definitions seem to go against most people's intuition in the context of statically typed languages. It feels JavaScript-y. Comparable to the witness construct you proposed during the discussion about symbol aliases started by @timothee. Back then I felt that supplying "explanatory example code" to the compiler sounded really weird. But now I think that was irrational, especially with meta-programming making it possible to generate various kinds of types, at least in principle.
If it was easier to experiment with types, people could get creative and I believe some substantial practical progress could be made. A DSL for type calculus and transformation rules which produces the compiler code for type verification and resolution and defines new type implementations with Nim's existing types as building blocks; is that a pipe dream or a realistic goal?
Well, concepts were newly introduced (Dec 2020), the draft paper had 380+ pages, so give C++ a try, the language will mature soon.
If you read the code at the end of the message you replied to, you'll see a translation into C++20 which compiles with both clang and g++ (-std=c++20), and demonstrates the problem.
If you're not going to file the issue with concepts, I guess I will. As for the rest: even though type Foo = int or string is currently valid Nim, it's a lie; Foo isn't a real type, much less a sum or union type. As the manual documents:
Whilst the syntax of type classes appears to resemble that of
ADTs/algebraic data types in ML-like languages, it should be
understood that type classes are static constraints to be enforced
at type instantiations. Type classes are not really types in
themselves but are instead a system of providing generic "checks"
that ultimately resolve to some singular type.
Me: "Well, concepts were newly introduced (Dec 2020), the draft paper had 380+ pages, so give C++ a try, the language will mature soon."
You: "If you read the code at the end of the message you replied to, you'll see a translation into C++20 which compiles with both clang and g++ (-std=c++20), and demonstrates the problem."
I'm sorry, but you didn't get the irony here....
If you read the code at the end of the message you replied to, you'll see a translation into C++20 which compiles with both clang and g++ (-std=c++20), and demonstrates the problem.
Yes, I've read your C++ code. And I appreciate it, because it was a lot of work, way more difficult than my Nim example.
Foo isn't a real type
Of course it isn't. It is used contravariantly only (you can't construct something with it), and therefore, as a type, it could only be used as a common subtype (which in most cases doesn't exist). Sum types are either unions (enforced coercion, like in C, so they can fail) or ADTs. Nim's ADTs are object variants.
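For reference, Nim's ADT is the object variant; a value always carries exactly one branch, selected by the discriminator (FooKind/Foo are illustrative names):

type
  FooKind = enum fkInt, fkStr
  Foo = object
    case kind: FooKind
    of fkInt: i: int
    of fkStr: s: string

let a = Foo(kind: fkInt, i: 3)
let b = Foo(kind: fkStr, s: "three")
echo a.i, " ", b.s   # accessing the wrong branch raises a FieldDefect at runtime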
I'm sorry, but you didn't get the irony here....
I guess the joke's on me. I couldn't decide if you were serious or not :-)
A DSL for type calculus and transformation rules which produces the compiler code for type verification and resolution and defines new type implementations with Nim's existing types as building blocks; is that a pipe dream or a realistic goal?
A required goal if we want to get out of today's whack-a-mole type system development. ;-)
Whack-a-mole? Type system? Really?
Imperative languages do offer product types for both data types and function types, and type checking via identity. Some coercions, e.g. int8 -> int32, and conversions, e.g. int -> float, are standard. C allows for some form of parametricity, for separate type definitions and type dependencies. Moreover, the C preprocessor allows splitting them further, with a "#define" part and a declaration part, rendering all types beforehand. The .c file provides the function bodies and data instantiation and type checks. The code could then be rewritten with type parameters (see the NimC example above). Without function overloading, this is a parametric (but monomorphic) module instantiation. The .h files add modularity to C. This advantage paved the way for C's success.
If several module instantiations are needed at the same time, function overloading needs to be implemented, and we obtain syntactic polymorphism and sets of type sets, allowing for first-order type reasoning (inclusion and instantiation order). I call this PMI: "polymorphic module instantiation". PMI gets handled via modules in ML/SML or type classes in Haskell.
C++ went another way: it offered function overloading first, but without the module-wide type parameters, and therefore no PMI and no type sets. I call this PPMI: "pointwise polymorphic module instantiation", because the extent of the module instantiation is completely dependent on the programmer. (There might be no module at all, even if it looks like one.) A "whack-a-mole" dependent type system was added later with type generators, aka templates. Types can now be "calculated", but need to be restricted too. Therefore, some time (30 years) later, concepts were introduced. They are the counterpart to templates: concepts are destructors and templates are constructors.
Nim's generic parameters, like "T" in proc xxx[T], are nothing more than convenient comprehensions of differing overloads. They are still part of PPMI; they do not add anything new to the type system.
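A tiny sketch of that view (twice is a made-up proc): the generic is just a recipe from which concrete overloads are instantiated on demand.

proc twice[T](x: T): T = x + x   # one recipe ...

echo twice(3)     # ... instantiated as if it were twice(x: int): int
echo twice(2.5)   # ... and again as twice(x: float): float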
The same goes for concepts. They are redundant. Example: a programmer implements against a concept that requires a proc myprocex. The proc is not implemented, though, because it is not used. Compilation will stop because myprocex is not there. But it could compile, because the implementation does not depend on myprocex. Now the programmer writes a dummy myprocex and the program compiles. However, the compiler will emit a warning: "myprocex not used". Otherwise, if myprocex had been called, a missing myprocex would have led to a failed compilation with and without the concept. Well...
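A minimal sketch of that scenario (Stackable, myprocex and MyStack are made-up names). The body of useIt never calls myprocex, yet the concept match fails as long as it is missing; adding an unused dummy satisfies the concept and merely earns an "unused" hint:

type
  Stackable = concept x
    myprocex(x)     # required by the concept ...
    x.len is int

type MyStack = object
  data: seq[int]

proc len(s: MyStack): int = s.data.len
# proc myprocex(s: MyStack) = discard   # uncomment to make it compile

proc useIt(s: Stackable) =
  echo s.len        # ... but never called here

var st = MyStack()
useIt(st)           # rejected until the dummy myprocex exists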
Some people think that a set-like representation of a module is missing, hence "interfaces". Let's see what we have:
https://dev.to/xflywind/zero-overhead-interface-exploration-in-nim-language-2b3d
Nothing is really convincing here; in principle, it's some boilerplate, an intermediate layer that is not needed and not appropriate because "it is not in the language".
So, what to do?
Modules are missing in Nim. They could be added easily though.
{.prototype(extensible).} These could also be valid inside the defining module.
Yes, but then you have submodules. How to make that explicit in current Nim? It's not in the language yet. Your syntactic extension looks reasonable.
A general thought: prototypical/exemplary definitions seem to go against most peoples' intuition in the context of statically typed languages. It feels JavaScript-y.
A prototype in JavaScript is a convenient way to construct local data and local functions via delegate bindings. The prototype provides the constructors. You don't need to know about the concrete types the prototype is working with; the types remain abstract for you. Prototypes can be regarded as (sub)modules. Let's say you are writing a user app Uapp and you have a prototypic module Pmodule providing an abstract type T. So you have the pair (Uapp, Pmodule[exT]). JS will bind Uapp to the prototype. The "exT" stands for "existentially bound". This is the abstraction. Since you can never define a free-floating [exT] within JS, you always have a monotyped implementation type:
Comparable to the witness construct you proposed
The witness. (It is called this way; not my invention, though...) At this point, the abstraction is a mental model only. JS doesn't know about the exT, nor can it declare it. JS determines (standard) types at runtime and is typeless otherwise. Now the other way round, (Uapp[exT], Pmodule), where [exT] is abstract for Pmodule. JS is typeless, so you can always pass your own types to Pmodule. Because the prototype is complete (by definition), Pmodule will work "out of the box" with your exT. However, due to JS's dynamic contexts (the key feature of JS) and runtime name lookup in particular, you can overload Pmodule's functions and therefore "extend" it.
Now to Nim. We have a Nim pair (Uapp.nim, Pmodule.nim) to consider, with the two abstract cases (Uapp, Pmodule[exT]) and (Uapp[exT], Pmodule). Let us postpone the former for good reasons. Then we have (Uapp[exT], Pmodule), and this is idiomatic Nim; it should always work, even if it does not. {.prototype.} will come to help here. If some overloads are specified, it will check the overloads. The most general overload is the unit type - it is completely abstract. If Uapp wants to specify its own overload, it can do this by "extending" the prototypically bound type, e.g. Gstack[U], with its own exT and an appropriate implementation. Again, {.prototype.} will check this. If something is missing, it will tell you what to do.
Why are these checks useful? An existentially bound type together with its implementation is a dependent sum type, a pair (a, B(a)) where a type a produces an implementation type B(a) that contains a. The unit type aside, this will not work in general. In fact, you can expect to get (a and C(a), B(a)), where C(a) extends a with constraints, (a and C(a)) being a subtype of a. If C(a) is already in the context, you are lucky. If not, compilation will fail. Another issue: since the generic parameters are analyzed pointwise, function by function only, C(a) is not stable; it depends on how many B(a) are involved (no module-wide generic annotations -- this is a severe problem).
In this regard, concepts are a misconception. They add new constraints (they are destructors). They do define new subclassing properties (see the examples in the concept RFC), which is very nice, but they don't remove the implicit constraint subtyping. If they could, they would have to construct the appropriate context so that (a and C(a)) always matches the context.
{.prototype.} would clear up the mess. It provides a (very) basic framework for type abstraction. It defines the set of a's valid for B(a), a common supertype "Shape". So it gives a pair (a: Shape, B(a)) without further constraints. BTW, it can help with vtables too.
I will not claim that {.prototype.} is a perfect solution, but it is a solution compliant with Nim -- a rather cheap one, cheaper than concepts.
If it was easier to experiment with types, people could get creative and I believe some substantial practical progress could be made. A DSL for type calculus and transformation...
Creative and complicated... Well, Haskell relies on higher-order unification (restricted to keep it decidable) for type class reasoning. It might be possible to do it more simply. Perhaps Araq could demonstrate how monads get verified with the current concept design. I tried it, but I could not figure out how to name type abstractions like (U->V)->V, so I failed.
The last round now: (Uapp, Pmodule[exT]). Here, the user app doesn't know the witness type exT. It knows about the supertype only; let's call it Shape again. So we have (Shape, B(a: Shape)). Example: Uapp can define var x: Shape = Pmodule.new(), where x gets "opened" with a type Shape. From then on, any specific operation related to x has to be done with functions that are imported from Pmodule. Again, we could use {.prototype.} for this. Example with a string witness type:
Exttype {.prototype.} = string
The implementation type is a monotype and therefore it can be used in Uapp for declarations. As said, any function that gets applied to Exttype has to be taken from Pmodule, for the sake of type abstraction. No functions involving Exttype may be declared in Uapp, and the compiler should check this.
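A hypothetical sketch of this last case, using the proposed (not yet existing) {.prototype.} pragma, so it will not compile with today's Nim; the module and proc names are made up:

# Pmodule.nim -- owns the witness type
type Exttype* {.prototype.} = string    # proposed syntax: abstract for importers
proc newExt*(): Exttype = Exttype("opened")
proc show*(x: Exttype) = echo x

# Uapp.nim -- may only touch Exttype through Pmodule's procs
import pmodule
var x: Exttype = newExt()
show(x)          # fine: show comes from Pmodule
# echo x & "!"   # should be rejected: bypasses the abstraction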
TL;DR: Generic annotations [T] introduce implicit and unpredictable constraints/downcasts in Nim that make type reasoning unfeasible. Downcasts can be removed with abstract types. Abstract types are missing, but they can be brought to Nim without any runtime cost; in particular, no virtual functions are needed.