mora (orginal) [2016-07-21T09:21:45+02:00] view original

Hi Guys,

I need your help. In the past two weeks I spent roughly 30 hours debugging the compiler to add proper concept[T] support. It feels like I'm at half of the work. Now I'm stuck at a language design question, how to find out T in concept[T]. I don't have a good proposal. Please take some time to think about this. I really should go back to push my regular work, so I'm trying to close this issue as soon as possible.

When you read the following examples then you might think that these are easy questions, and the compiler should be smart enough. Please keep in mind that we eventually want to use concept[T] for types like Map[ Map[int, string], Map[seq[Map[int, int]], float]]. Also, the main motivation is to create a function like proc getKeyWithLowestValue[K,V]( map: Map[K,V] ): K which could work on multiple map implementation (like HashMap, TreeMap in Java).

So let's start with the current stage. If you write this:

type StrangeConcept[T] = concept c
  c is T # Equivalent to "c is any", which is true
  float(c) is T # Equivalent to "float(c) is any", which is true
  $c is T # Equivalent to "$c is any", which is true
when int is StrangeConcept:
  echo "Yes, it is" # this is printed

So c is T doesn't assign T, right? This could be fixed, however, should the following work?

type NewConcept[T] = concept c
  c is seq[T]
  ...

One could say that yes, the compiler should be smart enough to find out T in this example. However, if c is a seq[seq[int]], then T should be seq[int] and not seq[any] or any. Fair enough. Let's continue:

type YouCanAddOne[T] = concept c
  c is T
  c+1 is T

What should we do in a case like this? The first line assigns T, and the second line is actually a testing? Am I the only one who feels this extremely strange? What happens if you swap c is T and c+1 is T? Then T is assigned by the type of c+1? Do we want the order to matter? (I know, it doesn't matter in this example.)

Let's continue the examples:

type MixWithFunctions1[T] = concept c
  f[T](c)
  ...
type MixWithFunctions2[T] = concept c
  f(c, (new T)[])
  ...

Shall we support something like these? I truly believe that we shouldn't. Just imagine that this needs backtrack algorithms because multiple f function can match. That would slow down the compiler (but it would become a constraint solver :) ).

Somewhat cleaner approach:

type ShouldBeClearConcept[T] = concept c
  type T = type( c[0].getFirstValue() )
  ...

This is easy to follow, there is no hidden logic. This way we wouldn't abuse our little is magic operator for something nesty with assignment. However, defining T after we already wrote T in ShouldBeClearConcept[T] is not very elegant and it is not in line with the rest of the language design. Please keep in mind that you cannot miss assigning T, because proc f[S](c: ShouldBeClearConcept[S]): S needs to know about it.

What's your opinion?

Cheers,: Peter

Araq (orginal) [2016-07-21T09:58:13+02:00] view original

Now I'm stuck at a language design question, how to find out T in concept[T]

Congratulations. you're definitely on the right track! :-) That's the tough nut. I have the following proposal, but I'm not sure it answers every one of your questions. The idea is that concept[T, ...] is always used for containers and thus the T should always be the container's element type. This means that concepts like Comparable cannot have a T since int is Comparable, but not a container.

Let's assume these definitions:

type
  One[T] = concept
    ...
  Two[S, T] = concept
    ...

The body is not relevant, since the body is not used to infer S and T.

Inferences rules

A generic type G[U] matches One[T] and infers the T: T <- U

A generic type G[U, V] matches Two[S, T] and infers the T <- U and S <- V.

A generic type G[U, ...] matches One[T] with T <- U. The other generic parameters are ignored.

array is a special case. array[I, T] matches One[T] ignoring the index type. (It matches a container of T)

However, array[I, U] matches Two[S, T] with S <- I and T <- U. (It matches a container with key, value pairs)

After the inference rules, the concept body is checked and if that matches too, we have a match, otherwise we don't.

Araq (orginal) [2016-07-21T10:08:29+02:00] view original

Let's go through your examples now.

type StrangeConcept[T] = concept c
  c is T # Equivalent to "c is any", which is true
  float(c) is T # Equivalent to "float(c) is any", which is true
  $c is T # Equivalent to "$c is any", which is true
when int is StrangeConcept:
  echo "Yes, it is" # this is printed

Nope, it should be:

type StrangeConcept = concept c
  c is type(c) # Equivalent to "c is any", which is true
  float(c) is type(c) # Equivalent to "float(c) is any", which is true
  $c is type(c) # Equivalent to "$c is any", which is true

No container, hence no T. As a workaround you need to write type(c). A clean solution IMO.

type NewConcept[T] = concept c
  c is seq[T]
  ...

Works.

type YouCanAddOne[T] = concept c
  c is T
  c+1 is T

Doesn't work this way, numbers are not containers. type(c) is the workaround.

type MixWithFunctions1[T] = concept c
  f[T](c)
  ...
type MixWithFunctions2[T] = concept c
  f(c, (new T)[])
  ...

Maybe this implies the inference rules need to cover proc (S, ...): T too? Oh well, you can extend them. But I don't think this needs to be supported for version 1 of the concepts implementation.

andrea (orginal) [2016-07-21T10:54:38+02:00] view original

@Araq I feel that restricting Concept[T] to containers is very limiting and will in fact make it impossible to define many abstractions.

Let me give an example. In emmy I have more or less the following defition of a field.

type Field = concept x, y
  x + y is type(x)
  zero(type(x)) is type(x)
  -x is type(x)
  x - y is type(x)
  x * y is type(x)
  id(type(x)) is type(x)
  x / y is type(x)

Here a field is defined as a type that admits the usual four operations and their identity elements - no need for generic concepts here (and in fact i works, more or less).

Say that now I want to define a vector space over a field. The obvious definition would be

type VectorSpace[K] = concept x, y
  x + y is type(x)
  zero(type(x)) is type(x)
  -x is type(x)
  x - y is type(x)
  var k: K
  k * x is type(x)

and this requires generic concepts.

Now the problem is: how does one infer K in this example (or T in any of the examples above)?

My proposal would be to make the compiler only test for concepts when all types are fully specialized. This is the same that happens for generics right now.

So, there would be no way to check that a type T belongs to VectorSpace[K] for some K, but we could check that it belongs to VectorSpace[float].

But we can still write generic functions having such constraint as T: VectorSpace[K]. The only problem is that the compiler will bail out at this point because K cannot be inferred. This can be easily fixed by

spitting out a suitable error message (K cannot be inferred in concept... defined at line ...)

letting the user explicitly opt in concept, say with a syntax like

static:
  assert T is VectorSpace[float]

or whatever we come up with. The compiler will then accumulate such facts and use them to resolve K in such situations as above.

What do you think?

Araq (orginal) [2016-07-21T11:18:52+02:00] view original

This works:

type
    Node* = concept n
        `==`(n, n) is bool
    
    Graph* = concept g
        var x: Node
        distance(g, x, x) is float

Applying it to your example yields:

type VectorSpace = concept x, y
  x + y is type(x)
  zero(type(x)) is type(x)
  -x is type(x)
  x - y is type(x)
  var k: Field
  k * x is type(x)

mora (orginal) [2016-07-21T12:46:42+02:00] view original

Thanks for taking the time to reply.

Just a quick reply, I didn't have time to think about the answers too much.

@Araq: I think that your proposal would work not only for containers. It would probably work for channels, streams, files as well.

@andrea: I understand the motivation, I'll think about this.

mora (orginal) [2016-07-21T23:35:27+02:00] view original

I can think of an example where the concept = container logic fails. Let's assume that we have Set[T] concept, and one would like to implement it for T=char by using a bitfield (that's 4 times 64-bit int, so it is an array[4, int]). This could be Set[char], however, there is no char in this type's syntax.

Also, Andrea, you are right. You would like to have several concept types with a small difference, where the difference is in one of the types. And that is what generics is for.

Repeating the obvious, these are the two cases we would like to have:

proc giveFirstValue[K,V](m : Map[K,V]): V = ... # here Map[K,V] defines K, V

proc RandomLinearCombination(x, y: VectorSpace[float]): auto = ... # here VectorSpace[T] needs to know T

How about having both of these? We could introduce the generic symbols within the concept, they would be the given types, or if no type is available, then the default value would be a type which doesn't match to anything. And they would be mutable.

For example

type ThisRequiresType[T] = concept c
  c[0][0].foo[0] is T
  c + c is int

This would match a parameter only if T is given (or can be find out from a previous generic match, like proc[S] f(a: S, b: ThisRequiresType[S])). If T is not given, then it would fail at every something is T check.

The other example:

type ContainerWithAddableElements[T] = concept c
  T = type(c[0])
  c[0] + c[0] is T

Here the first line would overwrite T, so it could be used in function definitions like proc first[S](x: ContainerWithAddableElements[S]): S = c[0].

So again, the steps would be: we introduce symbols for every generics at the beginning of the concept, they are mutable, and their values are the given ones, or if we cannot infer them, then they would be a type which matches nothing. The evaluation needs to be successful to continue, and then we check the values of these type symbols, and match them with any generics if we are in a function declaration.

What do you think?

Peter

Araq (orginal) [2016-07-21T23:56:56+02:00] view original

I can think of an example where the concept = container logic fails. Let's assume that we have Set[T] concept, and one would like to implement it for T=char by using a bitfield (that's 4 times 64-bit int, so it is an array[4, int]). This could be Set[char], however, there is no char in this type's syntax.

I don't understand this example at all. Besides, you can always introduce helper types to perform the matching as you please type Swapper[T,S] = Cont[S, T].

proc RandomLinearCombination(x, y: VectorSpace[float]): auto = ... # here VectorSpace[T] needs to know T

Since a VectorSpace[float] is exactly the same as float, there still is no need to parametrize VectorSpace.

What do you think?

Seems to be more complex than my solution, but it also seems to me that you described Zahary's solution (I skyped with him about it), so it could be fine. ;-)

mora (orginal) [2016-07-22T00:27:02+02:00] view original

So my Set[T] example wasn't clear enough. I wanted to have a mathematical set of unsigned bytes (like {3,4,5}, which becomes {1,3,4,5} after adding 1). This can be implemented with array[4, int]. I can't see how this could match for Seq[byte] with your trick, because array[4, int] contains no bytes.

The VectorSpace[float] has generators, dimension, subspace, all the nice math properties (it can be float, array[3, float], array[N, float], etc).

I don't know Zahary, but he must be a smart guy :P

Let's sleep a couple of days on this, then if everybody is on the same page, then I'll try to put together a PR.

Peter

OderWat (orginal) [2016-07-22T00:43:32+02:00] view original

I don't know Zahary, but he must be a smart guy

lmao <3

zahary (orginal) [2016-08-23T11:44:02+02:00] view original

@Araq, I'm more concerned about surprising matching results than the reasoning/implementation details, which I don't expect to be very complicated. complexDisambiguation is on the very last line of cmpCandidates and some of the criterias before it doesn't seem important enough to trump the "more-specific" rule, but we'll see how things will turn out.

Tarmean (orginal) [2016-08-23T13:37:14+02:00] view original

I really don't know much about type inference. I feel like the naive solution would be enough for the common cases and manually adding type annotations would cover the rest, though. Even if it does compute the wrong result it would error out like a normal generic, or do another check for generics after the first pass to get a better error message.

It probably would be possible to collect type constraints before anything else and solve them without exponential runtime explosions? Anyway, that seems like something that could be added at in a later version?.

A naive implementation that goes through all possibilities via backtracking probably wouldn't be too bad either as long as there aren't any procs with huge numbers of overloads and the implementation throws mismatches out early. In that case there probably should be a sanity limit at which the compiler gives up, though, or you get stuff like swift's hours of compile time for seemingly simple expressions.

boia01 (orginal) [2016-08-23T17:03:15+02:00] view original

This is a long shot but I figured I'd ask if only to provide an interesting use-case for concepts.

How far is the current design (& implementation) from supporting the definition of Tuple-like structure? Here's a sketch of what it could look like:

type Tuple[T: static[varargs]] = concept c
  for i in 0 .. T.arity-1:
    c[i] is T[i]

There's a few assumptions here which I'll call out:

passing in a variable number of type arguments (I made up the static[varargs] type)

executing code within the concept's body (metaprogramming), not just represent compilable patterns. How would we distinguish between a for pattern and actually running a for statement? Maybe put the block into a static scope?

the compiler would also need to infer that c[i] is T[i] is not expressing c[int] is T[i] but rather c[static[1]] is T[i] (and so on for each instantiations of i).

Thoughts? Is the current design forward-compatible to eventually express such concepts?

zahary (orginal) [2016-08-25T11:08:50+02:00] view original

Hi @boia01,

It will be possible to create concepts based on type lists, but I don't consider this a feature of the concepts per se, but rather a general support for variadic generics. You can take a look at this github issue:

https://github.com/nim-lang/Nim/issues/1019

As explained in the issue, type lists can be emulated right now with a single parameter, expecting a tuple type. Then within the concept, you can have a static: block containing arbitrary code for the actual checks. I tried to put something together quickly as an example, but I ran into several issues, so this explanation will have to do for now, sorry.

boia01 (orginal) [2016-08-26T02:11:46+02:00] view original

Thanks @Zahary for feedback. Yes, I tried using a static: block myself but ran into issues too. I didn't know enough about the compiler and where concepts are headed so I figured I'd ask instead. Excited to hear that it might be possible some day!

mora (orginal) [2016-09-05T08:59:30+02:00] view original

@zahary It becomes more and more obvious that I won't have time to work on Nim nowadays. It is my regular job which put more load on me.

Coming back to Concept[T], I can see 3 possible ways to continue.

1.: allow inference only if its unique. So c.push(T) only works if there is one push function, or we already know what T type is (e.g., we know that it is string).

2.: allow inference to metatypes. So c.push(T) can lead to T being string|int. I recommend an extra check at the end. That is if you have a match (e.g., T is int and S is int), then reevaluate the concept's body again just to be sure. (Just for the record, the reason for this extra check is an example above somewhere in this thread.)

3.: allow some extremely smart logic which infers everything which is possible. This needs to involve some backtrack logic.

I would vote to 1., but that doesn't work with your Stack[T] case. I'm strongly against with 3., because it can lead to extreme compilation time. 2. looks nice.: Peter

andrea (orginal) [2016-12-02T15:54:16+01:00] view original

@zahary Any news on your work on this topic?

vega (orginal) [2016-12-07T03:45:47+01:00] view original

@andrea, see this log from gitter


Anatoly Galiulin @vegansk Nov 30 17:03
Hi to all. It was a long road back to Nim  I have a question about generic-concepts branch (https://github.com/nim-lang/Nim/tree/generic-concepts). Is it dead? GitHub says "836 commits behind devel".

From IRC (bridge bot) @FromIRC Nov 30 17:04
<Araq> I reviewed it once and it seemed merge-able
<Araq> I know zahary never stops developing nim, so it's not dead either
<Araq> but progress is SLOW ;⁠-)

bpr (orginal) [2016-12-14T04:41:32+01:00] view original

@vega, I believe I saw some message from @zahary where he said he thought the branch could be merged. I hope the merge is made, even if it's incomplete, and concepts hidden behind a feature gate for experimentals.

Mirror of forum.nim-lang.org

2396 :: Concept[T] design question

Inferences rules