nimforum mirror - A talk on HPC

jxy (orginal) [2017-08-23T23:23:16+02:00] view original

Hi, here is the slides of a talk on HPC I just gave.

https://github.com/jcosborn/cudanim/blob/master/demo3/doc/PP-Nim-metaprogramming-DOE-COE-PP-2017.pdf

I post it here for a wish.

I wish the next release of Nim does not break the code.

dom96 (orginal) [2017-08-24T00:34:26+02:00] view original

Wow, awesome library, brilliant slides and amazing work!

Out of curiosity, where did you give this talk?

jxy (orginal) [2017-08-24T01:45:13+02:00] view original

It's written on the title page. Or find it here

http://www.lanl.gov/asc/doe-coe-mtg-2017.php

cdome (orginal) [2017-08-24T09:59:29+02:00] view original

Superb. Well done, really useful material to convince management to develop more in Nim.

gokr (orginal) [2017-08-24T10:16:34+02:00] view original

Just want to chime in here - looks like very impressive work! And it also (I guess) validates Nim as a very suitable language for these directions (parallellism, GPUs etc).

komerdoor (orginal) [2017-08-26T03:20:55+02:00] view original

I started working on a project like this a while ago supporting OpenCL C, GLSL and SPIR-V. Not to be used for physics, but for machine learning and game programming. Nice to see other ways of doing this.

Udiknedormin (orginal) [2017-08-26T17:48:17+02:00] view original

I work in HPC myself (mostly using Fortran). I liked your slides very much. However I have a few a few questions. For now, I'll focus on the one that puzzles me the most.

Am I misunderstanding the purpose of your ArrayObj type? It seems to me your implementation of macro indexArray*(x: ArrayObj{call}, y: ArrayIndex): untyped is little inflexible. After all, not every function returning an ArrayObj will be just doing element-wise calculations (or should it?)... I guess that's a good time for using procedure-modifying macros which would add them to some compile-time collection and then indexArray will choose the right transformation based on data in the collection (also at compile-time). Also, some special cases could be done easier this way. For example, let's consider the difference between element-wise addition and multiplication and circular shifting:

proc `+`*(x: ArrayObj, y: ArrayObj): ArrayObj {.elemental.}
proc `*`*(x: ArrayObj, y: ArrayObj): ArrayObj {.elemental.}

# (x + y * z)[i]  -->  x[i] + y[i] * z[i]


replace:
  proc rshift*(x: ArrayObj, shift: int): ArrayObj = ...
  
  proc opt(x: ArrayObj, shift: int, i: SomeInteger) =
   if shift + i < x.len:
     x[shift + i - x.len]
   else:
     x[shift + i]
  
  proc opt(x: ArrayObj, shift: static[int], i: static[SomeInteger]) =
    when shift + i < x.len:
      x[shift + i - x.len]
    else:
      x[shift + i]
  
  ...

# if x.len == 10:
#   x.cshift(2)[5]  -->  x[6]
#   x.cshift(2)[9]  -->  x[1]

Actually, macro elemental is quite simple. Unless we would like it to use vectorization or other additional optimizations, of course.

jxy (orginal) [2017-08-29T04:25:11+02:00] view original

Am I misunderstanding the purpose of your ArrayObj type? It seems to me your implementation of macro indexArray*(x: ArrayObj{call}, y: ArrayIndex): untyped is little inflexible. After all, not every function returning an ArrayObj will be just doing element-wise calculations (or should it?)...

You are absolutely correct. The current definition of indexArray in the repository is incomplete. You can find an improved one at

https://github.com/jcosborn/qex/blob/devel/src/new/fieldProxy.nim#L100

Your suggestion of using a macro to annotate procs and modify a global compile time list of element-wise calculations is a good option, too. Perhaps that is a nice option for users to define their own element-wise operations. I will consider this. (J's rank system is in the back of my head, but I will not go down that path anytime soon.)

The complexity of shift goes up easily with MPI and vectorization. In QEX, we have something similar, which is probably the most complicated piece in the code base.

Udiknedormin (orginal) [2017-08-30T01:06:34+02:00] view original

Will you publish the optimization part, which is obviously not tightly bounded to the purpose of the framework, separately? It would enable people to easily build other science libraries, which can lead to this optimization library having more contributors, which will probably be good for your framework too.

jxy (orginal) [2017-08-30T04:34:57+02:00] view original

Good suggestion. Thanks. What are the functionalities would you like to see in the optimization library?

Udiknedormin (orginal) [2017-08-31T21:04:37+02:00] view original

Well, all of the ones mentioned in the pdf, that's for sure. I also noticed you use some nice metaprogramming utilities, e.x. the ones from metaUtils.nim file. I didn't read all of your code so I can't really know what functionalities are separable. ;)

Mirror of forum.nim-lang.org

3119 :: A talk on HPC