nimforum mirror - Porting python code to nim

RaphaelHythloday (orginal) [2015-05-07T04:07:35+02:00] view original

I made an algorithm for my science fair project, and since I was mainly just interested in verifying its correctness (10000 correct answers speaks louder than a formal proof when I'm the one doing the proof) I wrote it in python. Now I'd like a high-performance implementation. Can you give me some pointers on porting this function to nim?


#C is a set of signed ints
def negate(C):
    
    return set(map(lambda l: -l, C))

#phi is a list of sets of signed ints, n is an unsigned int, m is an unsigned int, epsilon is a list of unsigned bigints
def memoized(phi,n,m,epsilon):
    
    if filter(lambda c: not c, phi):
        return 2**n
    
    for i in range(m - 1, -1, -1):
        phiprime = [C - phi[i] for C in phi[i+1:] if not negate(C) & phi[i]]
        epsilonprime = map(lambda x: epsilon[x + len(phi) - len(phiprime)]/2**len(phi[i]), range(0,len(phiprime)))
        
        mprime  = len(phi)
        for x in range(len(phi) - 1, i, -1):
            if not phi[x] & phi[i] and not negate(phi[x]) & phi[i]:
                mprime = x
            else:
                break
        mprime -= len(phi) - len(phiprime)
        
        epsilon[i] = 2**(n - len(phi[i])) -  memoized(phiprime, n - len(phi[i]), mprime, epsilonprime)
    
    return sum(epsilon)

Varriount (orginal) [2015-05-07T04:52:31+02:00] view original

Some tips to take into account when porting:

Slices are inclusive on both sides.

Slicing a sequence will create a copy of the sliced contents.

Nim doesn't have native Big Int datatypes.

The lambdas should be turned into templates or procedures.

def (orginal) [2015-05-07T05:16:11+02:00] view original

Nim doesn't have native Big Int datatypes.

Implemented in Nim, not very performant: https://github.com/def-/nim-bigints

Wrapper for GMP, very performant: https://github.com/jovial/nim-gmp

Varriount (orginal) [2015-05-07T10:16:50+02:00] view original

I stand corrected (although the gmp wrapper looks like it could use some work with regards to 'natural' usage via operators)

Araq (orginal) [2015-05-07T11:48:36+02:00] view original

Slicing a sequence will create a copy of the sliced contents.

Python copies too:


x = [1, 2, 3]
y = x[:]
y is x # false

RaphaelHythloday (orginal) [2015-05-07T20:54:01+02:00] view original

Wait... Nim doesn't have lambda? What would the macro to implement lambda be?

def (orginal) [2015-05-07T23:36:34+02:00] view original

Wait... Nim doesn't have lambda? What would the macro to implement lambda be?

There are anonymous procs, but they're not as comfortable to use, mainly because you need to supply the argument and return types. Also, the thread poster asked for "high performance", so lambdas wouldn't be ideal.

Varriount (orginal) [2015-05-08T12:55:55+02:00] view original

Couldn't render post #7457.

RaphaelHythloday (orginal) [2015-05-09T02:39:07+02:00] view original

First of all, thank you so much!!! Answers to your questions:

Yes, and that is a mistake, I fixed it. Really sorry, the function used to return a tuple so I could measure how many recursive calls it was making without worrying about a debugger. I took it out to make the code simpler for porting. Anyway, that line should be, as I fixed it above, return 2**n.

Yes, m will always at most len(phi) - 1. You can put that in as an assertion if you like.

You're right except it doesn't check if the set contains any zeros, it checks if the intersection of negate(C) and phi[i] is empty. This is important because for the algorithm to work there can't be x such that x is in C and -x is in phi[i]. By negating and intersecting we can assure that doesn't happen.

But map and filter are my friends! Also map and filter are never hard to figure out in my opinion, it's only reduce that's sometimes tricky, though I like solving the puzzles!

A question for you: is a collection false or true depending on whether its empty or not in nim like in python? I really like that, as you can tell I use it all the time.

P.S. I also corrected one other mistake on the second to last line related to removing the calls variable. I promise there aren't any others :).

RaphaelHythloday (orginal) [2015-05-09T03:52:39+02:00] view original

Also, regarding the GMP wrapper, wouldn't it be easy to make macros that let you use the bigints with the primitive operators?

kirbyfan64sos (orginal) [2015-05-09T22:26:22+02:00] view original

@Varriount You've never used Haskell, have you? ;)

Jehan (orginal) [2015-05-10T00:56:52+02:00] view original

def: Also, the thread poster asked for "high performance", so lambdas wouldn't be ideal.

I generally use iterators with an enumerate template for this purpose:

template enumerate(s: untyped): auto =
  block:
    iterator temp(): auto = s
    var result = newSeq[type(temp())]()
    for item in temp():
      add(result, item)
    result

import sequtils

proc mapExample =
  let x = toSeq(1..10)
  let y = enumerate do:
    for i in x:
      yield i * i
  echo y

proc filterExample =
  let x = toSeq(1..10)
  let y = enumerate do:
    for i in x:
      if i mod 2 == 0:
        yield i
  echo y

proc filterMapExample =
  let x = toSeq(1..10)
  let y = enumerate do:
    for i in x:
      if i mod 2 == 0:
        yield i*i
  echo y

mapExample()
filterExample()
filterMapExample()

Edit: Copied and pasted the wrong version of enumerate.

Jehan (orginal) [2015-05-10T01:02:42+02:00] view original

RaphaelHythloday: Also, regarding the GMP wrapper, wouldn't it be easy to make macros that let you use the bigints with the primitive operators?

I've taken a first stab at that here if someone is interested in tidying this up.

Note that you may have to pass along a -I option to --passC and a -L option to --passL so that the C compiler knows where to find the include file and the library. The code intentionally does not use the dynlib pragma, because when you're working with GMP, you often want to control exactly which version of the library you're using, which is difficult to do with dynlib's dlopen()-based approach.

jlp765 (orginal) [2015-05-10T08:13:24+02:00] view original

Rather than the enumerate template, sequtils module has

mapIt() which on my machine was faster than using enumerate() template :-)

filter() which on my machine was slower than using enumerate() template :-(

Using the below template which is mashed together from various templates in sequtils.nim, it was of the similar speed as mapIt() :-)

Of course, you can use mapFilterIt for mapping, filtering and mapFiltering

template mapFilterIt(seq1, typ, op, pred: expr): expr {.immediate.} =
  var result {.gensym.}: seq[typ] = @[]
  for it {.inject.} in items(seq1):
    if pred: result.add(op)
  result

let x = toSeq(1..10)

# mapping
var y = x.mapFilterIt(int, it*it, true)
echo y                      # @[1, 4, 9, 16, 25, 36, 49, 64, 81, 100]

# filtering
y = x.mapFilterIt(int, it, (it mod 2 == 0))
echo y                      # @[2, 4, 6, 8, 10]

# mapFiltering
y = x.mapFilterIt(int, it*it, (it mod 2 == 0))
echo y                      # @[4, 16, 36, 64, 100]

Jehan (orginal) [2015-05-10T11:21:14+02:00] view original

mapIt and enumerate generate the same code, so if there's a performance difference [1], there's something strange going on (did you compile with the same options each time?). The point of enumerate is that it's more general and doesn't rely on a hardcoded variable name. The downside is that it's generally more code to write.

[1] Unless you used the in-place version of mapIt, which has different semantics.

jlp765 (orginal) [2015-05-10T13:20:38+02:00] view original

I am guessing the speed difference was because I did the proc like the following:

proc mapExample1 =
  let x = toSeq(1..10)
  let y = x.mapIt(int, it*it)

proc mapExample2 =
  let x = toSeq(1..10)
  let y = x.mapFilterIt(int, it*it, true)

Mirror of forum.nim-lang.org

1203 :: Porting python code to nim