nimforum mirror - Is Object significantly less efficient then ref object over here

solomonthewise (orginal) [2022-05-03T18:34:37+02:00] view original

this is my current code it runs but im wondering if it would scale badly or have weird behaviors.

 MalType* = object
    case mk*: Mk
    of mklist: lis*: seq[MalType]
    of mkint: intval: int   etc

compared to this

 MalType* = ref object
    case mk*: Mk
    of mklist: lis*: seq[MalType]
    of mkint: intval: int

or this

 MalType* = object
    case mk*: Mk
    of mklist: lis*: seq[ref MalType]
    of mkint: intval: int

Yardanico (orginal) [2022-05-03T18:41:25+02:00] view original

As long as you don't need shared ownership and no branch in your object variant will contain the recursive type itself (without any kind of container), then you can continue on using object without ref in seq[MalType]. If you need direct recursion, then you'll probably either want to make it ref object or use seq[ref MalType] and ref MalType in all other places to prevent recursion.

solomonthewise (orginal) [2022-05-03T18:47:02+02:00] view original

thanks so much. what do you mean by direct recursion?

treeform (orginal) [2022-05-03T18:51:53+02:00] view original

It really depends. Its very hard to guess the slow parts of your code I recommend running and setting up benchmarks really early. See: https://github.com/treeform/benchy

The ref object are pretty good at most things. If you come form languages like Java, Python, JS, Ruby... they feel like what you are used too. They are easy to pass around. Always mutable. They are just easy to use and I use it as the default object type.

The plain object can be more efficient as they are usually not allocated on the heap and might require less pointer indirection, be better packed in arrays. But they can be worse by doing copies where you don't expect. Require "var" for speed. The bigger and more complex the plain object gets the worse this problem becomes.

But always test!

Examples:

Pixel is best as plain object because they are stored in arrays and are pretty small.

Vector3 is best as plain object because they basically 3 numbers and you do a lot of math and usually want them to copy.

Image is best as a ref object because it already contains pointers, is usually pretty big, and you never want a copy accidentally.

Model3d is best as a ref object because it already contains textures, nodes, geometry, is usually pretty big, and you never want a copy accidentally.

Yardanico (orginal) [2022-05-03T18:52:25+02:00] view original

Something like this:

type MyType = object
  field: MyType

Even if you have an object variant that has this field in one of its branches, its still direct recursion which makes the object take infinite amount of space, so obviously it won't compile :)

solomonthewise (orginal) [2022-05-03T18:56:13+02:00] view original

thanks so much for the detailed reply. I was mainly worried about the recursive aspects since the type has a field seq[itself] as well as it being arbitrarily long (since its a ast).

solomonthewise (orginal) [2022-05-03T18:56:48+02:00] view original

that makes sense. :) thank you

mratsim (orginal) [2022-05-04T07:49:35+02:00] view original

` Require "var" for speed`

Parameter passing and iteration of plain objects should have been fixed

https://github.com/nim-lang/Nim/issues/14421#issuecomment-632734533=

https://github.com/nim-lang/Nim/issues/16897

In your case we lack information on how you use the object. Will it be copied? Will it be shared? If you copy it, do you want the copies to be independent or updates on one copy to update the other? Is there multithreading involved?

Usually if shared or shared update: use ref object, if independent use plain object. If multithreading, use plain object or ptr object and manual memory management (oor implement atomic refcounting).

Then once your program works, profile and reconsider choices. Because maybe the copies are expensive but the solution is not to use ref object but to introduce a cache instead.

planetis (orginal) [2022-05-04T09:14:16+02:00] view original

I would guess that the reason of the slow down is that your code does unneeded copies. See the example, compiled with --mm:arc|orc:

type
  Mk = enum
    mklist, mkint
  MalType* = object
    case mk*: Mk
    of mklist: lis*: seq[MalType]
    of mkint: intval: int

#proc `=copy`(a: var MalType; b: MalType) {.error.}
proc main =
  var a = MalType(mk: mklist, lis: @[MalType(mk: mklist, lis: @[MalType(mk: mkint, intval: 1), MalType(mk: mkint, intval: 2)])])
  var b = a # use var to force a =copy
  b.lis[0].lis.del(1)
  echo a.lis[0].lis.len # use a to prevent =sink

main()

The line var b = a makes a full copy of the tree and after modifying one of them, they are different. Using let creates a cursor that doesn't require a copy, as you can inspect with --expandArc:main and it should be faster. The commented =copy error line lets you find the location of copies and try to eliminate them.

In the case of any ref, you should be aware that assignments create aliases and copying a ref object is very fast as it just a pointer and incrementing a shared counter.

treeform (orginal) [2022-05-06T20:05:59+02:00] view original

Oh I did not know that was fixed. I'll keep that in mind when I benchmark next. Thanks!

Mirror of forum.nim-lang.org

9142 :: Is Object significantly less efficient then ref object over here