Executing the following code:
```nim
import os

type MyRef = ref object of RootObj
  value: int

proc finalizer(myref: MyRef) =
  echo "finalizer was called"

var myref: MyRef
echo "Use new with finalizer to allocate"
new(myref, finalizer)
myref.value = 100
assert(myref.value == 100)
echo "initialization done"
sleep(1000)
echo "nullify the reference"
myref = nil
assert(myref == nil)
echo "sleep for a bit"
sleep(1000)
echo "Done"
```
gives the following output:

```
Use new with finalizer to allocate
initialization done
nullify the reference
sleep for a bit
Done
```
So it seems that the finalizer for the heap object wasn't called when its only reference was invalidated. How do I ensure that it does get called?

Thanks, it worked.
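For readers following along: Nim's system module provides `GC_fullCollect`, which forces a full collection cycle and runs the finalizers of unreachable objects. A minimal sketch of forcing eager finalization (reusing `MyRef` and `finalizer` from the code above):

```nim
var myref: MyRef
new(myref, finalizer)
myref.value = 100
myref = nil
# With the default refc GC, unreachable objects are normally only
# collected (and their finalizers run) on some later allocation.
# Forcing a full collection makes the finalizer run right away:
GC_fullCollect()
```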
Is that the only way to ensure that the finalizer gets called promptly when the reference count hits zero?
I understand your point. However, there are situations where it is important that a finalizer (or equivalent) is guaranteed to be called when a thread is no longer using a resource.
For me, the use case is shared heap memory. I am attempting to implement a framework that manages heap memory to be shared among threads, without the need for a global GC:
The point here is that it must be guaranteed that when a client thread is no longer using a shared object, the count of referencing threads for that object is decremented. Ideally, that would also cover the case where a client thread terminates without explicitly disposing of the shared object. I had hoped to use the existing GC heap allocation/finalization mechanism for that purpose, but that doesn't appear to be possible at this time. I suppose I could layer another type of reference-counted pointer on top of what Nim already has, but that seems more than a bit redundant, and the current inability to override the assignment operator would make it cumbersome and confusing to use.
It would be very nice if one could tag an object (or type?) for guaranteed (preferably eager) finalization - note that I am not including memory recovery in this, that can usually be deferred. I have seen discussions elsewhere that revolve around this need in other GC'ed languages, but I have yet to see a solution.
If there is some other way to accomplish what I need within the existing Nim framework, I would love to learn about it.
Agree with the first part of your comment, I need guaranteed and eager finalization.
Regarding the second part of your comment (if I understand it correctly): the GC of the client thread doesn't need to know how much memory is being managed for it. The client thread GC just manages proxy objects, each containing a pointer, on its own heap. The managing thread GC manages the shared objects, together with a count of referencing client threads (actually a count of proxy objects) on its heap. When a client thread allocates a proxy object on its heap, the initializer increments the shared object's proxy object count via dereferencing. When a client thread disposes of a proxy object, the proxy object's finalizer (hypothetically) decrements the shared object's proxy object count. When the proxy object count hits zero the finalizer sends a message to the managing thread to dispose of the actual shared object.
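The scheme described above could be sketched roughly as follows. This is only an illustration of the intended design, not working code: the names (`SharedObj`, `Proxy`, `newProxy`) are made up, the cross-thread messaging is stubbed out, and a real implementation would need atomic operations or a lock around `proxyCount`:

```nim
type
  SharedObj = object
    data: int
    proxyCount: int        # number of live proxy objects across client threads

  Proxy = ref object of RootObj
    shared: ptr SharedObj  # raw pointer into the managing thread's heap

proc proxyFinalizer(p: Proxy) =
  # Hypothetically runs when the client thread's GC collects the proxy.
  dec p.shared.proxyCount
  if p.shared.proxyCount == 0:
    discard  # here: send a "dispose" message to the managing thread

proc newProxy(s: ptr SharedObj): Proxy =
  # The initializer registers the finalizer and bumps the shared count.
  new(result, proxyFinalizer)
  result.shared = s
  inc s.proxyCount
```

The whole design hinges on `proxyFinalizer` being called reliably and promptly, which is exactly the guarantee in question.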
All nice and tidy, if the proxy object finalizer actually gets called.
You need neither guaranteed nor eager finalization; you need only an upper bound on unreachable memory.
And yes, I'm fully familiar with the scheme you are trying to implement.
The fact that stack scanning is conservative already makes it impossible to make any guarantees about finalization.
@Varriount Thanks. I agree that polling should work to detect a zero reference count, and would be a good alternative mechanism for the managing thread to clean up shared objects on its heap.
However, from the discussion so far it seems that the central issue is how to guarantee that the reference count gets updated when a client thread is no longer using the shared object.
lou15b: So, if I understand you correctly, it isn't possible to implement GC'd shared heap unless the GC is global (i.e. stop-the-world)?
No, I am saying that your assumptions for what is needed are too strong.
I think I understand Jehan's posts here. Let me illustrate it with an example:
Let's allocate some memory with new. If we delete all the references to one of these memory regions, the GC needs some time to realize that it can be freed (collection happens only at one of the next allocations, when the GC is triggered). And we are ok with that.
So let's go back to your example. There are several threads, and your reference counter is not decreased immediately, just a little later, so you can't free the memory as soon as all the references are deleted. Maybe that's not the end of the world (because it is similar to the previously described case), and your solution is still useful.