Possible hang when waiting for an aborted ThreadPoolJob

dbyron · February 9, 2012, 12:06am

Consider this sequence of events on a ThreadPool with one thread:

ThreadPool::addJob successfully adds a job to the array of jobs and starts the thread
before ThreadPool::runNextJob gets to run, someone calls ThreadPoolJob::signalJobShouldExit on the just-added job
when ThreadPool::runNextJob gets to run, the first loop where it iterates to find a job doesn’t come up with anything because job->shouldStop is true

So far this seems fine, the job never runs. I don’t see that it ever gets removed from the jobs array but perhaps that’s not so bad.

The struggle comes when someone calls ThreadPool::waitForJobToFinish waiting for the aborted job to finish, potentially waiting forever. jobFinishedSignal is never signalled so it hangs.

I don’t see a particularly easy way around this…or at least one I can guarantee is less racy than what the code currently does. Any ideas, or at least confirmation that I’ve found a hole?

Thanks much.

-DB

TheVinn · February 9, 2012, 12:13am

Umm…I haven’t looked at the ThreadPoolJob code but one solution is to change job identifiers from a small integer into a reference counted object pointer. This way a job persists until the last reference is gone. So after a job is finished or gets signaled to exit, it can be put into a state suitable for inspection (i.e. usable in a call to waitForJobToFinish).

I believe that is the solution used by boost::thread and std::thread.

jules · February 9, 2012, 10:32am

Hmm, this will require me putting on my threading-head and giving it some hard thought. Please remind me again if I don’t look at this soon!

dbyron · February 27, 2012, 8:42pm

Ping?

TheVinn · February 27, 2012, 8:45pm

A workaround is to use boost::thread and re-implement ThreadPool yourself. It should take less than a day.

jules · February 28, 2012, 3:22pm

The ThreadPool class needed a bit of a spring-clean… I’ve been through and tidied it up now, hopefully sorting out this problem in the process. Let me know if you still have any trouble with it!

TheVinn · February 28, 2012, 3:25pm

Interesting, a quick and pain-free solution. But how do you know when the right time to call removeAllJobs () with deleteInactive==true is?

jules · February 28, 2012, 4:45pm

That method no longer has a deleteInactive argument. Each job now knows whether it should be deleted, so there’s no need to tell it what to do. (God knows why I didn’t write it that way in the first place!)

Topic		Replies	Views
ThreadPoolJob Memory Management General JUCE discussion	1	369	August 11, 2009
ThreadPool::runNextJob bug General JUCE discussion	6	442	May 12, 2017
Reusing of ThreadPoolJob General JUCE discussion	2	366	July 14, 2010
Possible crash in ThreadPool General JUCE discussion	18	1713	June 8, 2011
ThreadPool::addJob nuisance asserts and synchronization with re-runnable ThreadPoolJob instances General JUCE discussion	4	538	January 25, 2021

Possible hang when waiting for an aborted ThreadPoolJob

Purchase

Discover

Learn

Support

About

Events

Possible hang when waiting for an aborted ThreadPoolJob

Related topics

Purchase

Discover

Learn

Support

About

Events