Opened 4 years ago

Last modified 17 months ago

#1012 new defect

Thread Sanitizer warns of data race in geos::util::Interrupt::cancel()

Reported by: macdrevx Owned by: geos-devel@…
Priority: major Milestone: 3.11.0
Component: Default Version: main
Severity: Unassigned Keywords:
Cc: macdrevx

Description

Original ticket: https://github.com/GEOSwift/GEOSwift/issues/192

Actual geos version is 3.7.1 but it's not one of the available options in Version field.

Two threads invoke GEOS_init_r() which in turn invokes geos::util::Interrupt::cancel(). Both threads write to the requested variable, which is detected by the [Thread Sanitizer|https://developer.apple.com/documentation/code_diagnostics/thread_sanitizer] as a data race.

Change History (7)

comment:1 by dbaston, 4 years ago

Both threads write the same value, however. Seems like the bigger issue would be if one thread is checking for an interrupt while another thread is requesting one.

Could change the variable in question to a std::atomic<bool>.

comment:2 by macdrevx, 4 years ago

If I'm reading the source correctly, it seems like the interrupt mechanism can be used from any thread and when it is, it will interrupt only that one thread. Is there any reason why this shouldn't be a thread-specific or context-specific construct?

comment:3 by macdrevx, 4 years ago

Cc: macdrevx added
Version: 3.7.0master

comment:4 by macdrevx, 4 years ago

I've spent some time considering what would be gained by making requested atomic. While it would avoid the data race on that particular memory, it would not result in a thread-safe system overall.

Consider this example: there are long-running operations on 2 threads. A 3rd thread requests an interrupt. The result could be either:

  1. 1 thread is interrupted (passes the check at Interrupt.cpp:66 and executes line 67 to lock out the other thread), or
  2. Both threads are interrupted. (Both threads pass the check at Interrupt.cpp:66 before either one executes line 67).

Which scenario occurs (and within scenario A, which thread is interrupted) depends on the timing.

A better solution would provide an interface that allows the consumer to be specific about their intent. My current thought is that for the thread safe CAPI we could design a new interface that would allow requesting an interrupt only on a specific context. Ideally, any such interface would avoid requiring the CAPI consumer to implement synchronization mechanisms around the context. Optionally, we could maintain the existing GEOS_interruptRequest interface to allow requesting interruption on all contexts.

I have some implementation ideas, but I'll wait to share them until after getting some feedback about whether this general direction is desirable.

comment:5 by pramsey, 3 years ago

Milestone: 3.10.0

comment:6 by robe, 3 years ago

Milestone: 3.10.03.11.0

Retargeting in prep for GEOS 3.10.0 release

comment:7 by dbaston, 17 months ago

A thread-specific interruption mechanism is proposed in https://github.com/libgeos/geos/pull/761

Note: See TracTickets for help on using tickets.