Description
This is an umbrella issue to capture planning and strategy discussions at the sprints.
Our draft plan has three points:
-
Add a new itertool,
serialize()
, which will take a non-threadsafe iterator as input and produce a new iterator that is thread-safe. Multiple threads can access the new iterator which is guaranteed to make serial (one-at-a-time) calls to the upstream iterators. This will be implemented with locks that block__next__
calls while others are pending. The implementation will not buffer calls; instead, it implements blocking to achieve serialization. If applicable,send()
andthrow()
method calls will be forwarded as well. -
The
itertools.tee()
code will have guaranteed semantics. It can take a non-threadsafe iterator from one thread as an input and provide tee objects for other threads to get reliable independent copies of the data stream. The new iterators are only thread-safe if consumed within a single thread. Internally, it buffers data to fulfill this contract. -
Other iterators implemented in C will get only the minimal changes necessary to cause them to not crash in a free-threaded build. The edits should be made in a way that does not impact existing semantics or performance (i.e. do not damage the standard GIL build). Concurrent access is allowed to return duplicate values, skip values, or raise an exception.
Linked PRs
Metadata
Metadata
Assignees
Projects
Status