
Add :greedy scheduler to @threads #52096


Merged: 11 commits merged into JuliaLang:master on Feb 6, 2024

Conversation

@Seelengrab (Contributor) commented Nov 9, 2023

This implements a very greedy scheduler for @threads: it spawns up to threadpoolsize() tasks, each of which greedily takes elements from the iterator as they are produced. This scheduler supports infinite iterators. (An illustrative sketch follows the task list below.)

Needs

  • Tests
  • More extensive docs
  • News
  • Thread safety review
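
For illustration, here is a minimal sketch of the pattern described above. This is not the PR's actual implementation; `greedy_foreach` is a hypothetical name, and the zero-capacity Channel as hand-off point is an assumption:

```julia
using Base.Threads

# Sketch only, not the PR's implementation: a producer task feeds the
# iterator into a Channel, and up to threadpoolsize() consumer tasks take
# elements as soon as they become free. Works for infinite iterators since
# elements are handed out one at a time.
function greedy_foreach(f, itr)
    ch = Channel{eltype(itr)}(0) do ch
        for x in itr
            put!(ch, x)
        end
    end
    @sync for _ in 1:Threads.threadpoolsize()
        Threads.@spawn for x in ch
            f(x)
        end
    end
    return nothing
end

greedy_foreach(x -> println(x^2), 1:10)
```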

@giordano added labels: multithreading (Base.Threads and related functionality), needs tests, needs docs, needs news (Nov 9, 2023)
@Seelengrab removed label: needs tests (Nov 9, 2023)
@carstenbauer (Member) commented Nov 10, 2023

Some initial benchmarks: https://github.com/carstenbauer/parallel-julia-zoo/tree/greedy/multithreading

In particular, see the *.greedy.out files.

@threads :greedy looks good (compared to :static/:dynamic) in the juliaset benchmark (non-uniform workload):

benchmark
  serial             2.187 s    (0 allocations: 0 bytes)
  spawn              274.958 ms (40019 allocations: 4.03 MiB)
  threads :dynamic   544.403 ms (42 allocations: 4.25 KiB)
  threads :static    544.766 ms (42 allocations: 4.25 KiB)
  threads :greedy    281.027 ms (64 allocations: 6.23 KiB)
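
For context, the new scheduler is selected the same way as :static and :dynamic. A hedged usage sketch with an intentionally non-uniform workload (the actual juliaset benchmark code lives in the repository linked above):

```julia
using Base.Threads

# Illustrative only: per-iteration work grows with i, the regime where
# :greedy's per-item load balancing pays off.
results = zeros(100)
@threads :greedy for i in 1:100
    acc = 0.0
    for k in 1:(i * 10_000)   # later iterations do far more work
        acc += sin(k)
    end
    results[i] = acc          # distinct indices, so this is race-free
end
```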

The worst result is probably in the mapreduce_small benchmark (small uniform workload per task). Here the overhead (locking of the channel?) shows up very clearly:

# 100*nthreads() many chunks, f = sin
benchmark
  serial             589.951 μs (0 allocations: 0 bytes)
  spawn              430.513 μs (4814 allocations: 437.91 KiB)
  threads :static    76.664 μs  (45 allocations: 10.69 KiB)
  threads :dynamic   92.247 μs  (45 allocations: 10.69 KiB)
  threads :greedy    1.490 ms   (72 allocations: 14.13 KiB)

@Seelengrab (Contributor, Author) commented Nov 10, 2023

> 100*nthreads() many chunks, f = sin benchmark

To reference the conversation on Slack: that benchmark was run with N = 10_000 and 8 threads, so 80_000 elements in the input array, or about 64 KiB of data. With 800 chunks, that's a minuscule amount of data per chunk, which increases the overhead dramatically. I'll keep that in mind for the docs of :greedy, as a sort of "when is this a good idea to use".

In general, I don't expect :greedy to be used for work items that individually take on the order of microseconds or less. In that regime, the task spawning and channel locking overhead becomes very noticeable.
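
One possible mitigation in that regime (my suggestion here, not something this PR does): batch the iterator so each hand-off covers a whole chunk of elements, amortizing the locking cost. The chunk size of 1_000 is an arbitrary assumption:

```julia
using Base.Threads

xs = rand(80_000)
out = similar(xs)
# Each item handed to a task is now a whole chunk of indices, so the
# per-element spawning/locking overhead is amortized over 1_000 elements
# instead of being paid once per element.
@threads :greedy for chunk in Iterators.partition(eachindex(xs), 1_000)
    for i in chunk
        out[i] = sin(xs[i])
    end
end
```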

@simonbyrne (Member) commented

A bit of an aside, but could we actually make @threads extensible somehow? I.e., give it a proper interface, e.g. for partitioning iterators, specifying how many iterator items to assign to each task, etc.
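
Purely as a strawman of what such an interface might look like (hypothetical names throughout; nothing like this exists in Base):

```julia
# Hypothetical sketch only: a scheduler object with overloadable hooks
# instead of a hardcoded Symbol. None of these names exist in Base.
abstract type AbstractScheduler end

struct ChunkedScheduler <: AbstractScheduler
    items_per_task::Int
end

# A hook a user-defined scheduler could overload to control partitioning:
schedule_partition(s::ChunkedScheduler, itr) =
    Iterators.partition(itr, s.items_per_task)

# Usage might then look like:
#   @threads ChunkedScheduler(16) for x in itr ... end
```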

@Seelengrab (Contributor, Author) commented

> A bit of an aside, but could we actually make @threads extensible somehow?

I was thinking about that too while implementing this, but it's likely better suited for a later PR. It came up in particular because I originally wanted this implementation to simply replace :dynamic, but it turns out we actually guarantee that each task processes contiguous regions:

> :dynamic (default)
>
> :dynamic scheduler executes iterations dynamically to available worker threads. Current implementation assumes that the workload for each iteration is uniform. However, this assumption may be removed in the future.
>
> This scheduling option is merely a hint to the underlying execution mechanism. However, a few properties can be expected. The number of Tasks used by :dynamic scheduler is bounded by a small constant multiple of the number of available worker threads (Threads.threadpoolsize()). **Each task processes contiguous regions of the iteration space.**

Emphasis mine. The problem is that we can't then swap the underlying implementation for a greedy, work-stealing one (or anything else that load-balances on a per-item basis), since that would mean the regions/items any given task works on are no longer contiguous. In essence, this makes :dynamic not dynamic at all, and rather more like :static (except for being properly nestable across threaded regions). A sketch of the guaranteed contiguous split follows below.
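
To make the contiguity guarantee concrete, here is a hedged sketch of the kind of split that :dynamic and :static promise. This is not Base's actual partitioning code, just the property it guarantees:

```julia
# Every task gets exactly one contiguous block of the iteration space.
function contiguous_blocks(r::UnitRange, ntasks::Integer)
    len, rem = divrem(length(r), ntasks)
    blocks = UnitRange{Int}[]
    lo = first(r)
    for t in 1:ntasks
        hi = lo + len - 1 + (t <= rem ? 1 : 0)
        push!(blocks, lo:hi)
        lo = hi + 1
    end
    return blocks
end

contiguous_blocks(1:10, 4)  # [1:3, 4:6, 7:8, 9:10]

# A greedy/work-stealing scheduler cannot preserve this property: the items
# any one task ends up processing are interleaved with other tasks' items.
```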

@Seelengrab removed labels: needs docs, needs news (Nov 12, 2023)
@Seelengrab Seelengrab marked this pull request as ready for review November 12, 2023 12:26
@Seelengrab (Contributor, Author) commented

Does this need anything else?

@Seelengrab (Contributor, Author) commented

CI failures seem unrelated. Does this need anything else?

@Seelengrab (Contributor, Author) commented

No idea what the SparseArrays build/test failures mean; this PR shouldn't touch them at all. Is this just master being flaky again? Other than that, @vtjnash, if you agree I think this is good to go.

@Seelengrab (Contributor, Author) commented

What's the status here?

@vtjnash added label: merge me (PR is reviewed; merge when all tests are passing) and removed label: status: waiting for PR reviewer (Feb 5, 2024)
@IanButterworth merged commit 94fd312 into JuliaLang:master on Feb 6, 2024
@Seelengrab (Contributor, Author) commented

Thank you!

@IanButterworth removed label: merge me (Feb 6, 2024)
Labels: multithreading (Base.Threads and related functionality)
8 participants