Thunk cost estimation, chunk caching, benchmark updates #210

jpsamaroo · 2021-04-08T21:28:20Z

This PR adds runtime estimation of thunk costs (per signature) to get as close to max utilization as possible (without impacting total runtime). It also adds chunk data caches to workers, which will be freed by the scheduler once they're no longer needed.

~~Depends on JuliaData/MemPool.jl#49~~ Not doing this for now, wrong API for performance reasons.

Closes #205

Todo:

Parameterize function cost cache on argument types
Cache Chunk arguments per-process
Also key chunk cache on processor
Track Chunk usage and evict from caches ASAP
Test caching behavior
~~Scale thunk cost by number of running thunks on same processor~~ Something for later, possibly
~~Add lots more benchmarks from https://blog.dask.org/2017/07/03/scaling~~ Add Dask scheduler performance benchmarks #220
Confirm we benchmark better than master

src/sch/Sch.jl

Adds a linked list-based cache of available processors (O(N)->O(1) best case) Adds round-robin scheduling option to SchedulerOptions Concretizes some Ref types in ComputeState

Measure and cache task cost in scheduler (per-function) Use estimated task cost to indicate expected pressure Batch up per-processor task launches into one remote_do Record load average for future usage Reorganization of Sch.jl Fix init_proc capacity detection

Start only a single render server in live mode Add Context copy ctor Allow rendering to fail, not hang Disable rendering by default Reduce bench samples from 5 to 3 Summarize bench results with minimum Add option to automatically run visualize script post-benchmarks

jpsamaroo force-pushed the jps/ucx branch from 4401de1 to 546be66 Compare April 16, 2021 19:24

jpsamaroo changed the title ~~Allow changing default network for transfers~~ Alternate network support for UCX and various scheduler optimizations Apr 16, 2021

vchuravy reviewed Apr 19, 2021

View reviewed changes

src/sch/Sch.jl Outdated Show resolved Hide resolved

jpsamaroo force-pushed the jps/ucx branch 2 times, most recently from 78be8b7 to 2f22449 Compare April 24, 2021 17:24

jpsamaroo changed the title ~~Alternate network support for UCX and various scheduler optimizations~~ Thunk cost estimation, chunk caching, benchmark updates May 10, 2021

jpsamaroo force-pushed the jps/ucx branch from faca41c to 35d6a13 Compare May 10, 2021 21:25

jpsamaroo added performance scheduler data movement labels May 11, 2021

jpsamaroo force-pushed the jps/ucx branch from e522285 to e3e9bd0 Compare May 14, 2021 02:38

jpsamaroo mentioned this pull request May 19, 2021

Help with speedup of parallel task #204

Open

jpsamaroo added 17 commits May 20, 2021 20:23

Use cached worker capacity

80fbb7e

Optimize scheduling with processor cache

c5ef04f

Adds a linked list-based cache of available processors (O(N)->O(1) best case) Adds round-robin scheduling option to SchedulerOptions Concretizes some Ref types in ComputeState

Parameterize thunk cost on signature

b15a40d

Don't lock until capacity is fetched

ead347f

Cache chunks per-process

3ae46f4

Clean-up cached chunks on scheduler exit

88cf25e

Use UInt instead of Float64 for pressure

9d0941e

Split Gantt/Prof into 2 webpages

cb38145

Refcount profiler start/stop

1bca30e

Update visualize script for new output format

80ee287

Key chunk cache also on processor

17cb6c0

Evict Chunks during finishing

7887d15

Test caching behavior

4829f73

LB MemPool to 0.3.4

eff5f53

Remove choose_processor import

5769b09

jpsamaroo force-pushed the jps/ucx branch from 23050d3 to 5769b09 Compare May 21, 2021 14:00

jpsamaroo added 3 commits May 28, 2021 18:26

Fix worker fault reporting

a29a799

Remove dead code in ThreadProc

d7d1b0a

Improve debugging utilities

a71c1f0

jpsamaroo closed this May 29, 2021

jpsamaroo reopened this May 29, 2021

jpsamaroo force-pushed the jps/ucx branch from 145285e to 160f959 Compare May 29, 2021 12:20

jpsamaroo marked this pull request as ready for review May 29, 2021 12:22

Mostly fix fault tolerance

940fdf8

jpsamaroo force-pushed the jps/ucx branch from 160f959 to 940fdf8 Compare May 29, 2021 12:27

jpsamaroo merged commit 2e2badc into master May 29, 2021

jpsamaroo deleted the jps/ucx branch May 29, 2021 12:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thunk cost estimation, chunk caching, benchmark updates #210

Thunk cost estimation, chunk caching, benchmark updates #210

jpsamaroo commented Apr 8, 2021 •

edited

Loading

Thunk cost estimation, chunk caching, benchmark updates #210

Thunk cost estimation, chunk caching, benchmark updates #210

Conversation

jpsamaroo commented Apr 8, 2021 • edited Loading

jpsamaroo commented Apr 8, 2021 •

edited

Loading