Asynchronous Dispatch, JAX developers, 2024 - Official JAX documentation explaining the asynchronous execution model and the use of block_until_ready() for accurate timing on accelerators.
XLA Overview, Google Developers, 2024 - Provides context on XLA, the domain-specific compiler JAX uses to generate optimized device code for accelerators; these compiled computations are what JAX dispatches asynchronously.
CUDA C++ Programming Guide, NVIDIA, 2023 - Offers detailed information on asynchronous execution in GPU programming, particularly through CUDA streams, which are similar to the underlying mechanisms JAX uses for dispatching computations.