Implementing Lowering Rules for Backends (CPU/GPU/TPU)
Was this section helpful?
How to write JAX XLA Lowering, JAX developers, 2024 (JAX documentation) - Explains how to define lowering rules for custom JAX primitives, covering XLA HLO and API usage like xla_client.register_translation.
XLA: Accelerated Linear Algebra, Google, 2024 - Presents an overview of the XLA compiler, its function, and links to information about its intermediate representation (HLO) and hardware targets.
A Domain-Specific Compiler for Tensor Processing Units, Heejin Jo, Andrew Siena, Mark Weiser, Nicholas P. Johnson, Robert S. French, Paul M. Smith, Kevin S. Lee, Cliff L. Biffle, Eric S. Chung, and Norman P. Jouppi, 2021Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '21) (ACM)DOI: 10.1145/3445814.3446700 - Examines how the XLA compiler is optimized for Google's Tensor Processing Units (TPUs), providing examples of backend-specific compiler strategies for specialized hardware.