A Domain-Specific Architecture for Training Deep Neural Networks, Norman P. Jouppi, Zhifeng Chen, David Dellweg, George N. Garland, Mark P. Herlihy, Gerard N. John, Nguyet Johnson, Liam K. Kavanagh, Adam Lake, Tibor Lindholm, Matthew R. Markidis, Andrew Myatt, Kevin R. Patuto, Katherine E. Polley, Jason Rolfe, Daniel Smith, Shengqi Wang, Richard J. Ward, Mark White, Martin Wicke, Anna You, Peng Zhao, 2021Proceedings of the 47th Annual International Symposium on Computer Architecture (ISCA '20) (ACM)DOI: 10.1145/3400302.3400309 - Describes the architecture and performance characteristics of Google's Tensor Processing Units (TPUs), offering a comparison point for specialized AI hardware.