Machine Learning Engineering, Andriy Burkov, 2020 (True Positive Inc.) - A practical book addressing the engineering aspects of putting machine learning models into production, covering topics like model serving and different deployment environments.
Batch prediction vs. online prediction, Google Cloud, 2024 (Google Cloud) - Official documentation explaining the distinctions between batch and online prediction services, along with their use cases and typical workflows, from an industry perspective.