AI Systems Performance Engineering:
Optimizing Model Training and Inference Workloads with GPUs, CUDA, and PyTorch
English | 2026 | ASIN: B0F47689K8 | 1276 Pages | EPUB | 18 MB
Optimizing Model Training and Inference Workloads with GPUs, CUDA, and PyTorch
English | 2026 | ASIN: B0F47689K8 | 1276 Pages | EPUB | 18 MB
Elevate your AI system performance capabilities with this definitive guide to maximizing efficiency across every layer of your AI infrastructure. In today's era of ever-growing generative models, AI Systems Performance Engineering provides engineers, researchers, and developers with a hands-on set of actionable optimization strategies. Learn to co-optimize hardware, software, and algorithms to build resilient, scalable, and cost-effective AI systems that excel in both training and inference. Authored by Chris Fregly, a performance-focused engineering and product leader, this resource transforms complex AI systems into streamlined, high-impact AI solutions.