Shop Generative AI on Kubernetes by Roland Huss

Generative AI on Kubernetes by Roland Huss

1,750.00

Close
Price Summary
  • 1,750.00
  • 1,750.00
  • 1,750.00
In Stock
Highlights:

BLACK & WHITE Final Release Version
Language ‏ : ‎ English
Paperback, 407 Pages, Edition 2026
A+ PDF Printed On Demand Book!
Local Printed Book!
Title May Be Different.
Delivery All Over Pakistan Charges Will Apply.
Due to constant currency fluctuation, prices are subject to change with or without notice.

Compare
Category: Tags: , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Description

Generative AI on Kubernetes: Operationalizing Large Language Models

Roland Huss, Daniele Zonca

Generative AI is revolutionizing industries, and Kubernetes has fast become the backbone for deploying and managing these resource-intensive workloads. This book serves as a practical, hands-on guide for MLOps engineers, software developers, Kubernetes administrators, and AI professionals ready to combine AI innovation with the power of cloud native infrastructure. Authors Roland Huß and Daniele Zonca provide a clear road map for training, fine-tuning, deploying, and scaling GenAI models on Kubernetes, addressing challenges like resource optimization, automation, and security along the way.

With actionable insights with real-world examples, readers will learn to tackle the opportunities and complexities of managing GenAI applications in production environments. Whether you’re experimenting with large-scale language models or facing the nuances of AI deployment at scale, you’ll uncover expertise you need to operationalize this exciting technology effectively.

Learn how to deploy LLMs more efficiently with optimized inference runtimes
Get hands-on with GPU scheduling, including hardware detection and multinode scaling
Monitor and understand LLM-specific metrics like Time to First Token and token throughput
Know when to fine-tune a model or when retrieval augmentation is the better choice
Discover how to evaluate models with standardized benchmarks before committing GPU resources
Learn to run agentic applications with secure tool integration, identity management, and persistent state

Reviews (0)
0 ★
0 Ratings
5 ★
0
4 ★
0
3 ★
0
2 ★
0
1 ★
0

There are no reviews yet.

Be the first to review “Generative AI on Kubernetes by Roland Huss”

Your email address will not be published. Required fields are marked *

Scroll To Top
Close
Close
Close

My Cart

Shopping cart is empty!

Continue Shopping

Select at least 2 products
to compare