Hands-On LLM Serving and Optimization - by Chi Wang & Peiheng Hu (Paperback)
About this item
Highlights
- Large language models (LLMs) are rapidly becoming the backbone of AI-driven applications.
- Author(s): Chi Wang & Peiheng Hu
- 300 Pages
- Computers + Internet, Natural Language Processing
Description
Book Synopsis
Large language models (LLMs) are rapidly becoming the backbone of AI-driven applications. Without proper optimization, however, LLMs can be expensive to run, slow to serve, and prone to performance bottlenecks. As the demand for real-time AI applications grows, along comes Hands-On LLM Serving and Optimization, a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.
In this hands-on book, authors Chi Wang and Peiheng Hu take a real-world approach backed by practical examples and code, assembling essential strategies for designing robust infrastructure that is equal to the demands of modern AI applications. Whether you're building high-performance AI systems or looking to deepen your knowledge of LLM optimization, this indispensable book will serve as a pillar of your success.
- Learn the key principles for designing a model-serving system tailored to popular business scenarios
- Understand the common challenges of hosting LLMs at scale while minimizing costs
- Pick up practical techniques for optimizing LLM serving performance
- Build a model-serving system that meets specific business requirements
- Improve LLM serving throughput and reduce latency
- Host LLMs in a cost-effective manner, balancing performance and resource efficiency