Target New ArrivalsGift Ideas for DadClothing, Shoes & AccessoriesHome & DecorKitchen & DiningOutdoor Living & GardenGroceryHousehold EssentialsBabyBeautyPersonal CareSports & OutdoorsHealthWellnessLuggageSchool & Office SuppliesToysElectronicsVideo GamesMovies, Music & BooksParty SuppliesGift IdeasGift CardsPetsUlta Beauty at TargetShop by CommunityTarget OpticalDealsClearanceNew ArrivalsGift Ideas for DadBack to SchoolCollegeTop DealsTarget Circle DealsWeekly AdShop Order PickupShop Same Day DeliveryRegistryRedCardTarget CircleFind Stores
Quantization and Fast Inference - by  Vivek Kalyanarangan (Paperback) - 1 of 1

Quantization and Fast Inference - by Vivek Kalyanarangan (Paperback)

$59.99

Pre-order

Free & easy returns

Free & easy returns

Return this item by mail or in store within 90 days for a full refund.
Eligible for registries and wish lists

About this item

Highlights

  • Get the eBook free when you register your print book at Manning.
  • About the Author: Vivek Kalyanarangan is an AI/ML architect, researcher, and educator with over twelve years of experience designing and deploying large-scale machine learning systems.
  • 350 Pages
  • Computers + Internet,

Description



Book Synopsis



Get the eBook free when you register your print book at Manning.

Today's AI models demand a lot of memory, compute, and server horsepower--which quickly translates into cost. This book show you how you can optimize AI models without architectural redesigns or task-specific compression. It reveals practical techniques for quantization, systematically reducing numerical precision to achieve faster inference, lower memory usage, and cheaper deployment--all with minimal accuracy loss.

From quantization fundamentals to runtime packaging, the book gives you a complete and comprehensive overview of the full quantization pipeline. It starts by deriving quantization mapping from first principles, and then builds your knowledge and skill through techniques for production-tested PTQ and QAT workflows and a fully compressed deployment. You'll learn to apply post-training quantization to production models, run quantization-aware training using fake quantization and straight-through estimators, and handle subtle tradeoffs like activation outliers in LLMs, KV cache pressure, and sub-8-bit formats like NF4 and FP4.

What's inside

- Applying post-training quantization to production models
- Deploying efficiently on CPUs, edge devices, and mobile
- Framework-agnostic techniques and real cross-framework parity testing
- Flowcharts and checklists for efficient decision making

About the reader

For ML engineers and researchers experienced in Python.

About the author

Vivek Kalyanarangan is an AI/ML architect, researcher, and educator with over twelve years of experience designing and deploying large-scale machine learning systems.



About the Author



Vivek Kalyanarangan is an AI/ML architect, researcher, and educator with over twelve years of experience designing and deploying large-scale machine learning systems.
Manufacturer Suggested Age: 22 Years and Up
Language: English
Genre: Computers + Internet
Format: Paperback
Number of Pages: 350
Author: Vivek Kalyanarangan
Street Date: December 29, 2026
TCIN: 1011742509
UPC: 9781633433915
Item Number (DPCI): 247-01-3203
Origin: Made in the USA or Imported
If the item details aren’t accurate or complete, we want to know about it.

Shipping details

Estimated ship dimensions: 1 inches length x 7.38 inches width x 9.25 inches height
Estimated ship weight: 0.924 pounds
We regret that this item cannot be shipped to PO Boxes.
This item cannot be shipped to the following locations: American Samoa (see also separate entry under AS), Guam (see also separate entry under GU), Northern Mariana Islands, Puerto Rico (see also separate entry under PR), United States Minor Outlying Islands, Virgin Islands, U.S., APO/FPO, Alaska, Hawaii

Return details

This item can be returned to any Target store or Target.com.
This item must be returned within 90 days of the date it was purchased in store, delivered to the guest, delivered by a Shipt shopper, or picked up by the guest.
See the return policy for complete information.

Additional product information and recommendations

Discover more options

Get top deals, latest trends, and more.

Privacy policy