New ArrivalsGift IdeasHoliday Hosting & EntertainingChristmasAI Gift FinderClothing, Shoes & AccessoriesToysElectronicsBeautyGift CardsHomeFurnitureCharacter ShopBabyKitchen & DiningGroceryHousehold EssentialsSchool & Office SuppliesVideo GamesMovies, Music & BooksSports & OutdoorsBackpacks & LuggagePersonal CareHealthPetsUlta Beauty at TargetTarget OpticalParty SuppliesClearanceTarget New Arrivals Target Finds #TargetStyleHanukkahStore EventsAsian-Owned Brands at TargetBlack-Owned or Founded Brands at TargetLatino-Owned Brands at TargetWomen-Owned Brands at TargetLGBTQIA+ ShopTop DealsTarget Circle DealsWeekly AdShop Order PickupShop Same Day DeliveryRegistryRedCardTarget CircleFind Stores
Build a Deepseek Model (from Scratch) - (From Scratch) by  Raj Abhijit Dandekar & Rajat Dandekar & Naman Dwivedi & Sreedath Pana (Paperback) - 1 of 1

Build a Deepseek Model (from Scratch) - (From Scratch) by Raj Abhijit Dandekar & Rajat Dandekar & Naman Dwivedi & Sreedath Pana (Paperback)

$59.99

Pre-order

Eligible for registries and wish lists

Sponsored

About this item

Highlights

  • Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.
  • About the Author: Dr. Raj Abhijit Dandekar is a computer scientist and co-founder of Vizuara AI Labs, an online education platform that has trained over 50,000 students globally.
  • 325 Pages
  • Computers + Internet, Computer Science
  • Series Name: From Scratch

Description



Book Synopsis



Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.

When DeepSeek started making waves in January 2025, it sounded too good to be true. How could a generative AI model get such incredible performance with such low training and operation costs? By creatively blending a variety of strategies and innovations like Mixture of Experts, Latent Attention, Multi-token Prediction, model distillation, and efficient parallelization, DeepSeek set a new standard for what's possible in an open LLM.

Now, in this book you can recreate a laptop-scale version of this cutting-edge model yourself! Learn how to build the features that set DeepSeek apart from other top LLMs!

In Build a DeepSeek Model (From Scratch) you will learn how to:

- Implement DeepSeek's core architectural innovations, including Multi-Head Latent Attention and Mixture-of-Experts layers
- Build a production-ready training pipeline with Multi-Token Prediction and FP8 quantization for efficiency and speed
- Maximize hardware utilization with parallelism strategies like DualPipe
- Apply post-training methods such as supervised fine-tuning and reinforcement learning to unlock reasoning capabilities
- Compress and distill large models into smaller, deployable versions for real-world use

In Build a DeepSeek Model (From Scratch) you'll build your own DeepSeek clone from the ground up. First, you'll quickly review LLM fundamentals, with an eye to where DeepSeek's innovations address the common problems and limitations of standard models. Then, you'll learn everything you need to create your own DeepSeek-inspired model, including the innovations that put DeepSeek on the map: Multihead Latent Attention (MLA), Multi-Token Prediction (MTP), Mixture of Experts (MoE), model distillation, and reasoning.

About the book

Build a DeepSeek Model (From Scratch) uses intuitive visualizations, code walkthroughs, and a problem-solution narrative to transform complex concepts into practical skills. You will start by coding a DeepSeekAttention module, progress to building a fully functional MoE layer, and set up a high-efficiency training pipeline. By the end of the book, you will have a fully operational mini-DeepSeek that runs on your laptop, along with the skills to extend and optimize it for your own research or production applications.

About the reader

For intermediate-to-advanced ML engineers, AI researchers, and graduate students who want to go beyond prebuilt models. You'll need to know deep learning and Python programming.

About the author

Dr. Raj Abhijit Dandekar is a computer scientist and co-founder of Vizuara AI Labs, an online education platform that has trained over 50,000 students globally. He holds a PhD from MIT and is the lead instructor of the popular YouTube series Build DeepSeek from Scratch.

Dr. Rajat Dandekar, PhD in Mechanical Engineering from Purdue University, specializes in applying machine learning to complex physical systems. He co-founded Vizuara AI Labs.

Naman Dwivedi is an AI researcher at Vizuara AI Labs, specializing in turning advanced deep learning concepts into hands-on, practical code.

Dr. Sreedath Pana holds a PhD from MIT and is a co-founder of Vizuara AI Labs. He is an inventor and AI engineer known for creating self-cleaning AI-powered solar technology.



About the Author



Dr. Raj Abhijit Dandekar is a computer scientist and co-founder of Vizuara AI Labs, an online education platform that has trained over 50,000 students globally. He holds a PhD from MIT and is the lead instructor of the popular YouTube series Build DeepSeek from Scratch.

Dr. Rajat Dandekar, PhD in Mechanical Engineering from Purdue University, specializes in applying machine learning to complex physical systems. He co-founded Vizuara AI Labs.

Naman Dwivedi is an AI researcher at Vizuara AI Labs, specializing in turning advanced deep learning concepts into hands-on, practical code.

Dr. Sreedath Pana holds a PhD from MIT and is a co-founder of Vizuara AI Labs. He is an inventor and AI engineer known for creating self-cleaning AI-powered solar technology.

Dimensions (Overall): 9.25 Inches (H) x 7.38 Inches (W)
Weight: .86 Pounds
Suggested Age: 22 Years and Up
Number of Pages: 325
Genre: Computers + Internet
Sub-Genre: Computer Science
Series Title: From Scratch
Publisher: Manning Publications
Format: Paperback
Author: Raj Abhijit Dandekar & Rajat Dandekar & Naman Dwivedi & Sreedath Pana
Language: English
Street Date: June 30, 2026
TCIN: 1008466364
UPC: 9781633434325
Item Number (DPCI): 247-04-4211
Origin: Made in the USA or Imported
If the item details aren’t accurate or complete, we want to know about it.

Shipping details

Estimated ship dimensions: 1 inches length x 7.38 inches width x 9.25 inches height
Estimated ship weight: 0.858 pounds
We regret that this item cannot be shipped to PO Boxes.
This item cannot be shipped to the following locations: American Samoa (see also separate entry under AS), Guam (see also separate entry under GU), Northern Mariana Islands, Puerto Rico (see also separate entry under PR), United States Minor Outlying Islands, Virgin Islands, U.S., APO/FPO

Return details

This item can be returned to any Target store or Target.com.
This item must be returned within 90 days of the date it was purchased in store, shipped, delivered by a Shipt shopper, or made ready for pickup.
See the return policy for complete information.

Related Categories

Get top deals, latest trends, and more.

Privacy policy

Footer

About Us

About TargetCareersNews & BlogTarget BrandsBullseye ShopSustainability & GovernancePress CenterAdvertise with UsInvestorsAffiliates & PartnersSuppliersTargetPlus

Help

Target HelpReturnsTrack OrdersRecallsContact UsFeedbackAccessibilitySecurity & FraudTeam Member ServicesLegal & Privacy

Stores

Find a StoreClinicPharmacyTarget OpticalMore In-Store Services

Services

Target Circle™Target Circle™ CardTarget Circle 360™Target AppRegistrySame Day DeliveryOrder PickupDrive UpFree 2-Day ShippingShipping & DeliveryMore Services
PinterestFacebookInstagramXYoutubeTiktokTermsCA Supply ChainPrivacy PolicyCA Privacy RightsYour Privacy ChoicesInterest Based AdsHealth Privacy Policy