Apress
Building Large Language Models from Scratch: Design, Train, and Deploy Llms with Pytorch
Building Large Language Models from Scratch: Design, Train, and Deploy Llms with Pytorch
Regular price
$59.99
Regular price
Sale price
$59.99
Unit price
per
Shipping calculated at checkout.
Couldn't load pickup availability
This book is a complete, hands-on guide to designing, training, and deploying your own Large Language Models (LLMs)--from the foundations of tokenization to the advanced stages of fine-tuning and reinforcement learning. Written for developers, data scientists, and AI practitioners, it bridges core principles and state-of-the-art techniques, offering a rare, transparent look at how modern transformers truly work beneath the surface. Starting from the essentials, you'll learn how to set up your environment with Python and PyTorch, manage datasets, and implement critical fundamentals such as tensors, embeddings, and gradient descent. You'll then progress through the architectural heart of modern models, covering RMS normalization, rotary positional embeddings (RoPE), scaled dot-product attention, Grouped Query Attention (GQA), Mixture of Experts (MoE), and SwiGLU activations, each explored in depth and built step by step in code. As you advance, the book introduces custom CUDA kernel integration, teaching you how to optimize key components for speed and memory efficiency at the GPU level--an essential skill for scaling real-world LLMs. You'll also gain mastery over the phases of training that define today's leading models:
Author: Dilyan Grigorov
Binding Type: Paperback
Publisher: Apress
Published: 04/28/2026
Pages: 530
Weight: 2.14lbs
Size: 9.96h x 7.02w x 1.11d
ISBN: 9798868822964
- Pretraining - Building general linguistic and semantic understanding. Midtraining - Expanding domain-specific capabilities and adaptability. Supervised Fine-Tuning (SFT) - Aligning behavior with curated, task-driven data. Reinforcement Learning from Human Feedback (RLHF) - Refining responses through reward-based optimization for human alignment.
- How to configure and optimize your development environment using PyTorch The mechanics of tokenization, embeddings, normalization, and attention mechanisms. How to implement transformer components like RMSNorm, RoPE, GQA, MoE, and SwiGLU from scratch. How to integrate custom CUDA kernels to accelerate transformer computations. The full LLM training pipeline: pretraining, midtraining, supervised fine-tuning, and RLHF. Techniques for dataset preparation, deduplication, model debugging, and GPU memory management. How to train, evaluate, and deploy a complete GPT-like architecture for real-world tasks.
Author: Dilyan Grigorov
Binding Type: Paperback
Publisher: Apress
Published: 04/28/2026
Pages: 530
Weight: 2.14lbs
Size: 9.96h x 7.02w x 1.11d
ISBN: 9798868822964
