Napačna izbira? Nič za to! Ponujamo možnost vračila v 30 dneh
Z darilnim bonom ne morete zgrešiti. Obdarovanec lahko v zameno za darilni bon izbere karkoli iz naše ponudbe.
30 dni za vračilo blaga
Most LLM books teach you how to call an API. This one teaches you how to build what's behind it.
As frontier AI shifts toward efficiency, sparsity, and on-device deployment, the engineers who understand the architecture not just the interface are the ones defining what comes next. Building Large Language Models with Python gives you that understanding, from the mathematics of attention to the deployment of a quantized, reasoning-capable model on local hardware.
Written from hard-won production experience, each chapter pairs rigorous theory with complete Python implementations not toy examples, but the kind of code that holds up under the demands of real training runs and live inference pipelines.
What you'll build:
• A Grouped-Query Attention module with KV cache support
• A Top-K sparse MoE layer with load-balancing auxiliary loss
• A BitLinear layer implementing ternary {-1, 0, 1} weights from scratch
• A Vision Transformer encoder with a multimodal projection layer
• A Process Reward Model for step-level reasoning verification
• A full DPO and GRPO training loop for alignment
• A local-first MCP server for agentic tool use
• A speculative decoding pipeline using a draft model
Topics covered include:
Rotary Positional Embeddings (RoPE) · FlashAttention-3 concepts · Quantization-Aware Training vs. Post-Training Quantization · Expert parallelism and All-to-All communication · FSDP vs. DDP distributed training · PagedAttention and KV cache optimization · On-device LoRA fine-tuning · Chain-of-thought reasoning architecture
This book is for you if:
You're a software engineer or ML practitioner comfortable with Python and PyTorch
You understand how a basic transformer works and want to go significantly deeper
You want to move beyond using models to building and owning them
You're building for edge deployment, private AI, or resource-constrained environments
The field is moving fast. This book is written for engineers who intend to move faster.