{"product_id":"hands-on-llm-serving-and-optimization-hosting-llms-at-scale-9798341621497","title":"Hands-On LLM Serving and Optimization: Hosting Llms at Scale","description":"\u003cp\u003eLarge language models (LLMs) are the reasoning engines of modern AI. Today, a major inflection point has arrived: as the world races to deploy AI at scale, model inference has moved to the center of the stack. Welcome to the inference era.\u003c\/p\u003e \u003cp\u003eWithout proper optimization, however, LLMs can be expensive and slow to serve. \u003cem\u003eHands-On LLM Serving and Optimization\u003c\/em\u003e is a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.\u003c\/p\u003e \u003cp\u003eIn this hands-on, engineering-focused book, authors Chi Wang and Peiheng Hu combine practical examples, code, and strategies for building robust, performant, and cost-efficient AI token factories. Whether you're building the LLM inference infrastructure or the applications that consume it, a deep understanding of LLM serving will make you a more effective, future-ready engineer as AI transforms how we work and build.\u003c\/p\u003e \u003cul\u003e \u003cli\u003eLearn the foundations of model serving with core concepts, design paradigms, and industry best practices\u003c\/li\u003e \u003cli\u003eUnderstand the common challenges of hosting LLMs at scale\u003c\/li\u003e \u003cli\u003eBalance latency and throughput to meet the demands of AI applications and business requirements\u003c\/li\u003e \u003cli\u003eHost LLMs cost-effectively with practical, code-backed techniques\u003c\/li\u003e \u003c\/ul\u003e \u003cbr\u003e\u003cbr\u003e\u003cb\u003eAuthor:\u003c\/b\u003e Chi Wang,Peiheng Hu\u003cbr\u003e\u003cb\u003eBinding Type:\u003c\/b\u003e Paperback\u003cbr\u003e\u003cb\u003ePublisher:\u003c\/b\u003e O'Reilly Media\u003cbr\u003e\u003cb\u003ePublished:\u003c\/b\u003e 06\/02\/2026\u003cbr\u003e\u003cb\u003ePages:\u003c\/b\u003e 371\u003cbr\u003e\u003cb\u003eWeight:\u003c\/b\u003e 1.31lbs\u003cbr\u003e\u003cb\u003eSize:\u003c\/b\u003e 9.19h x 7.00w x 0.77d\u003cbr\u003e\u003cb\u003eISBN:\u003c\/b\u003e 9798341621497","brand":"O'Reilly Media","offers":[{"title":"Default Title","offer_id":46939916664969,"sku":"9798341621497","price":79.99,"currency_code":"USD","in_stock":false}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0636\/9240\/6921\/files\/img_7b3ded0c-3f88-46de-8b8d-94772157ab59.jpg?v=1781660338","url":"https:\/\/sonsanddaughtersbooks.com\/products\/hands-on-llm-serving-and-optimization-hosting-llms-at-scale-9798341621497","provider":"Sons and Daughters Books","version":"1.0","type":"link"}