{"product_id":"hands-on-llm-serving-and-optimization-hosting-llms-at-scale","title":"Hands-On LLM Serving and Optimization Hosting LLMs at Scale","description":"\u003cp\u003e\u003cb\u003eAll Indian Reprints of O'Reilly are printed in Grayscale\u003c\/b\u003e\u003c\/p\u003e\n\u003cp\u003eLarge language models (LLMs) are the reasoning engines of modern AI. Today, a major inflection point has arrived: as the world races to deploy AI at scale, model inference has moved to the center of the stack. Welcome to the inference era.\u003c\/p\u003e\n\u003cp\u003eWithout proper optimization, however, LLMs can be expensive and slow to serve. \u003cem\u003eHands-On LLM Serving and Optimization\u003c\/em\u003e is a comprehensive guide to the complexities of deploying and optimizing LLMs at scale.\u003c\/p\u003e\n\u003cp\u003eIn this hands-on, engineering-focused book, authors Chi Wang and Peiheng Hu combine practical examples, code, and strategies for building robust, performant, and cost-efficient AI token factories. Whether you’re building the LLM inference infrastructure or the applications that consume it, a deep understanding of LLM serving will make you a more effective, future-ready engineer as AI transforms how we work and build.\u003c\/p\u003e\n\u003cul\u003e\n\u003cli\u003eLearn the foundations of model serving with core concepts, design paradigms, and industry best practices\u003c\/li\u003e\n\u003cli\u003eUnderstand the common challenges of hosting LLMs at scale\u003c\/li\u003e\n\u003cli\u003eBalance latency and throughput to meet the demands of AI applications and business requirements\u003c\/li\u003e\n\u003cli\u003eHost LLMs cost-effectively with practical, code-backed techniques\u003c\/li\u003e\n\u003c\/ul\u003e","brand":"BOOKZONE","offers":[{"title":"Default Title","offer_id":45370424557647,"sku":"9789368080527","price":1890.0,"currency_code":"INR","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0635\/9583\/9567\/files\/hands-on-llm-serving-and-optimization-hosting-llms-at-scale-9352180.jpg?v=1780202345","url":"https:\/\/bookzoneindia.com\/products\/hands-on-llm-serving-and-optimization-hosting-llms-at-scale","provider":"BOOKZONE","version":"1.0","type":"link"}