Custom cover image
Custom cover image

Optimizing Generative AI Workloads for Sustainability : Balancing Performance and Environmental Impact in Generative AI / by Ishneet Kaur Dua, Parth Girish Patel

By: Contributor(s): Resource type: Ressourcentyp: Buch (Online)Book (Online)Language: English Publisher: Berkeley, CA : Apress, 2024Publisher: Berkeley, CA : Imprint: Apress, 2024Edition: 1st ed. 2024Description: 1 Online-Ressource(XV, 335 p. 101 illus.)ISBN:
  • 9798868809170
Subject(s): Additional physical formats: 9798868809163 | 9798868809187 | Erscheint auch als: 9798868809163 Druck-Ausgabe | Erscheint auch als: 9798868809187 Druck-AusgabeDDC classification:
  • 006.3 23
DOI: DOI: 10.1007/979-8-8688-0917-0Online resources: Summary: Chapter 1: Introduction to Generative AI and Sustainability -- Chapter 2: Fundamentals of Efficient AI Workload Management -- Chapter 3: Hardware Optimization for Generative AI -- Chapter 4: Software Optimization for Generative AI -- Chapter 5: Data Management and Preprocessing -- Chapter 6: Model Training and Inference Optimization -- Chapter 7: Cloud and Edge Computing for Generative AI -- Chapter 8: Energy-efficient AI Deployment and Scaling -- Chapter 9: Sustainable AI Life Cycle Management -- Chapter 10: Case Studies and Best Practices.Summary: This comprehensive guide provides practical strategies for optimizing Generative AI systems to be more sustainable and responsible. As advances in Generative AI such as large language models accelerate, optimizing these resource-intensive workloads for efficiency and alignment with human values grows increasingly urgent. The book starts with the concept of Generative AI and its wide-ranging applications, while also delving into the environmental impact of AI workloads and the growing importance of adopting sustainable AI practices. It then delves into the fundamentals of efficient AI workload management, providing insights into understanding AI workload characteristics, measuring performance, and identifying bottlenecks and inefficiencies. Hardware optimization strategies are explored in detail, covering the selection of energy-efficient hardware, leveraging specialized AI accelerators, and optimizing hardware utilization and scheduling for sustainable operations. You are also guided through software optimization techniques tailored for Generative AI, including efficient model architecture, compression, and quantization methods, and optimization of software libraries and frameworks. Data management and preprocessing strategies are also addressed, emphasizing efficient data storage, cleaning, preprocessing, and augmentation techniques to enhance sustainability throughout the data life cycle. The book further explores model training and inference optimization, cloud and edge computing strategies for Generative AI, energy-efficient deployment and scaling techniques, and sustainable AI life cycle management practices, and concludes with real-world case studies and best practices By the end of this book, you will take away a toolkit of impactful steps you can implement to minimize the environmental harms and ethical risks of Generative AI. For organizations deploying any type of generative model at scale, this essential guide provides a blueprint for developing responsible AI systems that benefit society. What You Will Learn Understand how Generative AI can be more energy-efficient through improvements such as model compression, efficient architecture, hardware optimization, and carbon footprint tracking Know the techniques to minimize data usage, including evaluation, filtering, synthesis, few-shot learning, and monitoring data demands over time Understand spanning efficiency, data minimization, and alignment for comprehensive responsibility Know the methods for detecting, understanding, and mitigating algorithmic biases, ensuring diversity in data collection, and monitoring model fairness .PPN: PPN: 1909517402Package identifier: Produktsigel: ZDB-2-SEB | ZDB-2-CWD | ZDB-2-SXPC
No physical items for this record