More organizations than ever understand the importance of data lake architectures for deriving value from their data. Building a robust, scalable, and performant data lake remains a complex proposition, however, with a buffet of tools and options that need to work together to provide a seamless end-to-end pipeline from data to insights.
This book provides a concise yet comprehensive overview on the setup, management, and governance of a cloud data lake. Author Rukmani Gopalan, a product management leader and data enthusiast, guides data architects and engineers through the major aspects of working with a cloud data lake, from design considerations and best practices to data format optimizations, performance optimization, cost management, and governance.
Learn the benefits of a cloud-based big data strategy for your organizationGet guidance and best practices for designing performant and scalable data lakesExamine architecture and design choices, and data governance principles and strategiesBuild a data strategy that scales as your organizational and business needs increaseImplement a scalable data lake in the cloudUse cloud-based advanced analytics to gain more value from your data
You can read this ebook online in a web browser, without downloading anything or installing software.
This ebook is available in file types:
This ebook is available in:
After you've bought this ebook, you can choose to download either the PDF version or the ePub, or both.
The publisher has supplied this book in DRM Free form with digital watermarking.
You can read this eBook on any device that supports DRM-free EPUB or DRM-free PDF format.
The publisher has supplied this book in encrypted form, which means that you need to install free software in order to unlock and read it.
To read this ebook on a mobile device (phone or tablet) you'll need to install one of these free apps:
To download and read this eBook on a PC or Mac:
The publisher has set limits on how much of this ebook you may print or copy. See details.