Data Processing and Modeling with Hadoop

Data Processing and Modeling with Hadoop
Author: Vinicius Aquino do Vale
Publisher: BPB Publications
Total Pages: 196
Release: 2021-10-12
Genre: Computers
ISBN: 9391392288


Download Data Processing and Modeling with Hadoop Book in PDF, Epub and Kindle

Understand data in a simple way using a data lake. KEY FEATURES ● In-depth practical demonstration of Hadoop/Yarn concepts with numerous examples. ● Includes graphical illustrations and visual explanations for Hadoop commands and parameters. ● Includes details of dimensional modeling and Data Vault modeling. ● Includes details of how to create and define a structure to a data lake. DESCRIPTION The book 'Data Processing and Modeling with Hadoop' explains how a distributed system works and its benefits in the big data era in a straightforward and clear manner. After reading the book, you will be able to plan and organize projects involving a massive amount of data. The book describes the standards and technologies that aid in data management and compares them to other technology business standards. The reader receives practical guidance on how to segregate and separate data into zones, as well as how to develop a model that can aid in data evolution. It discusses security and the measures that are utilized to reduce the impact of security. Self-service analytics, Data Lake, Data Vault 2.0, and Data Mesh are discussed in the book. After reading this book, the reader will have a thorough understanding of how to structure a data lake, as well as the ability to plan, organize, and carry out the implementation of a data-driven business with full governance and security. WHAT YOU WILL LEARN ● Learn the basics of components to the Hadoop Ecosystem. ● Understand the structure, files, and zones of a Data Lake. ● Learn to implement the security part of the Hadoop Ecosystem. ● Learn to work with the Data Vault 2.0 modeling. ● Learn to develop a strategy to define good governance. ● Learn new tools to work with Data and Big Data WHO THIS BOOK IS FOR This book caters to big data developers, technical specialists, consultants, and students who want to build good proficiency in big data. Knowing basic SQL concepts, modeling, and development would be good, although not mandatory. TABLE OF CONTENTS 1. Understanding the Current Moment 2. Defining the Zones 3. The Importance of Modeling 4. Massive Parallel Processing 5. Doing ETL/ELT 6. A Little Governance 7. Talking About Security 8. What Are the Next Steps?


Data Processing and Modeling with Hadoop
Language: en
Pages: 196
Authors: Vinicius Aquino do Vale
Categories: Computers
Type: BOOK - Published: 2021-10-12 - Publisher: BPB Publications

GET EBOOK

Understand data in a simple way using a data lake. KEY FEATURES ● In-depth practical demonstration of Hadoop/Yarn concepts with numerous examples. ● Include
Hadoop Application Architectures
Language: en
Pages: 399
Authors: Mark Grover
Categories: Computers
Type: BOOK - Published: 2015-06-30 - Publisher: "O'Reilly Media, Inc."

GET EBOOK

Get expert guidance on architecting end-to-end data management solutions with Apache Hadoop. While many sources explain how to use various components in the Had
Data Processing and Modeling with Hadoop
Language: en
Pages: 198
Authors: Vinicius Aquino Do Vale
Categories: Apache Hadoop
Type: BOOK - Published: 2021 - Publisher:

GET EBOOK

The book describes the standards and technologies that aid in data management and compares them to other technology business standards. The reader receives prac
Modern Big Data Processing with Hadoop
Language: en
Pages: 390
Authors: V Naresh Kumar
Categories: Computers
Type: BOOK - Published: 2018-03-30 - Publisher: Packt Publishing Ltd

GET EBOOK

A comprehensive guide to design, build and execute effective Big Data strategies using Hadoop Key Features -Get an in-depth view of the Apache Hadoop ecosystem
Big Data Analytics with Hadoop 3
Language: en
Pages: 471
Authors: Sridhar Alla
Categories: Computers
Type: BOOK - Published: 2018-05-31 - Publisher: Packt Publishing Ltd

GET EBOOK

Explore big data concepts, platforms, analytics, and their applications using the power of Hadoop 3 Key Features Learn Hadoop 3 to build effective big data anal