Learning Spark

Learning Spark
Author: Jules S. Damji
Publisher: O'Reilly Media
Total Pages: 400
Release: 2020-07-16
Genre: Computers
ISBN: 1492050016


Download Learning Spark Book in PDF, Epub and Kindle

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark. Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, you’ll be able to: Learn Python, SQL, Scala, or Java high-level Structured APIs Understand Spark operations and SQL Engine Inspect, tune, and debug Spark operations with Spark configurations and Spark UI Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka Perform analytics on batch and streaming data using Structured Streaming Build reliable data pipelines with open source Delta Lake and Spark Develop machine learning pipelines with MLlib and productionize models using MLflow


Learning Spark
Language: en
Pages: 400
Authors: Jules S. Damji
Categories: Computers
Type: BOOK - Published: 2020-07-16 - Publisher: O'Reilly Media

GET EBOOK

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you
Learning Spark
Language: en
Pages: 390
Authors: Jules S. Damji
Categories: Computers
Type: BOOK - Published: 2020-07-16 - Publisher: "O'Reilly Media, Inc."

GET EBOOK

Data is bigger, arrives faster, and comes in a variety of formatsâ??and it all needs to be processed at scale for analytics or machine learning. But how can yo
Spark the Brain, Ignite the Pen (SECOND EDITION)
Language: en
Pages: 241
Authors: Samuel Totten
Categories: Education
Type: BOOK - Published: 2009-04-01 - Publisher: IAP

GET EBOOK

A NEW emphasis IN THIS edition of Spark the Brain, Ignite the Pen is writing to learn in the content areas. This edition of the work first published in 2006 inc
Stream Processing with Apache Spark
Language: en
Pages: 396
Authors: Gerard Maas
Categories: Computers
Type: BOOK - Published: 2019-06-05 - Publisher: "O'Reilly Media, Inc."

GET EBOOK

Before you can build analytics tools to gain quick insights, you first need to know how to process data in real time. With this practical guide, developers fami
Cost-Effective Data Pipelines
Language: en
Pages: 283
Authors: Sev Leonard
Categories: Computers
Type: BOOK - Published: 2023-07-13 - Publisher: "O'Reilly Media, Inc."

GET EBOOK

The low cost of getting started with cloud services can easily evolve into a significant expense down the road. That's challenging for teams developing data pip