Spark in Action, Second Edition

(Author) Jean-Georges Perrin
Format: Paperback
£47.99 Price: £45.59 (5% off)
Generally dispatched in 1 to 2 days

Summary The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, Second Edition, you’ll learn to take advantage of Spark’s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark’s powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop. Foreword by Rob Thomas. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the technology Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds 100 times faster than Hadoop systems. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem. About the book Spark in Action, Second Edition, teaches you to create end-to-end analytics applications. In this entirely new book, you’ll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you’ll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms. What's inside Writing Spark applications in Java Spark application architecture Ingestion through files, databases, streaming, and Elasticsearch Querying distributed datasets with Spark SQL About the reader This book does not assume previous experience with Spark, Scala, or Hadoop. About the author Jean-Georges Perrin is an experienced data and software architect. He is France’s first IBM Champion and has been honored for 12 consecutive years. Table of Contents PART 1 - THE THEORY CRIPPLED BY AWESOME EXAMPLES 1 So, what is Spark, anyway? 2 Architecture and flow 3 The majestic role of the dataframe 4 Fundamentally lazy 5 Building a simple app for deployment 6 Deploying your simple app PART 2 - INGESTION 7 Ingestion from files 8 Ingestion from databases 9 Advanced ingestion: finding data sources and building your own 10 Ingestion through structured streaming PART 3 - TRANSFORMING YOUR DATA 11 Working with SQL 12 Transforming your data 13 Transforming entire documents 14 Extending transformations with user-defined functions 15 Aggregating your data PART 4 - GOING FURTHER 16 Cache and checkpoint: Enhancing Spark’s performances 17 Exporting data and building full data pipelines 18 Exploring deployment

Information
Publisher:
Manning
Format:
Paperback
Number of pages:
574
Language:
en
ISBN:
9781617295522
Publish year:
2020
Publish date:
June 22, 2020

Jean-Georges Perrin

Jean-Georges Perrin is a French author known for his unique blend of surrealism and existentialism in his works. He is best known for his novel "The Book of Disquiet," which explores themes of alienation, isolation, and the search for meaning in a chaotic world. Perrin's writing style is characterized by its poetic language, introspective tone, and philosophical depth. His contributions to literature have had a significant impact on the genre of existential fiction, inspiring countless writers to delve into the complexities of human existence. Through his thought-provoking narratives, Perrin continues to challenge readers to question the nature of reality and their place within it.

Reviews

Leave a review

Please login to leave a review.

Be the first to review this product

Other related

Love Machines

Love Machines

How Artificial Intelligence is Transforming Our Relationships

James Muldoon
Paperback
Published: 2026
Agentic AI For Dummies

Agentic AI For Dummies

Pam Baker
Paperback
Published: 2026
ChatGPT for Students

ChatGPT for Students

Frank Blackwell
Fold-outboo
Published: 2026
Nexus

Nexus

A Brief History of Information Networks from the Stone Age to AI

Yuval Noah Harari
Paperback
Published: 2025
The Immortalists

The Immortalists

The Death of Death and the Race for Eternal Life

Aleks Krotoski, Krotoski Aleks
Hardcover
Published: 2025
If Anyone Builds It, Everyone Dies

If Anyone Builds It, Everyone Dies

The Case Against Superintelligent AI

Eliezer Yudkowsky
Hardcover
Published: 2025