Tag

apache spark

2 posts collected under apache spark.

data analytics

Mastering Distributed Data: My Honest Experience Building Pipelines with Java

A data analyst's practical guide to learning Apache Spark with Java. Covering the Dataset API, ETL pipelines, performance tuning, and distributed computing.

data analytics

Mastering Big Data Scale: A Guide to Databricks and Apache Spark for Analysts

Learn Databricks and Apache Spark fundamentals. Michael Park shares insights on Lakehouse Architecture, Spark SQL, and optimizing big data ETL pipelines.