Fundamentals of Scalable Data Science

Number of badges issued: 3609

This badge earner has proven a deep understanding of massive parallel data processing on ApacheSpark. They have mastered low-level functional programming using python on the Resilient Distributed Dataset (RDD) API and mastered relational data processing using Apache SparkSQL & the DataFrame API. Earners understand how data processing & machine learning can be parallelized using scale-out clusters, & can compute statistical measures, integrate & transform data, & create advanced visualizations.

More Details

Fundamentals of Scalable Data Science

What is needed to earn this credential or badge

Course

Complete the Coursera course "Fundamentals of Scalable Data Science" including all hands-on labs and assignments.

Assessment

Pass the Coursera course assessment criteria.

Fundamentals of Scalable Data Science

Alignment to standards

no alignment to standards

Fundamentals of Scalable Data Science

Recommended next steps

No next steps