Big Data, Hadoop and Spark Essentials
Number of badges issued: 7
This credential earner is able to describe Big Data, its impact, processing methods and tools, and use cases. They understand the Hadoop architecture, ecosystem, practices, and applications, including Distributed File System (HDFS), HBase, Spark, and MapReduce. The earner can describe Spark programming basics, including parallel programming basics, for DataFrames, data sets, and SparkSQL. They know how Spark uses RDDs, creates data sets, and uses Catalyst and Tungsten to optimize SparkSQL.