What you will learn
- Differentiate between the four main categories of NoSQL repositories and work hands-on with MongoDB, Cassandra and IBM Cloudant.
- Apply your knowledge of the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools, including Hadoop, HDFS, Hive and HBase.
- Describe parallel programming using Resilient Distributed Datasets (RDDs), DataFrames and SparkSQL. Understand how Catalyst and Tungsten benefit Spark programmer and see how ETL work using DataFrames.
- Acquire real-world data engineering and machine learning skills using Spark Structured Streaming, DataFrames, GraphFrames, Spark ML, Regression, Classification, and clustering, including the k-means algorithm and ETL using Spark.
- Gain hands-on experience using SparkSQL, Apache Spark on IBM Cloud.
- Learn about scaling out using the IBM Spark Environment in Watson Studio, running Spark on Kubernetes, setting Spark configurations, and performing monitoring and performance tuning.
Program Overview
Expert instruction
3 skill-building courses
Self-paced
Progress at your own speed
4 months
2 - 3 hours per week
Discounted price: $222.30
Pre-discounted price: $247USD
For the full program experience
Courses in this program
IBM's NoSQL, Big Data and Spark Fundamentals Professional Certificate
- NoSQL Database Basics
- Big Data, Hadoop, and Spark Basics
- Apache Spark for Data Engineering and Machine Learning
- Job Outlook
Meet your instructors from IBM
See instructor bios
Experts from IBM committed to teaching online learning
Enrolling Now
Discounted price: $222.30
Pre-discounted price: $247USD
3 courses in 4 months
Get started in computer science
Browse other computer science coursesWhether you are looking to accelerate your career, earn a degree, or learn something for personal reasons, edX has the courses for you.