Course overview

Machine Learning: Data Preparation v10.x


Online Certification Course         

Who is this course for?

 The Data Preparation 10.x course introduces the in-database SQL functions used to adjust and clear up anomalies in your data set before building machine learning models. 

The target audience includes:
  • Data Scientists
  • Data Analysts

Course prerequisites

Although there are no formal prerequisites for this course, a basic understanding and appreciation of the following would be beneficial:
  • Relational Database Management Systems (RDBMS)
  • Structured Query Language (SQL)
  • Linux

Course content

Although this  course is designed as an introduction to the built-in data preparation functions, please do not underestimate the amount of material presented:
  • 11 Modules
  • 52 Videos
  • 12 Hands-On Lab Exercises
  • 56 Knowledge Check questions
As such, expect to need to put aside at least hours to complete this course.

Knowledge Checks
At the end of each module, where that module contains Hands-On Lab Exercises, there is a Knowledge Check covering the material presented in that module and the exercise(s). 
You will be required to score at least 80% to pass a Knowledge Check before progressing to the next module.

Certification Exam
Once you have completed all the modules and Knowledge Checks, you are welcome to apply to take the Data Preparation 10.x Exam. This exam comprises 50 questions.
To pass this exam, you must score at least 80%.

Data Preparation 10.x Certification
On successful completion of this course and its exam, you will be awarded the Data Preparation 10.x Certificate. 
Following the successful completion of the exam, you will also be receiving your digital credentials (Badge) via Credly Acclaim.

Course delivery and expectations

The course is delivered as a series of on-line, self-paced modules.
Each module is broken into a number of bite-sized sections, presented as short (<5 minute) videos to introduce the subject.

Most of the modules have hands-on lab exercises and a Knowledge Check. The lab environment is required to complete and pass the Knowledge Checks.
To undertake the exercises, Vertica Academy students will be provided with a 3-node Vertica cluster, already configured and ready to go. 

Prior to commencing the first exercise, you will need to request access to your Vertica cluster - instructions on how to do this are provided within the course content.

Although we have students successfully completing the course in less than one day, appreciating that some may require longer to complete the course, we provide the Vertica cluster for up to 7 days.

And finally...
The Vertica Academy Team wish you all the very best of luck with this course, and hope you thoroughly enjoy the experience working with Vertica.
Should you have any comments, questions, feedback (good or bad), we would really like to hear from you!
Please feel free to reach out to us via the contact pages on the Vertica Academy or directly via email:

YOur teachers

Mark Whalley

Manager, vertica education

Drea Brandford

Education portfolio lead

Dina Love

vertica academy instructor