This document discusses MLlib, Spark's machine learning library. It provides an overview of MLlib, describing what MLlib is, the types of algorithms it includes for classification, regression, collaborative filtering, clustering and decomposition. It also discusses concepts relevant to MLlib like vectors, matrices, labeled points and statistics. Finally, it describes hands-on exercises for movie recommendation using collaborative filtering and clustering on the MovieLens dataset.