Mining of Massive Data Sets by Jure Leskovec; Anand Rajaraman; Jeffrey David UllmanCall Number: E-Book. Read online.
ISBN: 9781108476348
Publication Date: 2020
Essential reading for students and practitioners alike. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the MapReduce framework, an important tool for parallelizing algorithms automatically. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets, and clustering. The third edition includes new and extended coverage on decision trees, deep learning, and mining social-network graphs.