Result: Large Scale Machine Learning with Spark
copyrighted
https://univ.scholarvox.com/book/88843447
https://static2.cyberlibris.com/books_upload/136pix/9781785883712.jpg
1268589652
From OAIsterĀ®, provided by the OCLC Cooperative.
Further Information
Discover everything you need to build robust machine learning applications with Spark 2.0About This BookGet the most up-to-date book on the market that focuses on design, engineering, and scalable solutions in machine learning with Spark 2.0.0Use Spark's machine learning library in a big data environmentYou will learn how to develop high-value applications at scale with ease and a develop a personalized designWho This Book Is ForThis book is for data science engineers and scientists who work with large and complex data sets. You should be familiar with the basics of machine learning concepts, statistics, and computational mathematics. Knowledge of Scala and Java is advisable.What You Will LearnGet solid theoretical understandings of ML algorithmsConfigure Spark on cluster and cloud infrastructure to develop applications using Scala, Java, Python, and RScale up ML applications on large cluster or cloud infrastructuresUse Spark ML and MLlib to develop ML pipelines with recommendation system, classification, regression, clustering, sentiment analysis, and dimensionality reductionHandle large texts for developing ML applications with strong focus on feature engineeringUse Spark Streaming to develop ML applications for real-time streamingTune ML models with cross-validation, hyperparameters tuning and train splitEnhance ML models to make them adaptable for new data in dynamic and incremental environmentsIn DetailData processing, implementing related algorithms, tuning, scaling up and finally deploying are some crucial steps in the process of optimising any application.Spark is capable of handling large-scale batch and streaming data to figure out when to cache data in memory and processing them up to 100 times faster than Hadoop-based MapReduce. This means predictive analytics can be applied to streaming and batch to develop complete machine learning (ML) applications a lot quicker, making Spark an ideal candidate for large data-intensive applications.This book focuses o