Apache Mahout Clustering Designs
Product Information
ISBN-13: 9781783284436
Publisher: Packt Pub Ltd
Author: Ashish Gupta
Publication date: 2015/09/30
Binding/Pages: Paperback/130 pages
Dimensions: 23.5 cm × 19.1 cm (height × width)
Product Description
Explore clustering algorithms used with Apache Mahout
About This Book
- Use Mahout for clustering datasets and gain useful insights
- Explore the different clustering algorithms used in day-to-day work
- A practical guide to creating and evaluating your own clustering models using real-world datasets
Who This Book Is For
This book is for developers who want to try out clustering on large datasets using Mahout. It will also be useful for readers who have no background in Mahout but know basic programming and are familiar with the basics of machine learning and clustering. Experience with clustering techniques in another tool will be helpful.
What You Will Learn
- Explore clustering algorithms and cluster evaluation techniques
- Learn the different types of clustering and distance-measuring techniques
- Perform clustering on your data using K-Means clustering
- Discover how canopy clustering is used as a preprocessing step for K-Means
- Use the Fuzzy K-Means algorithm in Apache Mahout
- Implement Streaming K-Means clustering in Mahout
- Learn Mahout's Spectral K-Means clustering implementation
In Detail
As more organizations discover the uses of big data analytics, interest in platforms that provide storage, computation, and analytic capabilities has grown. Apache Mahout caters to this need and paves the way for implementing complex machine learning algorithms to better analyse your data and gain useful insights from it.
Starting with an introduction to clustering algorithms, this book provides insight into Apache Mahout and the different algorithms it uses for clustering data. It gives a general introduction to algorithms such as K-Means, Fuzzy K-Means, and StreamingKMeans, and shows how to use Mahout to cluster your data with a particular algorithm. You will study the different types of clustering and learn how to use Apache Mahout with real-world datasets to implement and evaluate your clusters.
This book also discusses cluster improvement and visualization using Mahout APIs, and explores model-based clustering and topic modelling using the Dirichlet process. Finally, you will learn how to build and deploy a model for production use.
Style and approach
This book is a hands-on guide with examples using real-world datasets. Each chapter begins by explaining the algorithm in detail and then shows how to use Mahout for that algorithm with example datasets.

