Bayesian Variable Selection In High Dimensional Applications

Download Bayesian Variable Selection In High Dimensional Applications full books in PDF, epub, and Kindle. Read online Bayesian Variable Selection In High Dimensional Applications ebook anywhere anytime directly on your device. Fast Download speed and no annoying ads. We cannot guarantee that every ebooks is available!

Handbook of Bayesian Variable Selection

Author : Mahlet G. Tadesse
Publisher : CRC Press
ISBN 13 : 1000510255
Total Pages : 762 pages
Book Rating : 4.0/5 (5 download)

DOWNLOAD NOW!

Book Synopsis Handbook of Bayesian Variable Selection by : Mahlet G. Tadesse

Download or read book Handbook of Bayesian Variable Selection written by Mahlet G. Tadesse and published by CRC Press. This book was released on 2021-12-24 with total page 762 pages. Available in PDF, EPUB and Kindle. Book excerpt: Bayesian variable selection has experienced substantial developments over the past 30 years with the proliferation of large data sets. Identifying relevant variables to include in a model allows simpler interpretation, avoids overfitting and multicollinearity, and can provide insights into the mechanisms underlying an observed phenomenon. Variable selection is especially important when the number of potential predictors is substantially larger than the sample size and sparsity can reasonably be assumed. The Handbook of Bayesian Variable Selection provides a comprehensive review of theoretical, methodological and computational aspects of Bayesian methods for variable selection. The topics covered include spike-and-slab priors, continuous shrinkage priors, Bayes factors, Bayesian model averaging, partitioning methods, as well as variable selection in decision trees and edge selection in graphical models. The handbook targets graduate students and established researchers who seek to understand the latest developments in the field. It also provides a valuable reference for all interested in applying existing methods and/or pursuing methodological extensions. Features: Provides a comprehensive review of methods and applications of Bayesian variable selection. Divided into four parts: Spike-and-Slab Priors; Continuous Shrinkage Priors; Extensions to various Modeling; Other Approaches to Bayesian Variable Selection. Covers theoretical and methodological aspects, as well as worked out examples with R code provided in the online supplement. Includes contributions by experts in the field. Supported by a website with code, data, and other supplementary material

Handbook of Bayesian Variable Selection

Author : Mahlet G. Tadesse
Publisher : CRC Press
ISBN 13 : 1000510204
Total Pages : 491 pages
Book Rating : 4.0/5 (5 download)

DOWNLOAD NOW!

Book Synopsis Handbook of Bayesian Variable Selection by : Mahlet G. Tadesse

Download or read book Handbook of Bayesian Variable Selection written by Mahlet G. Tadesse and published by CRC Press. This book was released on 2021-12-24 with total page 491 pages. Available in PDF, EPUB and Kindle. Book excerpt: Bayesian variable selection has experienced substantial developments over the past 30 years with the proliferation of large data sets. Identifying relevant variables to include in a model allows simpler interpretation, avoids overfitting and multicollinearity, and can provide insights into the mechanisms underlying an observed phenomenon. Variable selection is especially important when the number of potential predictors is substantially larger than the sample size and sparsity can reasonably be assumed. The Handbook of Bayesian Variable Selection provides a comprehensive review of theoretical, methodological and computational aspects of Bayesian methods for variable selection. The topics covered include spike-and-slab priors, continuous shrinkage priors, Bayes factors, Bayesian model averaging, partitioning methods, as well as variable selection in decision trees and edge selection in graphical models. The handbook targets graduate students and established researchers who seek to understand the latest developments in the field. It also provides a valuable reference for all interested in applying existing methods and/or pursuing methodological extensions. Features: Provides a comprehensive review of methods and applications of Bayesian variable selection. Divided into four parts: Spike-and-Slab Priors; Continuous Shrinkage Priors; Extensions to various Modeling; Other Approaches to Bayesian Variable Selection. Covers theoretical and methodological aspects, as well as worked out examples with R code provided in the online supplement. Includes contributions by experts in the field. Supported by a website with code, data, and other supplementary material

Advanced Mean Field Methods

Author : Manfred Opper
Publisher : MIT Press
ISBN 13 : 9780262150545
Total Pages : 300 pages
Book Rating : 4.1/5 (55 download)

DOWNLOAD NOW!

Book Synopsis Advanced Mean Field Methods by : Manfred Opper

Download or read book Advanced Mean Field Methods written by Manfred Opper and published by MIT Press. This book was released on 2001 with total page 300 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book covers the theoretical foundations of advanced mean field methods, explores the relation between the different approaches, examines the quality of the approximation obtained, and demonstrates their application to various areas of probabilistic modeling. A major problem in modern probabilistic modeling is the huge computational complexity involved in typical calculations with multivariate probability distributions when the number of random variables is large. Because exact computations are infeasible in such cases and Monte Carlo sampling techniques may reach their limits, there is a need for methods that allow for efficient approximate computations. One of the simplest approximations is based on the mean field method, which has a long history in statistical physics. The method is widely used, particularly in the growing field of graphical models. Researchers from disciplines such as statistical physics, computer science, and mathematical statistics are studying ways to improve this and related methods and are exploring novel application areas. Leading approaches include the variational approach, which goes beyond factorizable distributions to achieve systematic improvements; the TAP (Thouless-Anderson-Palmer) approach, which incorporates correlations by including effective reaction terms in the mean field theory; and the more general methods of graphical models. Bringing together ideas and techniques from these diverse disciplines, this book covers the theoretical foundations of advanced mean field methods, explores the relation between the different approaches, examines the quality of the approximation obtained, and demonstrates their application to various areas of probabilistic modeling.

Statistical Learning with Sparsity

Author : Trevor Hastie
Publisher : CRC Press
ISBN 13 : 1498712177
Total Pages : 354 pages
Book Rating : 4.4/5 (987 download)

DOWNLOAD NOW!

Book Synopsis Statistical Learning with Sparsity by : Trevor Hastie

Download or read book Statistical Learning with Sparsity written by Trevor Hastie and published by CRC Press. This book was released on 2015-05-07 with total page 354 pages. Available in PDF, EPUB and Kindle. Book excerpt: Discover New Methods for Dealing with High-Dimensional DataA sparse statistical model has only a small number of nonzero parameters or weights; therefore, it is much easier to estimate and interpret than a dense model. Statistical Learning with Sparsity: The Lasso and Generalizations presents methods that exploit sparsity to help recover the underl

Lifetime Data: Models in Reliability and Survival Analysis

Author : Nicholas P. Jewell
Publisher : Springer Science & Business Media
ISBN 13 : 1475756542
Total Pages : 392 pages
Book Rating : 4.4/5 (757 download)

DOWNLOAD NOW!

Book Synopsis Lifetime Data: Models in Reliability and Survival Analysis by : Nicholas P. Jewell

Download or read book Lifetime Data: Models in Reliability and Survival Analysis written by Nicholas P. Jewell and published by Springer Science & Business Media. This book was released on 2013-04-17 with total page 392 pages. Available in PDF, EPUB and Kindle. Book excerpt: Statistical models and methods for lifetime and other time-to-event data are widely used in many fields, including medicine, the environmental sciences, actuarial science, engineering, economics, management, and the social sciences. For example, closely related statistical methods have been applied to the study of the incubation period of diseases such as AIDS, the remission time of cancers, life tables, the time-to-failure of engineering systems, employment duration, and the length of marriages. This volume contains a selection of papers based on the 1994 International Research Conference on Lifetime Data Models in Reliability and Survival Analysis, held at Harvard University. The conference brought together a varied group of researchers and practitioners to advance and promote statistical science in the many fields that deal with lifetime and other time-to-event-data. The volume illustrates the depth and diversity of the field. A few of the authors have published their conference presentations in the new journal Lifetime Data Analysis (Kluwer Academic Publishers).

High-dimensional Data Analysis

Author : Tony Cai;Xiaotong Shen
Publisher :
ISBN 13 : 9787894236326
Total Pages : 318 pages
Book Rating : 4.2/5 (363 download)

DOWNLOAD NOW!

Book Synopsis High-dimensional Data Analysis by : Tony Cai;Xiaotong Shen

Download or read book High-dimensional Data Analysis written by Tony Cai;Xiaotong Shen and published by . This book was released on with total page 318 pages. Available in PDF, EPUB and Kindle. Book excerpt: Over the last few years, significant developments have been taking place in highdimensional data analysis, driven primarily by a wide range of applications in many fields such as genomics and signal processing. In particular, substantial advances have been made in the areas of feature selection, covariance estimation, classification and regression. This book intends to examine important issues arising from highdimensional data analysis to explore key ideas for statistical inference and prediction. It is structured around topics on multiple hypothesis testing, feature selection, regression, cla.

Flexible Imputation of Missing Data, Second Edition

Author : Stef van Buuren
Publisher : CRC Press
ISBN 13 : 0429960352
Total Pages : 444 pages
Book Rating : 4.4/5 (299 download)

DOWNLOAD NOW!

Book Synopsis Flexible Imputation of Missing Data, Second Edition by : Stef van Buuren

Download or read book Flexible Imputation of Missing Data, Second Edition written by Stef van Buuren and published by CRC Press. This book was released on 2018-07-17 with total page 444 pages. Available in PDF, EPUB and Kindle. Book excerpt: Missing data pose challenges to real-life data analysis. Simple ad-hoc fixes, like deletion or mean imputation, only work under highly restrictive conditions, which are often not met in practice. Multiple imputation replaces each missing value by multiple plausible values. The variability between these replacements reflects our ignorance of the true (but missing) value. Each of the completed data set is then analyzed by standard methods, and the results are pooled to obtain unbiased estimates with correct confidence intervals. Multiple imputation is a general approach that also inspires novel solutions to old problems by reformulating the task at hand as a missing-data problem. This is the second edition of a popular book on multiple imputation, focused on explaining the application of methods through detailed worked examples using the MICE package as developed by the author. This new edition incorporates the recent developments in this fast-moving field. This class-tested book avoids mathematical and technical details as much as possible: formulas are accompanied by verbal statements that explain the formula in accessible terms. The book sharpens the reader’s intuition on how to think about missing data, and provides all the tools needed to execute a well-grounded quantitative analysis in the presence of missing data.

Contemporary Multivariate Analysis and Design of Experiments

Author : Kaitai Fang
Publisher : World Scientific
ISBN 13 : 9812567763
Total Pages : 470 pages
Book Rating : 4.8/5 (125 download)

DOWNLOAD NOW!

Book Synopsis Contemporary Multivariate Analysis and Design of Experiments by : Kaitai Fang

Download or read book Contemporary Multivariate Analysis and Design of Experiments written by Kaitai Fang and published by World Scientific. This book was released on 2005 with total page 470 pages. Available in PDF, EPUB and Kindle. Book excerpt: Index. Subject index -- Author index

Case Studies in Applied Bayesian Data Science

Author : Kerrie L. Mengersen
Publisher : Springer Nature
ISBN 13 : 3030425533
Total Pages : 415 pages
Book Rating : 4.0/5 (34 download)

DOWNLOAD NOW!

Book Synopsis Case Studies in Applied Bayesian Data Science by : Kerrie L. Mengersen

Download or read book Case Studies in Applied Bayesian Data Science written by Kerrie L. Mengersen and published by Springer Nature. This book was released on 2020-05-28 with total page 415 pages. Available in PDF, EPUB and Kindle. Book excerpt: Presenting a range of substantive applied problems within Bayesian Statistics along with their Bayesian solutions, this book arises from a research program at CIRM in France in the second semester of 2018, which supported Kerrie Mengersen as a visiting Jean-Morlet Chair and Pierre Pudlo as the local Research Professor. The field of Bayesian statistics has exploded over the past thirty years and is now an established field of research in mathematical statistics and computer science, a key component of data science, and an underpinning methodology in many domains of science, business and social science. Moreover, while remaining naturally entwined, the three arms of Bayesian statistics, namely modelling, computation and inference, have grown into independent research fields. While the research arms of Bayesian statistics continue to grow in many directions, they are harnessed when attention turns to solving substantive applied problems. Each such problem set has its own challenges and hence draws from the suite of research a bespoke solution. The book will be useful for both theoretical and applied statisticians, as well as practitioners, to inspect these solutions in the context of the problems, in order to draw further understanding, awareness and inspiration.

Bayesian Mediation Analysis using R

Author : Atanu Bhattacharjee
Publisher : CRC Press
ISBN 13 : 1040009484
Total Pages : 204 pages
Book Rating : 4.0/5 (4 download)

DOWNLOAD NOW!

Book Synopsis Bayesian Mediation Analysis using R by : Atanu Bhattacharjee

Download or read book Bayesian Mediation Analysis using R written by Atanu Bhattacharjee and published by CRC Press. This book was released on 2024-07-04 with total page 204 pages. Available in PDF, EPUB and Kindle. Book excerpt: Delve into the realm of statistical methodology for mediation analysis with a Bayesian perspective in high dimensional data through this comprehensive guide. Focused on various forms of time-to-event data methodologies, this book helps readers master the application of Bayesian mediation analysis using R. Across ten chapters, this book explores concepts of mediation analysis, survival analysis, accelerated failure time modeling, longitudinal data analysis, and competing risk modeling. Each chapter progressively unravels intricate topics, from the foundations of Bayesian approaches to advanced techniques like variable selection, bivariate survival models, and Dirichlet process priors. With practical examples and step-by-step guidance, this book empowers readers to navigate the intricate landscape of high-dimensional data analysis, fostering a deep understanding of its applications and significance in diverse fields.

E-Technologies: Embracing the Internet of Things

Author : Esma Aïmeur
Publisher : Springer
ISBN 13 : 3319590413
Total Pages : 325 pages
Book Rating : 4.3/5 (195 download)

DOWNLOAD NOW!

Book Synopsis E-Technologies: Embracing the Internet of Things by : Esma Aïmeur

Download or read book E-Technologies: Embracing the Internet of Things written by Esma Aïmeur and published by Springer. This book was released on 2017-05-10 with total page 325 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book constitutes the refereed proceedings of the 7th International Conference on E-Technologies, MCETECH 2017, held in Ottawa, ON, Canada, in May 2017. This year’s conference drew special attention to the ever-increasing role of the Internet of Things (IoT); and the contributions span a variety of application domains such as e-Commerce, e-Health, e-Learning, and e-Justice, comprising research from models and architectures, methodology proposals, prototype implementations, and empirical validation of theoretical models. The 19 papers presented were carefully reviewed and selected from 48 submissions. They were organized in topical sections named: pervasive computing and smart applications; security, privacy and trust; process modeling and adaptation; data analytics and machine learning; and e-health and e-commerce.

Principles and Methods for Data Science

Author :
Publisher : Elsevier
ISBN 13 : 0444642129
Total Pages : 498 pages
Book Rating : 4.4/5 (446 download)

DOWNLOAD NOW!

Book Synopsis Principles and Methods for Data Science by :

Download or read book Principles and Methods for Data Science written by and published by Elsevier. This book was released on 2020-05-28 with total page 498 pages. Available in PDF, EPUB and Kindle. Book excerpt: Principles and Methods for Data Science, Volume 43 in the Handbook of Statistics series, highlights new advances in the field, with this updated volume presenting interesting and timely topics, including Competing risks, aims and methods, Data analysis and mining of microbial community dynamics, Support Vector Machines, a robust prediction method with applications in bioinformatics, Bayesian Model Selection for Data with High Dimension, High dimensional statistical inference: theoretical development to data analytics, Big data challenges in genomics, Analysis of microarray gene expression data using information theory and stochastic algorithm, Hybrid Models, Markov Chain Monte Carlo Methods: Theory and Practice, and more. - Provides the authority and expertise of leading contributors from an international board of authors - Presents the latest release in the Handbook of Statistics series - Updated release includes the latest information on Principles and Methods for Data Science

Statistics for High-Dimensional Data

Author : Peter Bühlmann
Publisher : Springer Science & Business Media
ISBN 13 : 364220192X
Total Pages : 568 pages
Book Rating : 4.6/5 (422 download)

DOWNLOAD NOW!

Book Synopsis Statistics for High-Dimensional Data by : Peter Bühlmann

Download or read book Statistics for High-Dimensional Data written by Peter Bühlmann and published by Springer Science & Business Media. This book was released on 2011-06-08 with total page 568 pages. Available in PDF, EPUB and Kindle. Book excerpt: Modern statistics deals with large and complex data sets, and consequently with models containing a large number of parameters. This book presents a detailed account of recently developed approaches, including the Lasso and versions of it for various models, boosting methods, undirected graphical modeling, and procedures controlling false positive selections. A special characteristic of the book is that it contains comprehensive mathematical theory on high-dimensional statistics combined with methodology, algorithms and illustrations with real data examples. This in-depth approach highlights the methods’ great potential and practical applicability in a variety of settings. As such, it is a valuable resource for researchers, graduate students and experts in statistics, applied mathematics and computer science.

Applications

Author : Katharina Morik
Publisher : Walter de Gruyter GmbH & Co KG
ISBN 13 : 3110785986
Total Pages : 478 pages
Book Rating : 4.1/5 (17 download)

DOWNLOAD NOW!

Book Synopsis Applications by : Katharina Morik

Download or read book Applications written by Katharina Morik and published by Walter de Gruyter GmbH & Co KG. This book was released on 2022-12-31 with total page 478 pages. Available in PDF, EPUB and Kindle. Book excerpt: Machine learning is part of Artificial Intelligence since its beginning. Certainly, not learning would only allow the perfect being to show intelligent behavior. All others, be it humans or machines, need to learn in order to enhance their capabilities. In the eighties of the last century, learning from examples and modeling human learning strategies have been investigated in concert. The formal statistical basis of many learning methods has been put forward later on and is still an integral part of machine learning. Neural networks have always been in the toolbox of methods. Integrating all the pre-processing, exploitation of kernel functions, and transformation steps of a machine learning process into the architecture of a deep neural network increased the performance of this model type considerably. Modern machine learning is challenged on the one hand by the amount of data and on the other hand by the demand of real-time inference. This leads to an interest in computing architectures and modern processors. For a long time, the machine learning research could take the von-Neumann architecture for granted. All algorithms were designed for the classical CPU. Issues of implementation on a particular architecture have been ignored. This is no longer possible. The time for independently investigating machine learning and computational architecture is over. Computing architecture has experienced a similarly rampant development from mainframe or personal computers in the last century to now very large compute clusters on the one hand and ubiquitous computing of embedded systems in the Internet of Things on the other hand. Cyber-physical systems’ sensors produce a huge amount of streaming data which need to be stored and analyzed. Their actuators need to react in real-time. This clearly establishes a close connection with machine learning. Cyber-physical systems and systems in the Internet of Things consist of diverse components, heterogeneous both in hard- and software. Modern multi-core systems, graphic processors, memory technologies and hardware-software codesign offer opportunities for better implementations of machine learning models. Machine learning and embedded systems together now form a field of research which tackles leading edge problems in machine learning, algorithm engineering, and embedded systems. Machine learning today needs to make the resource demands of learning and inference meet the resource constraints of used computer architecture and platforms. A large variety of algorithms for the same learning method and, moreover, diverse implementations of an algorithm for particular computing architectures optimize learning with respect to resource efficiency while keeping some guarantees of accuracy. The trade-off between a decreased energy consumption and an increased error rate, to just give an example, needs to be theoretically shown for training a model and the model inference. Pruning and quantization are ways of reducing the resource requirements by either compressing or approximating the model. In addition to memory and energy consumption, timeliness is an important issue, since many embedded systems are integrated into large products that interact with the physical world. If the results are delivered too late, they may have become useless. As a result, real-time guarantees are needed for such systems. To efficiently utilize the available resources, e.g., processing power, memory, and accelerators, with respect to response time, energy consumption, and power dissipation, different scheduling algorithms and resource management strategies need to be developed. This book series addresses machine learning under resource constraints as well as the application of the described methods in various domains of science and engineering. Turning big data into smart data requires many steps of data analysis: methods for extracting and selecting features, filtering and cleaning the data, joining heterogeneous sources, aggregating the data, and learning predictions need to scale up. The algorithms are challenged on the one hand by high-throughput data, gigantic data sets like in astrophysics, on the other hand by high dimensions like in genetic data. Resource constraints are given by the relation between the demands for processing the data and the capacity of the computing machinery. The resources are runtime, memory, communication, and energy. Novel machine learning algorithms are optimized with regard to minimal resource consumption. Moreover, learned predictions are applied to program executions in order to save resources. The three books will have the following subtopics: Volume 1: Machine Learning under Resource Constraints - Fundamentals Volume 2: Machine Learning and Physics under Resource Constraints - Discovery Volume 3: Machine Learning under Resource Constraints - Applications Volume 3 describes how the resource-aware machine learning methods and techniques are used to successfully solve real-world problems. The book provides numerous specific application examples. In the areas of health and medicine, it is demonstrated how machine learning can improve risk modelling, diagnosis, and treatment selection for diseases. Machine learning supported quality control during the manufacturing process in a factory allows to reduce material and energy cost and save testing times is shown by the diverse real-time applications in electronics and steel production as well as milling. Additional application examples show, how machine-learning can make traffic, logistics and smart cities more efficient and sustainable. Finally, mobile communications can benefit substantially from machine learning, for example by uncovering hidden characteristics of the wireless channel.

Statistical Diagnostics for Cancer

Author : Matthias Dehmer
Publisher : John Wiley & Sons
ISBN 13 : 3527665455
Total Pages : 301 pages
Book Rating : 4.5/5 (276 download)

DOWNLOAD NOW!

Book Synopsis Statistical Diagnostics for Cancer by : Matthias Dehmer

Download or read book Statistical Diagnostics for Cancer written by Matthias Dehmer and published by John Wiley & Sons. This book was released on 2012-11-28 with total page 301 pages. Available in PDF, EPUB and Kindle. Book excerpt: This ready reference discusses different methods for statistically analyzing and validating data created with high-throughput methods. As opposed to other titles, this book focusses on systems approaches, meaning that no single gene or protein forms the basis of the analysis but rather a more or less complex biological network. From a methodological point of view, the well balanced contributions describe a variety of modern supervised and unsupervised statistical methods applied to various large-scale datasets from genomics and genetics experiments. Furthermore, since the availability of sufficient computer power in recent years has shifted attention from parametric to nonparametric methods, the methods presented here make use of such computer-intensive approaches as Bootstrap, Markov Chain Monte Carlo or general resampling methods. Finally, due to the large amount of information available in public databases, a chapter on Bayesian methods is included, which also provides a systematic means to integrate this information. A welcome guide for mathematicians and the medical and basic research communities.

Statistical Analysis for High-Dimensional Data

Author : Arnoldo Frigessi
Publisher : Springer
ISBN 13 : 3319270990
Total Pages : 313 pages
Book Rating : 4.3/5 (192 download)

DOWNLOAD NOW!

Book Synopsis Statistical Analysis for High-Dimensional Data by : Arnoldo Frigessi

Download or read book Statistical Analysis for High-Dimensional Data written by Arnoldo Frigessi and published by Springer. This book was released on 2016-02-16 with total page 313 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book features research contributions from The Abel Symposium on Statistical Analysis for High Dimensional Data, held in Nyvågar, Lofoten, Norway, in May 2014. The focus of the symposium was on statistical and machine learning methodologies specifically developed for inference in “big data” situations, with particular reference to genomic applications. The contributors, who are among the most prominent researchers on the theory of statistics for high dimensional inference, present new theories and methods, as well as challenging applications and computational solutions. Specific themes include, among others, variable selection and screening, penalised regression, sparsity, thresholding, low dimensional structures, computational challenges, non-convex situations, learning graphical models, sparse covariance and precision matrices, semi- and non-parametric formulations, multiple testing, classification, factor models, clustering, and preselection. Highlighting cutting-edge research and casting light on future research directions, the contributions will benefit graduate students and researchers in computational biology, statistics and the machine learning community.

Model-Based Clustering and Classification for Data Science

Author : Charles Bouveyron
Publisher : Cambridge University Press
ISBN 13 : 1108640591
Total Pages : 447 pages
Book Rating : 4.1/5 (86 download)

DOWNLOAD NOW!

Book Synopsis Model-Based Clustering and Classification for Data Science by : Charles Bouveyron

Download or read book Model-Based Clustering and Classification for Data Science written by Charles Bouveyron and published by Cambridge University Press. This book was released on 2019-07-25 with total page 447 pages. Available in PDF, EPUB and Kindle. Book excerpt: Cluster analysis finds groups in data automatically. Most methods have been heuristic and leave open such central questions as: how many clusters are there? Which method should I use? How should I handle outliers? Classification assigns new observations to groups given previously classified observations, and also has open questions about parameter tuning, robustness and uncertainty assessment. This book frames cluster analysis and classification in terms of statistical models, thus yielding principled estimation, testing and prediction methods, and sound answers to the central questions. It builds the basic ideas in an accessible but rigorous way, with extensive data examples and R code; describes modern approaches to high-dimensional data and networks; and explains such recent advances as Bayesian regularization, non-Gaussian model-based clustering, cluster merging, variable selection, semi-supervised and robust classification, clustering of functional data, text and images, and co-clustering. Written for advanced undergraduates in data science, as well as researchers and practitioners, it assumes basic knowledge of multivariate calculus, linear algebra, probability and statistics.