Last updated: 2021-07-02 23:44:57
Cover
Copyright Information
Credits
About the Author
About the Reviewers
www.PacktPub.com
Customer Feedback
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
Data Science Using Java
Data science
Machine learning
Supervised learning
Unsupervised learning
Clustering
Dimensionality reduction
Natural Language Processing
Data science process models
CRISP-DM
A running example
Data science in Java
Data science libraries
Data processing libraries
Math and stats libraries
Machine learning and data mining libraries
Text processing
Summary
Data Processing Toolbox
Standard Java library
Collections
Input/Output
Reading input data
Writing output data
Streaming API
Extensions to the standard library
Apache Commons
Commons Lang
Commons IO
Commons Collections
Other commons modules
Google Guava
AOL Cyclops React
Accessing data
Text data and CSV
Web and HTML
JSON
Databases
DataFrames
Search engine - preparing data
Exploratory Data Analysis
Exploratory data analysis in Java
Search engine datasets
Apache Commons Math
Joinery
Interactive Exploratory Data Analysis in Java
JVM languages
Interactive Java
Joinery shell
Supervised Learning - Classification and Regression
Classification
Binary classification models
Smile
JSAT
LIBSVM and LIBLINEAR
Encog
Evaluation
Accuracy
Precision, recall, and F1
ROC and AU ROC (AUC)
Result validation
K-fold cross-validation
Training, validation, and testing
Case study - page prediction
Regression
Machine learning libraries for regression
Other libraries
MSE
MAE
Case study - hardware performance
Unsupervised Learning - Clustering and Dimensionality Reduction
Unsupervised dimensionality reduction
Principal Component Analysis
Truncated SVD