更新时间:2021-07-02 18:22:09
封面
书名页
Java Data Analysis
Credits
About the Author
About the Reviewers
www.PacktPub.com
eBooks discount offers and more
Customer Feedback
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Chapter 1. Introduction to Data Analysis
Origins of data analysis
The scientific method
Actuarial science
Calculated by steam
A spectacular example
Herman Hollerith
ENIAC
VisiCalc
Data information and knowledge
Why Java?
Java Integrated Development Environments
Summary
Chapter 2. Data Preprocessing
Data types
Variables
Data points and datasets
Relational database tables
Hash tables
File formats
Generating test datasets
Chapter 3. Data Visualization
Tables and graphs
Time series
Java implementation
Moving average
Data ranking
Frequency distributions
The normal distribution
The exponential distribution
Java example
Chapter 4. Statistics
Descriptive statistics
Random sampling
Random variables
Probability distributions
Cumulative distributions
The binomial distribution
Multivariate distributions
Conditional probability
The independence of probabilistic events
Contingency tables
Bayes' theorem
Covariance and correlation
The standard normal distribution
The central limit theorem
Confidence intervals
Hypothesis testing
Chapter 5. Relational Databases
The relation data model
Relational databases
Foreign keys
Relational database design
Chapter 6. Regression Analysis
Linear regression
Polynomial regression
Chapter 7. Classification Analysis
Decision trees
Bayesian classifiers
Logistic regression
Chapter 8. Cluster Analysis
Measuring distances
The curse of dimensionality
Hierarchical clustering
Chapter 9. Recommender Systems
Utility matrices
Similarity measures
Cosine similarity
A simple recommender system
Amazon's item-to-item collaborative filtering recommender
Implementing user ratings
Large sparse matrices
Using random access files
The Netflix prize
Chapter 10. NoSQL Databases
The Map data structure