Statistics And Data Analysis
In this course, we study some basic probility and sampling theories which are commonly used in machine learning.
This GitBook notes are maintained by zealscott.
Syllabus
| Lecture | Key | Notes | Reading Material |
|---|---|---|---|
| 1. Statistics review | - Random Variable - pdf/cdf/pmf - Joint distribution - Expectation/covariance - Conditional expectation |
Probability theory Review | - Chernoff bound |
| 2. common distributions | - Discrete - Continuous |
Useful distribution Conjugate Prior Normal - Gamma Conjugate |
- expoential family |
| 3. sample and limit theory | - sampling mean and variance - order statistics distribution - weak and strong law of large numbers - central limit theorem |
sample and limit theory | |
| 4. T and F distribution | - distribution - distribution - distribution |
T and F distribution | |
| 5. Sampling | - Sample Random sampling - Confident interval - MCMC - Gibbs Sampling |
Survey Sampling | - 拒绝采样 - 别名采样 - MCMC |
| 6. Estimation of Parameters&Fitting of Probability Distribution | - MLE(parameter) - MLE for exponential family - Conjugate family (Bayes) - EM algorithm |
LDA思考与总结 | - 文本主题模型之LDA(一) LDA基础 |
| 7. Testing Hypotheses and assessing Goodness of fit | - Neyman-Perason lemma - Likelihood Ratio Tests |
||
| 8. Testing and Summarizing | Testing and Summarizing) | ||
| 9. The analysis of Variance | The Analysis Of Variance | ||
| 10. notes for exam | Notes For Exam |