To truly understand machine learning, a strong grasp of probability theory and statistics is essential because machine learning is an elegant combination of statistics and algorithms. In Section I: Linear Algebra, we intentionally avoided applications related to statistics, focusing instead on foundational concepts. Now, it's time to build upon the probabilistic basis of machine learning. This section introduces the essential concepts of probability, providing the tools and insights necessary to understand and apply machine learning techniques. At its core, statistics involves inferring unknown parameters from outcomes. This process is the inverse of probability theory. Two main approaches dominate statistical inference: frequentist statistics, which treats parameters as fixed and data as random, and in contrast, Bayesian statistics, which treats data as fixed and parameters as random. In particular, Bayesian statistics forms the foundation of many machine learning algorithms.