Delves into the fundamental limits of gradient-based learning on neural networks, covering topics such as binomial theorem, exponential series, and moment-generating functions.
Covers Likelihood Ratio Tests, their optimality, and extensions in hypothesis testing, including Wilks' Theorem and the relationship with Confidence Intervals.