Contents
Contents
Mental model
Abstraction
The practice of hiding complexity behind a simpler interface, enabling reasoning
I send a newsletter every week — free, no spam, unsubscribe anytime.
— Auer, Cesa-Bianchi & Fischer, Finite-time Analysis of the Multiarmed Bandit Problem (2002)"UCB1 selects the arm that maximizes the sum of the empirical mean and an upper confidence bound." The bound is chosen so that the true mean lies below it with high probability; the policy is optimistic in the face of uncertainty.
Upper Confidence Bound applied the Explore-exploit Tradeoff mental model
Upper Confidence Bound applied the Algorithms mental model
Upper Confidence Bound applied the The Gittens Index mental model
Upper Confidence Bound applied the Uncertainty mental model
Upper Confidence Bound applied the Upper Confidence Bound mental model
Upper Confidence Bound applied the Signal vs Noise mental model