![]() |
Nicolò Cesa-Bianchi and Gábor Lugosi
Cambridge University Press, 2006 ISBN 0521841089 Published March 2006, 406 pages ...beware of mathematicians, and all those who make empty prophecies. The danger already exists that the mathematicians have made a covenant with the devil to darken the spirit and to confine man in the bonds of Hell. St. Augustine, De Genesi ad Litteram libri duodecim. Liber Secundus, 17, 37. Errata (as of September 8, 2006) A review (in French) appeared on the Gazette of Mathématiciens, 110, October 2006. |
This book offers the first comprehensive treatment of the problem
of predicting "individual sequences". Unlike standard statistical
approaches to forecasting, prediction of individual sequences does
not impose any probabilistic assumption on the data-generating
mechanism. Yet, prediction algorithms can be constructed that work
well for all possible sequences, in the sense that their performance
is always nearly as good as the best forecasting strategy in a given
reference class.
The central theme is the model of "prediction using expert advice",
a general framework within which many related problems can be cast
and discussed. Repeated game playing, adaptive data compression,
sequential investment in the stock market, sequential pattern analysis,
and several other problems are viewed as instances of the experts'
framework and analyzed from a common nonstochastic standpoint that
often reveals new and intriguing connections. Old and new forecasting
methods are described in a mathematically precise way with the purpose
of characterizing their theoretical limitations and possibilities.
Preface
1 Introduction
2 Prediction with expert advice
2.1 Weighted average prediction
2.2 An optimal bound
2.3 Bounds that hold uniformly over time
2.4 An improvement for small losses
2.5 Forecasters using the gradient of the loss
2.6 Scaled losses and signed games
2.7 The multilinear forecaster
2.8 The exponential forecaster for signed games
2.9 Simulatable experts
2.10 Minimax regret
2.11 Discounted regret
2.12 Bibliographic remarks
2.13 Exercises
3 Tight bounds for specific losses
3.1 Introduction
3.2 Follow the best expert
3.3 Exp-concave loss functions
3.4 The greedy forecaster
3.5 The aggregating forecaster
3.6 Mixability for certain losses
3.7 General lower bounds
3.8 Bibliographic remarks
3.9 Exercises
4 Randomized prediction
4.1 Introduction
4.2 Weighted average forecasters
4.3 Follow the perturbed leader
4.4 Internal regret
4.5 Calibration
4.6 Generalized regret
4.7 Calibration with checking rules
4.8 Bibliographic remarks
4.9 Exercises
5 Efficient forecasters for large classes of experts
5.1 Introduction
5.2 Tracking the best expert
5.3 Tree experts
5.4 The shortest path problem
5.5 Tracking the best of many actions
5.6 Bibliographic remarks
5.7 Exercises
6 Prediction with limited feedback
6.1 Introduction
6.2 Label efficient prediction
6.3 Lower bounds
6.4 Partial monitoring
6.5 A general forecaster for partial monitoring
6.6 Hannan consistency and partial monitoring
6.7 Multi-armed bandit problems
6.8 An improved bandit strategy
6.9 Lower bounds for the bandit problem
6.10 How to select the best action
6.11 Bibliographic remarks
6.12 Exercises
7 Prediction and playing games
7.1 Games and equilibria
7.2 Minimax theorems
7.3 Repeated two-player zero-sum games
7.4 Correlated equilibrium and internal regret
7.5 Unknown games: game-theoretic bandits
7.6 Calibration and correlated equilibrium
7.7 Blackwell's approachability theorem
7.8 Potential-based approachability
7.9 Convergence to Nash equilibria
7.10 Convergence in unknown games
7.11 Playing against opponents that react
7.12 Bibliographic remarks
7.13 Exercises
8 Absolute loss
8.1 Simulatable experts
8.2 Optimal algorithm for simulatable experts
8.3 Static experts
8.4 A simple example
8.5 Bounds for classes of static experts
8.6 Bounds for general classes
8.7 Bibliographic remarks
8.8 Exercises
9 Logarithmic loss
9.1 Sequential probability assignment
9.2 Mixture forecasters
9.3 Gambling and data compression
9.4 The minimax optimal forecaster
9.5 Examples
9.6 The Laplace mixture
9.7 A refined mixture forecaster
9.8 Lower bounds for most sequences
9.9 Prediction with side information
9.10 A general upper bound
9.11 Further examples
9.12 Bibliographic remarks
9.13 Exercises
10 Sequential investment
10.1 Portfolio selection
10.2 The minimax wealth ratio
10.3 Prediction and investment
10.4 Universal portfolios
10.5 The EG investment strategy
10.6 Investment with side information
10.7 Bibliographic remarks
10.8 Exercises
11 Linear pattern recognition
11.1 Prediction with side information
11.2 Bregman divergences
11.3 Potential-based gradient descent
11.4 The transfer function
11.5 Forecasters using Bregman projections
11.6 Time-varying potentials
11.7 The elliptic potential
11.8 A nonlinear forecaster
11.9 Lower bounds
11.10 Mixture forecasters
11.11 Bibliographic remarks
11.12 Exercises
12 Linear classification
12.1 The zero-one loss
12.2 The hinge loss
12.3 Maximum margin classifiers
12.4 Label efficient classifiers
12.5 Kernel-based classifiers
12.6 Bibliographic remarks
12.7 Exercises
13 Appendix
13.1 Inequalities from probability theory
13.1.1 Hoeffding's inequality
13.1.2 Bernstein's inequality
13.1.3 Hoeffding-Azuma inequality and related results
13.1.4 Khinchine's inequality
13.1.5 Slud's inequality
13.1.6 A simple limit theorem
13.1.7 Proof of Theorem 8.3
13.1.8 Rademacher averages
13.1.9 The Beta distribution
13.2 Basic information theory
13.3 Basics of classification
Bibliography
Author index