A PDE Approach to the Problems of Optimality of Expectations

Main Article Content

Mahir Hasanov

Abstract

Let (X, Z) be a bivariate random vector. A predictor of X based on Z is just a Borel function g(Z). The problem of "least squares prediction" of X given the observation Z is to find the global minimum point of the functional E[(X − g(Z))2] with respect to all random variables g(Z), where g is a Borel function. It is well known that the solution of this problem is the conditional expectation E(X|Z). We also know that, if for a nonnegative smooth function F: R×R → R, arg ming(Z)E[F(X, g(Z))] = E[X|Z], for all X and Z, then F(x, y) is a Bregmann loss function. It is also of interest, for a fixed ϕ to find F (x, y), satisfying, arg ming(Z)E[F(X, g(Z))] = ϕ(E[X|Z]), for all X and Z. In more general setting, a stronger problem is to find F (x, y) satisfying arg miny∈RE[F (X, y)] = ϕ(E[X]), ∀X. We study this problem and develop a partial differential equation (PDE) approach to solution of these problems.

Article Details

References

  1. K.B. Athreya, S.N. Lahiri, Measure Theory and Probability Theory, Springer Texts in Statistics, Springer, New York, 2006.
  2. A. Banerjee, X. Guo, H. Wang, On the Optimality of Conditional Expectation as a Bregman Predictor, IEEE Trans. Inform. Theory. 51 (2005), 2664–2669. https://doi.org/10.1109/tit.2005.850145.
  3. H.H. Bauschke, M.S. Macklem, J.B. Sewell, X. Wang, Klee Sets and Chebyshev Centers for the Right Bregman Distance, J. Approx. Theory. 162 (2010), 1225–1244. https://doi.org/10.1016/j.jat.2010.01.001.
  4. A. Ben-Tal, A. Charnes, M. Teboulle, Entropic Means, J. Math. Anal. Appl. 139 (1989), 537–551. https://doi.org/10.1016/0022-247x(89)90128-5.
  5. Y. Censor, S. Zenios, Parallel Optimization: Theory, Algorithms, and Applications, Oxford University Press, London, 1998.
  6. I. Csiszar, Why Least Squares and Maximum Entropy? An Axiomatic Approach to Inference for Linear Inverse Problems, Ann. Stat. 19 (1991), 2032–2066. https://doi.org/10.1214/aos/1176348385.
  7. I. Csiszar, Generalized Projections for Non-Negative Functions, in: Proceedings of 1995 IEEE International Symposium on Information Theory, IEEE, Whistler, BC, Canada, 1995: p. 6. https://doi.org/10.1109/ISIT.1995.531108.
  8. F. Deutsch, Best Approximation in Inner Product Spaces, Springer-Verlag, New York, 2021.
  9. G. Grimmett, D. Stirzaker, Probability and Random Processes, Oxford University Press, Oxford, 2004.
  10. S. Karlin, H.M. Taylor, A Second Course in Stochastic Processes, 2nd ed. Academic Press, San Diego, 1991.
  11. M. Hasanov, The Spectra of Two-Parameter Quadratic Operator Pencils, Math. Computer Model. 54 (2011), 742–755. https://doi.org/10.1016/j.mcm.2011.03.018.
  12. D. Reem, S. Reich, A. De Pierro, Re-Examination of Bregman Functions and New Properties of Their Divergences, Optimization. 68 (2018), 279–348. https://doi.org/10.1080/02331934.2018.1543295.
  13. D. Williams, Probability with Martingales, Cambridge Mathematical Textbooks, Cambridge University Press, Cambridge, 2001.