Empirical Optimal Transport between Different Measures Adapts to Lower Complexity

2022-02-21 | preprint. A publication with affiliation to the University of Göttingen.

Jump to:Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​Empirical Optimal Transport between Different Measures Adapts to Lower Complexity​
Hundrieser, S.; Staudt, T.& Munk, A. ​ (2022)

Documents & Media

License

GRO License GRO License

Details

Authors
Hundrieser, Shayan; Staudt, Thomas; Munk, Axel 
Abstract
The empirical optimal transport (OT) cost between two probability measures from random data is a fundamental quantity in transport based data analysis. In this work, we derive novel guarantees for its convergence rate when the involved measures are different, possibly supported on different spaces. Our central observation is that the statistical performance of the empirical OT cost is determined by the less complex measure, a phenomenon we refer to as lower complexity adaptation of empirical OT. For instance, under Lipschitz ground costs, we find that the empirical OT cost based on $ observations converges at least with rate ^{-1/d}$ to the population quantity if one of the two measures is concentrated on a ehBdimensional manifold, while the other can be arbitrary. For semi-concave ground costs, we show that the upper bound for the rate improves to ^{-2/d}$. Similarly, our theory establishes the general convergence rate ^{-1/2}$ for semi-discrete OT. All of these results are valid in the two-sample case as well, meaning that the convergence rate is still governed by the simpler of the two measures. On a conceptual level, our findings therefore suggest that the curse of dimensionality only affects the estimation of the OT cost when both measures exhibit a high intrinsic dimension. Our proofs are based on the dual formulation of OT as a maximization over a suitable function class $\mathcal{F}_c$ and the observation that the ehBtransform of $\mathcal{F}_c$ under bounded costs has the same uniform metric entropy as $\mathcal{F}_c$ itself.
Issue Date
21-February-2022
Project
EXC 2067: Multiscale Bioimaging 
SFB 1456: Mathematik des Experiments: Die Herausforderung indirekter Messungen in den Naturwissenschaften 
SFB 1456 | Cluster A | A04: Dynamics of cytoskeletal networks: From geometric structure to cell mechanics 
SFB 1456 | Cluster C | C06: Optimal transport based colocalization 
Organization
Campus-Institut Data Science 
Working Group
RG Munk 
External URL
https://mbexc.uni-goettingen.de/literature/publications/452

Reference

Citations