Quick Adaptive Ternary Segmentation: An Efficient Decoding Procedure For Hidden Markov Models

2023-05-29 | preprint. A publication with affiliation to the University of Göttingen.

Jump to: Cite & Linked | Documents & Media | Details | Version history

Cite this publication

​Quick Adaptive Ternary Segmentation: An Efficient Decoding Procedure For Hidden Markov Models​
Mösching, A.; Li, H.  & Munk, A. ​ (2023). DOI: https://doi.org/10.48550/arxiv.2305.18578 

Documents & Media

License

GRO License GRO License

Details

Authors
Mösching, Alexandre; Li, Housen ; Munk, Axel 
Abstract
Hidden Markov models (HMMs) are characterized by an unobservable (hidden) Markov chain and an observable process, which is a noisy version of the hidden chain. Decoding the original signal (i.e., hidden chain) from the noisy observations is one of the main goals in nearly all HMM based data analyses. Existing decoding algorithms such as the Viterbi algorithm have computational complexity at best linear in the length of the observed sequence, and sub-quadratic in the size of the state space of the Markov chain. We present Quick Adaptive Ternary Segmentation (QATS), a divide-and-conquer procedure which decodes the hidden sequence in polylogarithmic computational complexity in the length of the sequence, and cubic in the size of the state space, hence particularly suited for large scale HMMs with relatively few states. The procedure also suggests an effective way of data storage as specific cumulative sums. In essence, the estimated sequence of states sequentially maximizes local likelihood scores among all local paths with at most three segments. The maximization is performed only approximately using an adaptive search procedure. The resulting sequence is admissible in the sense that all transitions occur with positive probability. To complement formal results justifying our approach, we present Monte-Carlo simulations which demonstrate the speedups provided by QATS in comparison to Viterbi, along with a precision analysis of the returned sequences. An implementation of QATS in C++ is provided in the R-package QATS and is available from GitHub.
Issue Date
29-May-2023
Project
SFB 1456: Mathematik des Experiments: Die Herausforderung indirekter Messungen in den Naturwissenschaften 
SFB 1456 | Cluster B: Data with Incomplete Information 
SFB 1456 | Cluster B | B04: Collective dynamics of ion channels: statistical modeling and analysis 
EXC 2067: Multiscale Bioimaging 
Organization
Institut für Mathematische Stochastik 
Working Group
RG Munk 
Extent
37
Language
English

Reference

Citations


Social Media