TY - JOUR
T1 - Multivariate functional linear discriminant analysis for partially-observed time series
AU - Bordoloi, Rahul
AU - Réda, Clémence
AU - Trautmann, Orell
AU - Bej, Saptarshi
AU - Wolkenhauer, Olaf
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/3
Y1 - 2025/3
N2 - The more extensive access to time-series data, especially for biomedical purposes, raises new methodological challenges, particularly regarding missing values. Functional linear discriminant analysis (FLDA) extends Linear Discriminant Analysis (LDA)-mediated multiclass classification and dimension reduction to data in the form of fragmented observations of a univariate function. For large multivariate and partially-observed data, there are two challenges: (i) statistical dependencies between different components of a multivariate function and (ii) heterogeneous sampling times with missing features. We here develop a multivariate version of FLDA, called MUDRA, to tackle these challenges and describe a computationally efficient expectation/conditional-maximisation (ECM) algorithm to infer its parameters without any tensor inversions. We assess its predictive power on the “Articulary Words” dataset and show its improvement over the state-of-the-art, especially in the case of missing data. This advancement in dimension reduction of multivariate functional data holds promise for enhancing classification accuracy in scenarios like partially observed short multivariate time series analysis.
AB - The more extensive access to time-series data, especially for biomedical purposes, raises new methodological challenges, particularly regarding missing values. Functional linear discriminant analysis (FLDA) extends Linear Discriminant Analysis (LDA)-mediated multiclass classification and dimension reduction to data in the form of fragmented observations of a univariate function. For large multivariate and partially-observed data, there are two challenges: (i) statistical dependencies between different components of a multivariate function and (ii) heterogeneous sampling times with missing features. We here develop a multivariate version of FLDA, called MUDRA, to tackle these challenges and describe a computationally efficient expectation/conditional-maximisation (ECM) algorithm to infer its parameters without any tensor inversions. We assess its predictive power on the “Articulary Words” dataset and show its improvement over the state-of-the-art, especially in the case of missing data. This advancement in dimension reduction of multivariate functional data holds promise for enhancing classification accuracy in scenarios like partially observed short multivariate time series analysis.
KW - Functional data analysis
KW - Linear discriminant analysis
KW - Missing data
KW - Time-series classification
UR - https://www.scopus.com/pages/publications/85218335324
U2 - 10.1007/s10994-025-06741-0
DO - 10.1007/s10994-025-06741-0
M3 - Article
AN - SCOPUS:85218335324
SN - 0885-6125
VL - 114
JO - Machine Learning
JF - Machine Learning
IS - 3
M1 - 80
ER -