Abstract
The emergence of single-cell multi-omics sequencing technology has enabled the simultaneous profiling of diverse omics data within individual cells. It offers a more comprehensive perspective on cellular phenotypes and heterogeneity. However, single-cell multi-omics data are inherently high-dimensional and heterogeneous. Due to technical limitations and scarce starting materials, the data are often affected by noise and dropout effects. To address these challenges, we propose a novel multitask driven multi-level dynamical fusion algorithm for single-cell multi-omics cell type annotation, named scMMDyn. Our approach incorporates reconstruction and classification auxiliary tasks to guide the training of trustworthy modules at both the feature and modality levels. It executes dynamical fusion during these stages and finally achieves cross-modality fusion via an attention mechanism. This method effectively mitigates data quality issues through reconstruction tasks and feature-level dynamical fusion while providing interpretability at both feature and modality levels. Experimental results across diverse single-cell multi-omics datasets show that our method surpasses existing approaches in cell type annotation.