Abstract
Classifying diseases in electronic medical records into corresponding ICD codes requires not only a large amount of medical knowledge but also a large number of coders, which is time-consuming and labor-consuming. Therefore, automatic coding is of great significance. This paper aims to build a deep learning model for automatic ICD-10 coding from a batch of Chinese electronic medical records. The data enhancement, convolutional neural network, attention mechanism, and the gating residual network proposed by the author were used to code ICD code corresponding to the distribution of medical record information by supervised learning. The benchmark model and ablation model were tested on a data set of Chinese electronic medical records. The effectiveness of the proposed modules, such as feature aggregation, multi-head attention mechanism, dilated convolution, and gating residuals, was verified. In the automatic ICD coding task for 104 diseases, the accuracy of the proposed method is 91.71%, and the F1-Score is 92.11%.1