1673-159X

CN 51-1686/N

基于深度学习的空管语音识别

Deep Learning-based Speech Recognition for Air Traffic Control

  • 摘要: 文章在分析空中交通管制业务的基础上,研究一种适用于我国民航管制通话的端到端语音识别算法。文章设计了基于卷积神经网络和循环神经网络的深度学习模型,以CTC作为损失函数使用已标注数据进行迭代训练,从而优化模型参数。以空中交通管理中的管制通话语音作为模型的输入,最终输出中文汉字和空管专有名词。使用真实采集的管制通话语音数据进行实验,在10 h的训练数据上词错误率为9.49%。实验结果表明,与传统的语音识别算法比较,该算法有更优异的识别效果。

     

    Abstract: Based on the analysis of air traffic control (ATC), in this paper, an end-to-end automatic speech recognition model is studied to address the speech recognition task for ATC in China. A deep learning model based on convolutional neural network and recurrent neural network is designed and implemented to translate the speech into text. Based on the CTC loss function, the proposed model is iteratively optimized with real samples that were collected from civil airports by voice recorder. The input and output of the proposed model are the ATC speech and the Chinese character with dedicated ATC terms respectively. Experiments are conducted to determine an optimal architecture and confirm the performance over existing approaches, achieving 9.49%-character error rate on a 10-hour training dataset. The result shows that the proposed model obtains a desired performance of the speech recognition.

     

/

返回文章
返回