時至今日為止,聲學回聲消除 (Acoustic Echo Cancellation, AEC) 都是一個在語音和信號處理中常見的問題。應用的場景如電話會議,免持聽筒和移動通信。在過去我們用可適性濾波器來處理聲學回聲消除的問題,而今日我們可以用深度學習的方式來解決聲學回聲消除中複雜的問題。 本篇論文提出的方法則是把聲學回聲消除視為語音分離的問題,取代傳統的可適性濾波器估測聲學回聲。並利用深度學習中的遞迴神經網路 (Recurrent Neural Network, RNN) 架構去訓練模型。由於遞迴神經網絡模擬時變函數的能力良好,所以可以在解決聲學回聲消除問題中發揮作用。我們訓練具有記憶的雙向的長短期記憶網路 (Long Short Term Memory Network, LSTM) 及雙向的門控遞迴單元 (Gated Recurrent Unit, GRU) 的遞迴神經網絡。從單講語音以及雙講語音中提取特徵,並透過調整權重來控制特徵之間的大小比例,來估計理想比例掩蔽(Ideal Ratio Mask, IRM)。利用這種方式來分離信號,從而達到去除回聲的目的。實驗結果表明該方法消除回聲的效果良好。;Acoustic echo cancellation is a common problem in speech and signal processing until now. Application scenarios such as telephone conference, hands-free handsets and mobile communications. In the past we used adaptive filters to deal with acoustic echo cancellation, and today we can use deep learning to solve complex problems in acoustic echo cancellation. The method proposed in this work is to consider acoustic echo cancellation as a problem of speech separation, instead of the traditional adaptive filter to estimate acoustic echo. And use the recurrent neural network architecture in deep learning to train the model. Since the recurrent neural network has a good ability to simulate time-varying functions, it can play a role in solving the problem of acoustic echo cancellation. We train a bidirectional long short-term memory network and a bidirectional gated recurrent unit. Features are extracted from single-talk speech and double-talk speech. Adjust weights to control the ratio between double-talk speech and single-talk speech, and estimate the ideal ratio mask. This way to separate the signal, in order to achieve the purpose of removing the echo. The experimental results show that the method has good effect in echo cancellation.