中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/72059
English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 80990/80990 (100%)
造訪人次 : 42734507      線上人數 : 1338
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/72059


    題名: 語音合成及語者轉換之應用與設計;Application and Design of Speech Synthesis and Speaker Conversion
    作者: 張敦鈞;CHANG,TUN-CHUN
    貢獻者: 資訊工程學系在職專班
    關鍵詞: 語音合成;語者轉換;Speech Synthesis;Speaker Conversion
    日期: 2016-08-25
    上傳時間: 2016-10-13 14:24:00 (UTC+8)
    出版者: 國立中央大學
    摘要: 本論文結合語音合成及語者轉換的技術做相關的應用與設計,語音合成是真人聲音經合成引擎轉成機器音,語者轉換是以原來語者的聲音為基礎,轉換為另一語者的聲型發聲。要使這兩個技術能應用於生活及娛樂上,需要設計系統來供實作,本系統的設計為,輸入文字,經語音合成產生出來源語者的聲模,再加上文脈相依資料,經合成軟體,擷取出來源語者的頻譜特徵參數,再將目的語者語音擷取出頻譜特徵參數。兩者的頻譜特徵參數,經DTW比對,產生音框特徵向量匹配表,經LBG演算法形成高斯混合模型,用EM演算法,做高斯混合模型訓練,再經由GMM對應參數的方法,當輸入來源語者的頻譜參數,會轉出輸目的語者的頻譜參數,另外,激發出來源語者的音高特徵參數,再與目的語者的頻譜特徵參數,經合成濾波器形成目的語者的合成音。本論文提出語音合成及語者轉換之多項應用與設計。;The document combines speech synthesis and speaker conversion and these have relevant application and design. Speech synthesis is that the voice of real man is converted machine voice by synthesis engine. Speaker conversion is based on source speaker and it converts another voice of speaker. To let two techniques can be used in life and entertainment, it needs system to provide implement. The design of the system is that spectrum feather parameter of source speaker is extracted by synthetic software Data of text dependence produced by inputting words and voice model of source speaker input it. And the parameter of target speaker is extracted from voice of target speaker. Both of parameter generate the match table of feather vector of frame by DTW comparing, then GMM is formed by LBG algorithm. After that, using EM algorithm is order to train GMM. When finishing train, parameter correspondence method has transform function. When inputting source spectrum, target spectrum can be got. Besides, synthetic voice of target speaker is formed by speaker is formed by putting pitch feather parameter of source speaker excited and spectrum feather parameter of target speaker together through MLSA (Mel Log Spectrum Approximation). This document proposes many applications and designs of Speech Synthesis and Speaker Conversion.
    顯示於類別:[資訊工程學系碩士在職專班 ] 博碩士論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML433檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明