English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 80990/80990 (100%)
造訪人次 : 42757139      線上人數 : 2132
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/72260


    題名: 多通道之多重音頻串流方法之研究;Multi-Channel Method For Multiple Pitch streaming
    作者: 官志誼;Kuan,Chih-Yi
    貢獻者: 資訊工程學系
    關鍵詞: 基礎頻率分析;多重音頻串流;粒子群最佳化;pitch detection;Multi pitch streaming;PSO
    日期: 2016-08-25
    上傳時間: 2016-10-13 14:35:18 (UTC+8)
    出版者: 國立中央大學
    摘要: 基礎頻率分析在數位訊號處理中是一項重要課題並可以延伸到許多相關的研究,無論是在音樂或者語音上皆有其中要性,本論文主要討論多個單音音源的音頻串流方法,本論文提出之系統需要三個輸入,分別為音源個數、基頻偵測結果、混合音檔。而整體系統可以分為兩個階段,第一階段為依據基頻偵測結果將每一個音高取得相對應特徵參數,第二階段則將上述所有資料進行的聚類,最後輸出各個音源的音頻串流,簡單來說即是每個時刻每個音源演奏哪些音高的資訊。
    本論文在特徵參數方面我提出了新的多通道方位特徵參數,並與其他音色特徵參數融合成為更加強健的特徵參數,聚類方面我們基於粒子群最佳化演算法提出了兩種不同架構,並融合領域知識於其中來提高準確率。另外本論文特別針對音源音域接近、音頻串流纏繞頻繁的音檔來設計並能有更好的準確率。
    ;Fundamental frequency analysis of multiple sound mixtures is a important information in audio signal processing. To know the information of fundamental frequency can be extended for several applications like in music information retrieval, automatic music transcription, melody extraction, instrument identification. In speech research, like speech separation, speech recognition and prosody analysis. This paper aims at source transcription of polyphonic audio, can be consisting of two stages .Stage one is to detecting each pitches values provide by different sources in every time frame is known as multiple F0 estimation. Stage two is to clustering all the pitch which detected in stage one into a single pitch trajectory originating from the corresponding sources. The main focus on this paper is to do source clustering of the detected pitch in polyphonic audio signal which the pitch provide by different sources playing at the same time.
    Although many works have been proposed to do source transcription, multi pitch streaming, multi F0 source clustering there are still various challenges in this task. In feature extraction, since the different sound sources playing simultaneously, the pitch contour numerous overlap in the mixture audio, is hard to generating the source characterizing feature corresponding to different F0 values, especially in music case. To solve this problem we adopt multi -channel approach to improve the source characterizing feature.
    While source characterizing feature corresponding to each pitches has been extracted. The next step is to clustering all the detected pitches into corresponding sources. Since the supervised approaches require more information of isolated recordings to training models, our approach focus on the unsupervised way. We introduce a new Constrained PSO clustering which can deal with this task more precise.
    This paper introduce a novel scheme for the source transcription of polyphonic sound mixture. Our approach need three inputs: multi-channel mixture sound, multi pith estimation (MPE) values, number of sources. We use the Ground truths multi pitch values and some other multi pitch estimation work provide by Duan et al. as MPE input. Then, use this MPE value to extract both timbre and direction feature, and concatenation two feature with the STD weight. After feature extraction, we use the Constrained PSO clustering to try all the possible of the clustering distribution and to find minimize timbre and direction inconsistency. Finally, we can map the clustering result back to each individual pitches labels and output the every single pitch trajectory from corresponding sources.
    顯示於類別:[資訊工程研究所] 博碩士論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML506檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明