中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/82290
English  |  正體中文  |  简体中文  |  全文笔数/总笔数 : 80990/80990 (100%)
造访人次 : 42701731      在线人数 : 1427
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻


    jsp.display-item.identifier=請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/82290


    题名: 基於深度學習之360度視訊分析與處理;Deep Learning Based 360-Degree Vi Deo Processing and Analysis
    作者: 唐之瑋
    贡献者: 國立中央大學通訊工程學系
    关键词: 360度視訊;深度學習;CMP投影影像;視訊編碼之位元率控制;視覺追蹤;;360 degree video;deep learning;cubemap projection;rate control of video coding;visual tracking.
    日期: 2020-01-13
    上传时间: 2020-01-13 14:37:15 (UTC+8)
    出版者: 科技部
    摘要: 近年虛擬實境(virtual reality, VR)興起,360度視訊(360 degree video)提供使用者身臨其境的體驗,而由於360度視訊須由球體域投影至二維平面(sphere-to-plane projection),以致不同區域有不均勻的幾何形變(geometric deformation),因此,幾乎現有基於傳統2D影像而設計的視訊分析與處理方案,於此類投影影像皆無法達到最佳效能。雖然現今360度視訊壓縮標準相關研究顯示,cubemap (CMP) 投影影像的編碼rate-distortion (R-D)效能優於equirectangular projection (ERP)影像,但因球體域上均勻取樣的各點,非均勻地投影至CMP的各面(face)的不同區域,其相關編碼方案仍需改進,其中,目前幾乎無任何已發表文獻探討編碼CMP投影影像的位元率控制(rate control)。另一方面,視覺追蹤為許多電腦視覺應用必要的前端處理,但現有的視覺追蹤方案,卻僅極少數基於360度視訊之CMP投影影像而設計。由於近年電腦視覺領域研究顯示深度學習(deep learning)可有效提升其準確率,因此,本一年期計畫之研究目標,為對360度視訊的CMP投影影像處理與分析的兩個主題提出設計: (1)基於深度學習的視訊編碼(video coding)之位元率控制(rate control)。(2) 基於深度學習的視覺追蹤(visual tracking)。本計畫研究成果預期將有效提升編碼CMP投影影像的位元率控制之準確率,與CMP投影影像的視覺追蹤準確率,進而提升360度視訊相關應用之品質。 ;With the rise of virtual reality, 360 degree videos provide users immersive experiences. However, sphere-to-plane projections of 360 degree videos lead to non-uniform geometric deformations in various regions of 2-D projected images. Accordingly, 2-D projections of 360 degree videos degrade performance of existing video analysis and processing schemes that designed for conventional 2-D images. Although the state-of-the-art in 360-degree video coding indicates that the R-D performance of the cubemap (CMP) projection outperforms that of the equirectangular projection (ERP), the uniform samples in the sphere domain are still projected non-uniformly onto different regions of each face of the CMP projection. Thus, the existing CMP projection based encoder can be further improved while there is few rate control scheme designed for CMP encoding. On the other hand, visual tracking is essential to many applications of computer vision. However, there are few trackers designed for the CMP projection based 360 degree videos. Since research results indicate that deep learning can significantly improve performance of applications of computer vision, this project focuses on two topics of the CMP projection of 360 degree video processing and analysis: (1) Deep learning based rate control of video coding. (2) Deep learning based visual tracking. For the CMP projection of 360 degree videos, this project expects to increase accuracy of rate control of video coding and accuracy of visual tracking. Accordingly, quality of the 360 degree video related applications can be significantly improved.
    關聯: 財團法人國家實驗研究院科技政策研究與資訊中心
    显示于类别:[通訊工程學系] 研究計畫

    文件中的档案:

    档案 描述 大小格式浏览次数
    index.html0KbHTML305检视/开启


    在NCUIR中所有的数据项都受到原著作权保护.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明