中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/82290
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 80990/80990 (100%)
Visitors : 42701987      Online Users : 1358
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/82290


    Title: 基於深度學習之360度視訊分析與處理;Deep Learning Based 360-Degree Vi Deo Processing and Analysis
    Authors: 唐之瑋
    Contributors: 國立中央大學通訊工程學系
    Keywords: 360度視訊;深度學習;CMP投影影像;視訊編碼之位元率控制;視覺追蹤;;360 degree video;deep learning;cubemap projection;rate control of video coding;visual tracking.
    Date: 2020-01-13
    Issue Date: 2020-01-13 14:37:15 (UTC+8)
    Publisher: 科技部
    Abstract: 近年虛擬實境(virtual reality, VR)興起,360度視訊(360 degree video)提供使用者身臨其境的體驗,而由於360度視訊須由球體域投影至二維平面(sphere-to-plane projection),以致不同區域有不均勻的幾何形變(geometric deformation),因此,幾乎現有基於傳統2D影像而設計的視訊分析與處理方案,於此類投影影像皆無法達到最佳效能。雖然現今360度視訊壓縮標準相關研究顯示,cubemap (CMP) 投影影像的編碼rate-distortion (R-D)效能優於equirectangular projection (ERP)影像,但因球體域上均勻取樣的各點,非均勻地投影至CMP的各面(face)的不同區域,其相關編碼方案仍需改進,其中,目前幾乎無任何已發表文獻探討編碼CMP投影影像的位元率控制(rate control)。另一方面,視覺追蹤為許多電腦視覺應用必要的前端處理,但現有的視覺追蹤方案,卻僅極少數基於360度視訊之CMP投影影像而設計。由於近年電腦視覺領域研究顯示深度學習(deep learning)可有效提升其準確率,因此,本一年期計畫之研究目標,為對360度視訊的CMP投影影像處理與分析的兩個主題提出設計: (1)基於深度學習的視訊編碼(video coding)之位元率控制(rate control)。(2) 基於深度學習的視覺追蹤(visual tracking)。本計畫研究成果預期將有效提升編碼CMP投影影像的位元率控制之準確率,與CMP投影影像的視覺追蹤準確率,進而提升360度視訊相關應用之品質。 ;With the rise of virtual reality, 360 degree videos provide users immersive experiences. However, sphere-to-plane projections of 360 degree videos lead to non-uniform geometric deformations in various regions of 2-D projected images. Accordingly, 2-D projections of 360 degree videos degrade performance of existing video analysis and processing schemes that designed for conventional 2-D images. Although the state-of-the-art in 360-degree video coding indicates that the R-D performance of the cubemap (CMP) projection outperforms that of the equirectangular projection (ERP), the uniform samples in the sphere domain are still projected non-uniformly onto different regions of each face of the CMP projection. Thus, the existing CMP projection based encoder can be further improved while there is few rate control scheme designed for CMP encoding. On the other hand, visual tracking is essential to many applications of computer vision. However, there are few trackers designed for the CMP projection based 360 degree videos. Since research results indicate that deep learning can significantly improve performance of applications of computer vision, this project focuses on two topics of the CMP projection of 360 degree video processing and analysis: (1) Deep learning based rate control of video coding. (2) Deep learning based visual tracking. For the CMP projection of 360 degree videos, this project expects to increase accuracy of rate control of video coding and accuracy of visual tracking. Accordingly, quality of the 360 degree video related applications can be significantly improved.
    Relation: 財團法人國家實驗研究院科技政策研究與資訊中心
    Appears in Collections:[Department of Communication Engineering] Research Project

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML305View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明