NCU Institutional Repository (中大機構典藏), providing downloads of theses and dissertations, past exam papers, journal articles, and research projects: Item 987654321/72055


    Please use this permanent URL to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/72055


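    Title: 基於語意之輿情分析系統; Semantic Based Public Opinion Analysis System
    Author: 曾昱智; ZENG, YU-ZHI
    Contributor: In-service Program, Department of Computer Science and Information Engineering (資訊工程學系在職專班)
    Keywords: 語意 (semantics); 輿情 (public opinion); Semantic; Opinion Analysis
    Date: 2016-08-25
    Upload time: 2016-10-13 14:23:51 (UTC+8)
    Publisher: National Central University (國立中央大學)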
    Abstract: Studies of sentence-level sentiment analysis usually add hand-built factors, such as sentiment keyword lists and manually defined sentiment rules, to raise accuracy. Because these customized factors require large amounts of data and long training times, they tend to make the system architecture inflexible and its performance poor. With these requirements in mind, this thesis builds a system architecture that analyzes the semantic content of sentences while remaining fast and reasonably effective.
    The system has three main parts. The first is data training, which draws on research in emotion and the psychology of emotion. Sentiment rules are derived mainly from the HowNet corpus and the Chinese part-of-speech analysis technical report of the Academia Sinica Chinese Knowledge and Information Processing (CKIP) group. These rules produce sparse representation features, from which a sparse representation dictionary is built. After the sparse coefficients are solved, each of the two classes reconstructs the original vector from its own dictionary and coefficients, the reconstruction error against the original vector is computed, and the class with the smallest error is taken as the class of the input. The second part is topic input and comment retrieval, which describes how comments on popular discussion threads in current online forums are obtained. The third part is data classification, which uses the training results to assess the accuracy of topic classification. In the experiments, currently popular points of discussion are identified one by one and used as test topics for the sentiment classification module.
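The minimum-reconstruction-error rule described in the abstract can be illustrated with a small sketch. This is not the thesis's implementation: the abstract does not say which sparse solver is used, so orthogonal matching pursuit is swapped in here, and the function names `omp` and `classify`, the class labels, and the toy 4-dimensional dictionaries are all hypothetical.

```python
import numpy as np

def omp(D, x, k):
    """Orthogonal matching pursuit: approximate x using at most k columns of D.

    Assumed solver; the thesis abstract does not name one. Requires k >= 1.
    """
    residual = x.astype(float).copy()
    support = []                      # indices of the selected atoms
    sol = np.zeros(0)
    for _ in range(k):
        # Select the atom most correlated with the current residual.
        j = int(np.argmax(np.abs(D.T @ residual)))
        if j in support:              # no new atom helps; stop early
            break
        support.append(j)
        # Refit all selected atoms by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
    coef = np.zeros(D.shape[1])
    coef[support] = sol
    return coef

def classify(x, dictionaries, k=2):
    """Return the label whose dictionary reconstructs x with the smallest error."""
    errors = {}
    for label, D in dictionaries.items():
        coef = omp(D, x, k)           # sparse coefficients for this class
        errors[label] = float(np.linalg.norm(x - D @ coef))
    return min(errors, key=errors.get), errors

# Hypothetical toy dictionaries: each column is one unit-norm feature atom.
dictionaries = {
    "positive": np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0], [0.0, 0.0]]),
    "negative": np.array([[0.0, 0.0], [0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]),
}
x = np.array([0.9, 0.4, 0.1, 0.0])    # hypothetical feature vector of one comment
label, errors = classify(x, dictionaries)
print(label, errors)
```

In this toy setup the input lies mostly in the span of the "positive" atoms, so that class gives the smaller reconstruction error. Real feature vectors would come from the rule-based feature extraction step, which is not sketched here.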
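    Appears in Collections: [In-service Master's Program, Department of Computer Science and Information Engineering (資訊工程學系碩士在職專班)] Theses and Dissertations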

    Files in This Item:

    File        Description  Size  Format  Views
    index.html               0Kb   HTML    423


    All items in NCUIR are protected by original copyright.
