Mostrar o rexistro simple do ítem

dc.contributor.advisorGamallo Otero, Pablo
dc.contributor.authorAl-Matarneh Mohammad Ata, Sattam
dc.date.accessioned2018-12-28T09:31:17Z
dc.date.available2018-12-28T09:31:17Z
dc.date.issued2018
dc.identifier.urihttp://hdl.handle.net/10347/18042
dc.description.abstractStudies in sentiment analysis and opinion mining focused on many aspects related to opinions, particularly polarity classification by making use of positive, negative or neutral values. However, most studies overlooked the identification of extreme opinions (very negative and very positive opinions) in spite of their vast significance in many applications. This doctoral thesis describes a strategy to build sentiment lexicons from corpora, namely lexicons adapted to extreme values. This strategy has been used to build some lexicons and to know its effectiveness in determining the polarity of opinions. First, we will construct a domain-specific lexicon from a corpus of movie reviews. Polarity words of the lexicon are assigned weights standing for different degrees of positiveness and negativeness. This lexicon is will be combined into a sentiment analysis system to evaluate its performance in the task of sentiment classification. Second, two lexicons will be built of extremely negative and positive words from labeled corpora. We will integrate the lexicons that have been built into classifiers, whether supervised or unsupervised classifier. We will use a supervised classifier, more precisely, Support Vector Machine (SVM) with some linguistic features such as a bag of words, word embedding, polarity lexicons, and set of textual features, in order to identify extreme opinions and provide a comprehensive analysis of the relative importance of each set of features. We also will compare our lexicons with four well-known sentiment lexicons. For this purpose, an indirect evaluation is carried out. The lexicons will be integrated into supervised sentiment classifiers, and their performance is evaluated in two sentiment classification tasks to identify i) the most negative vs. not most negative opinions, and ii) the most positive vs. not most positive. Moreover, a set of textual features is integrated into the classifiers to analyze how these textual features improve the lexicon performance. On the other hand, we also tested the efficiency of our lexicons in determining extreme opinions through the use of unsupervised classifiers. Our classification algorithm is based on a fundamental word-matching scheme to carry out unsupervised sentiment analysis.
dc.language.isoeng
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectSentiment Analysis
dc.subjectSentiment Lexicon
dc.subjectPolarity Classification
dc.subjectMachine Learning
dc.subject.classificationMaterias::Investigación::12 Matemáticas::1203 Ciencia de los ordenadores::120304 Inteligencia artificial
dc.subject.classificationMaterias::Investigación::57 Lingüística::5701 Lingüística aplicada
dc.titleCorpus-based Construction of Sentiment Lexicon to Identify Extreme Opinions by Supervised and Unsupervised Machine learning Methods
dc.typedoctoral thesis
dc.rights.accessRightsopen access
dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro Internacional de Estudos de Doutoramento e Avanzados (CIEDUS)
dc.contributor.affiliationUniversidade de Santiago de Compostela. Escola de Doutoramento Internacional en Ciencias e Tecnoloxía
dc.contributor.affiliationUniversidade de Santiago de Compostela. Programa de Doutoramento en Investigación en Tecnoloxías da Información


Ficheiros no ítem

application/pdf
Nome: rep_1690.pdf
Tamaño: 22.04 Mb
Formato: PDF


Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
A licenza do ítem descríbese como
 Attribution-NonCommercial-NoDerivatives 4.0 Internacional





Recolectores:Enlaces de interese:
Universidade de Santiago de Compostela | Teléfonos: +34 881 811 000 e +34 982 820 000 | Contacto | Suxestións