Quantcast

Positive and Negative Sentiment Words in a Blog Corpus Written in Hebrew ☆

Research paper by Yaakov HaCohen-Kerner, Haim Badash

Indexed on: 05 Oct '16Published on: 04 Sep '16Published in: Procedia Computer Science



Abstract

In this research, given a corpus containing blog posts written in Hebrew and two seed sentiment lists, we analyze the positive and negative sentences included in the corpus, and special groups of words that are associated with the positive and negative seed words. We discovered many new negative words (around half of the top 50 words) but only one positive word. Among the top words that are associated with the positive seed words, we discovered various first-person and third-person pronouns. Intensifiers were found for both the positive and negative seed words. Most of the corpus’ sentences are neutral. For the rest, the rate of positive sentences is above 80%. The sentiment scores of the top words that are associated with the positive words are significantly higher than those of the top words that are associated with the negative words.