Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Dec 7;10(12):e0144296.
doi: 10.1371/journal.pone.0144296. eCollection 2015.

Sentiment of Emojis

Affiliations

Sentiment of Emojis

Petra Kralj Novak et al. PLoS One. .

Abstract

There is a new generation of emoticons, called emojis, that is increasingly being used in mobile communications and social media. In the past two years, over ten billion emojis were used on Twitter. Emojis are Unicode graphic symbols, used as a shorthand to express concepts and ideas. In contrast to the small number of well-known emoticons that carry clear emotional contents, there are hundreds of emojis. But what are their emotional contents? We provide the first emoji sentiment lexicon, called the Emoji Sentiment Ranking, and draw a sentiment map of the 751 most frequently used emojis. The sentiment of the emojis is computed from the sentiment of the tweets in which they occur. We engaged 83 human annotators to label over 1.6 million tweets in 13 European languages by the sentiment polarity (negative, neutral, or positive). About 4% of the annotated tweets contain emojis. The sentiment analysis of the emojis allows us to draw several interesting conclusions. It turns out that most of the emojis are positive, especially the most popular ones. The sentiment distribution of the tweets with and without emojis is significantly different. The inter-annotator agreement on the tweets with emojis is higher. Emojis tend to occur at the end of the tweets, and their sentiment polarity increases with the distance. We observe no significant differences in the emoji rankings between the 13 languages and the Emoji Sentiment Ranking. Consequently, we propose our Emoji Sentiment Ranking as a European language-independent resource for automated sentiment analysis. Finally, the paper provides a formalization of sentiment and a novel visualization in the form of a sentiment bar.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have the following interests: The sentiment annotations were supported by the Goldfinch platform, provided by Sowa Labs (http://www.sowalabs.com) for free. Most of the tweets (except English) were collected during a joint project with Gama System (http://www.gama-system.si), using their PerceptionAnalytics platform (http://www.perceptionanalytics.net). There are no patents, products in development or marketed products to declare. This does not alter the authors’ adherence to all the PLOS ONE policies on sharing data and materials.

Figures

Fig 1
Fig 1. Top 10 emojis.
Emojis are ordered by the number of occurrences N. The average position ranges from 0 (the beginning of the tweets) to 1 (the end of the tweets). p c, c ∈ {−1, 0, +1}, are the negativity, neutrality, and positivity, respectively. s¯ is the sentiment score.
Fig 2
Fig 2. Sentiment map of the 751 emojis.
Left: negative (red), right: positive (green), top: neutral (yellow). Bubble size is proportional to log10 of the emoji occurrences in the Emoji Sentiment Ranking. Sections A, B, and C are references to the zoomed-in panels in Fig 3.
Fig 3
Fig 3. Emojis in sections A, B, and C of Fig 2.
Shown are emojis that occur at least 100 times in the Emoji Sentiment Ranking. Panel A: negative emojis, panel B: neutral (top) and bipolar (bottom) emojis, panel C: positive emojis.
Fig 4
Fig 4. Distribution of emojis by sentiment score.
The mean sentiment score of the 751 emojis (in bins of 0.05) is +0.305.
Fig 5
Fig 5. Distribution of occurrences and sentiment of the 751 emojis.
The emojis are ranked by their occurrence (log scale). The column color indicates the sentiment score. The partitioning into two equally weighted halfs is indicated by a line at R 1/2. The first 33 emojis are zoomed-in in Fig 6.
Fig 6
Fig 6. Top 33 emojis by occurrence.
Column color represents the emoji sentiment score.
Fig 7
Fig 7. Average positions of the 751 emojis in tweets.
Bubble size is proportional to log10 of the emoji occurrences in the Emoji Sentiment Ranking. Left: the beginning of tweets, right: the end of tweets, bottom: negative (red), top: positive (green).
Fig 8
Fig 8. Negativity, neutrality, and positivity regressed with position (from left to right).
The trendlines are functions p c(d) of the distance d from the beginning of the tweets.
Fig 9
Fig 9. Sentiment bars of the ‘thumbs down sign’, ‘flushed face’, and ‘chocolate bar’ emojis.
The colored bar extends from −1 to +1, the range of the sentiment score s¯. The grey bar is centered at s¯ and extended for ±1.96SEM, but never beyond the range of s¯. Colored parts are proportional to negativity (p , red), neutrality (p 0, yellow), and positivity (p +, green).

References

    1. Dimson T. Emojineering part 1: Machine learning for emoji trends [blog]; 2015. http://instagram-engineering.tumblr.com/post/117889701472/emojineering-p....
    1. SwiftKey PT. Most-used emoji revealed: Americans love skulls, Brazilians love cats, the French love hearts [blog]; 2015. http://swiftkey.com/en/blog/americans-love-skulls-brazilians-love-cats-s....
    1. Liu B. Sentiment Analysis and Opinion Mining. Synthesis Lectures on Human Language Technologies. 2012;5(1):1–167. 10.2200/S00416ED1V01Y201204HLT016 - DOI
    1. Liu B. Sentiment Analysis: Mining Opinions, Sentiments, and Emotions. Cambridge University Press; 2015.
    1. Boia M, Faltings B, Musat CC, Pu P. A:) is worth a thousand words: How people attach sentiment to emoticons and words in tweets. In: Intl. Conf. on Social Computing (SocialCom). IEEE; 2013. p. 345–350.

Publication types