- Name: ntuer at SemEval-2019 Task 3: Emotion Classification with Word and Sentence Representations in RCNN
- Authors: Peixiang Zhong, Chunyan Miao
- ArXiv: https://arxiv.org/abs/1902.07867
Task & Contributions
The task of this paper is to detect emotions from textual conversations, as described in SemEval-2019 Task 3.
The main contributions of this paper is listed as follows:
- A RCNN model extended with word and sentence representations has been proposed, as can be seen from the headline.
- A list of experiments with micro-averaged F1 as the evaluation criteria were conducted to prove the effectiveness and roubustness of the proposed model. The proposed model achieved good performances in the competition.
The architecture of the proposed method(they should think a good name) is shown in the figure above. Given a conversation consisting of three utterances, the model first concatenated them as one sentence. For the word-level represention, the word embedding pre-trained on 330M English Twitter messages using Glove is used as the input of RCNN. The RCNN model mainly comprises of a BiLSTM, a linear transformation layer(dimension reduction) and a max-pooling layer(discriminative features extraction). In the sentence-level representation, DeepMoji is used as sentence embedding, which is concatenated with the outputs of RCNN. Finally, the sentence representations are fed to a softmax layer.
The focus of experiments can be divided into three classes: word embeddings, sentence embeddings and hyper-parameters.
The model proposed in this paper is quite a simple model. The reason I read it carefully is the thought of adding sentence representation to word representation and experimental results proved the effectiveness of the method. When using S-LSTM, I can obtain a general sentence representation, but how to deal with it is a problem. Concatenating and adding are the simplest ways.
When writing related work, if no work doing the same task can be found, we can choose work in the related field. By the way, the frame figure in this paper is quite misunderstanding.
- 【CNN】CNN-based methods can capture local dependencies, discriminative features, and are parallelizable for efficient computation.
- 【LSTM】LSTM-based methods can capture the word-ordering information and have achieved state-of-the-art performances on many sentiment analysis datasets.