Publication
Improving Health Mention Classification of Social Media Content using Contrastive Adversarial Training
Pervaiz Iqbal Khan; Shoaib Ahmed Siddiqui; Imran Razzak; Andreas Dengel; Sheraz Ahmed
In: IEEE Access, Vol. 10, Pages 87900-87910, Institute of Electrical and Electronics Engineers (IEEE), 8/2022.
Abstract
Health mention classification (HMC) involves classifying an input text as a health
mention or not. Figurative and non-health mentions of disease words make the classification task challenging.
Learning the context of the input text is the key to this problem. The idea is to learn word representations
from their surrounding words and utilize emojis in the text to help improve the classification results. In this paper,
we improve the word representation of the input text using adversarial training that acts as a regularizer
during fine-tuning of the model. We generate adversarial examples by perturbing the word embeddings of
the model and then train the model on a pair of clean and adversarial examples. Additionally, we utilize
a contrastive loss that encourages similar representations for the clean example and its perturbed version.
We train and evaluate the method on three public datasets. Experiments show that contrastive adversarial
training significantly improves the F1-score over both the
BERT-Large and RoBERTa-Large baselines on all three datasets. Furthermore, we provide a brief analysis of the results
using explainable AI.
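
The training objective described in the abstract (a loss on a clean example, a loss on its embedding-perturbed version, and a contrastive term tying the two together) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not stated in the abstract: the sign-of-gradient (FGSM-style) perturbation, the InfoNCE-style contrastive term with temperature `tau`, the weighting `lam`, and the toy logistic classifier that stands in for a fine-tuned BERT or RoBERTa model. In this toy setup the embedding itself plays the role of the learned representation.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bce(p, y):
    # Binary cross-entropy for a single example with label y in {0, 1}.
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

def predict(emb, w, b):
    # Toy stand-in for the classifier: logistic regression over one embedding.
    return sigmoid(sum(e * wi for e, wi in zip(emb, w)) + b)

def fgsm_perturb(emb, w, b, y, eps):
    # Assumption: FGSM-style step. For this logistic model the gradient of
    # the cross-entropy w.r.t. the embedding is (p - y) * w.
    p = predict(emb, w, b)
    grad = [(p - y) * wi for wi in w]
    sign = lambda g: (g > 0) - (g < 0)
    return [e + eps * sign(g) for e, g in zip(emb, grad)]

def cosine(u, v):
    dot = sum(a * c for a, c in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(c * c for c in v))
    return dot / (norm_u * norm_v)

def contrastive_loss(clean, adv, negatives, tau=0.5):
    # Assumption: InfoNCE-style loss. The clean/adversarial pair is the
    # positive; representations of other examples act as negatives.
    pos = math.exp(cosine(clean, adv) / tau)
    neg = sum(math.exp(cosine(clean, n) / tau) for n in negatives)
    return -math.log(pos / (pos + neg))

def total_loss(emb, y, w, b, negatives, eps=0.1, lam=0.5):
    # Clean loss + adversarial loss + weighted contrastive term.
    adv = fgsm_perturb(emb, w, b, y, eps)
    ce_clean = bce(predict(emb, w, b), y)
    ce_adv = bce(predict(adv, w, b), y)
    loss = ce_clean + ce_adv + lam * contrastive_loss(emb, adv, negatives)
    return loss, ce_clean, ce_adv

# Hypothetical toy values: one 3-dimensional embedding with label 1.
emb, w, b, y = [0.5, -0.2, 0.1], [1.0, -2.0, 0.5], 0.0, 1
negatives = [[-0.5, 0.3, 0.2]]
loss, ce_clean, ce_adv = total_loss(emb, y, w, b, negatives)
print(loss, ce_clean, ce_adv)
```

In this sketch the perturbation moves the embedding in the direction that increases the classification loss, so the adversarial loss exceeds the clean loss, while the contrastive term penalizes the clean and perturbed representations for drifting apart. In an actual fine-tuning setup the gradient would come from automatic differentiation through the full model rather than from a closed-form expression.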