SimEmotion: A Simple Knowledgeable Prompt Tuning Method for Image Emotion Classification

Indexed by：

CPCI-S EI Scopus

Abstract：

Image emotion classification is an important computer vision task that extracts emotions from images. State-of-the-art methods for image emotion classification are primarily based on proposing new architectures and fine-tuning them on pre-trained Convolutional Neural Networks. Recently, learning transferable visual models from natural language supervision, as in CLIP, has shown great success in zero-shot settings thanks to easily accessible web-scale training data. In this paper, we present a conceptually simple yet empirically powerful framework for supervised image emotion classification, SimEmotion, which effectively leverages the rich image and text semantics captured in CLIP. Specifically, we propose a prompt-based fine-tuning strategy to learn task-specific representations while preserving the knowledge contained in CLIP. Because image emotion classification tasks lack text descriptions, sentiment-level concepts and entity-level information are introduced to enrich text semantics, forming knowledgeable prompts and avoiding the considerable bias introduced by fixed designed prompts, which further improves the model's ability to distinguish emotion categories. Evaluations on four widely used affective datasets, namely Flickr and Instagram (FI), EmotionROI, Twitter I, and Twitter II, demonstrate that the proposed algorithm outperforms state-of-the-art methods by a large margin (i.e., a 5.27% absolute accuracy gain on FI) on image emotion classification tasks.
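
The idea of a knowledgeable prompt described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the eight emotion labels, the label-to-sentiment mapping, the prompt template, and the helper names `build_prompt`, `cosine`, and `classify` are hypothetical, and the toy vectors stand in for embeddings that a CLIP image and text encoder would produce.

```python
import math

# Assumed label set and sentiment mapping (illustrative only; the abstract
# does not list the emotion categories or the prompt wording).
SENTIMENT = {
    "amusement": "positive", "awe": "positive",
    "contentment": "positive", "excitement": "positive",
    "anger": "negative", "disgust": "negative",
    "fear": "negative", "sadness": "negative",
}


def build_prompt(emotion, entities):
    """Combine entity-level and sentiment-level cues into one text prompt.

    The template is a hypothetical example of enriching a class prompt
    with extra semantics instead of using a single fixed sentence.
    """
    sentiment = SENTIMENT[emotion]
    if entities:
        scene = "a photo containing " + ", ".join(entities)
    else:
        scene = "a photo"
    return f"{scene}, it seems to express a {sentiment} feeling like {emotion}"


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def classify(image_vec, prompt_vecs):
    """Return the label whose prompt embedding is closest to the image embedding."""
    return max(prompt_vecs, key=lambda label: cosine(image_vec, prompt_vecs[label]))


if __name__ == "__main__":
    # Entities would come from an object detector or tagger in practice (assumption).
    print(build_prompt("awe", ["mountain", "lake"]))
    # Toy 2-D vectors standing in for encoder outputs.
    print(classify([1.0, 0.0], {"awe": [0.9, 0.1], "fear": [0.0, 1.0]}))
```

In a supervised setting such as the one the abstract describes, the image and text encoders would be fine-tuned on these image and prompt pairs rather than used frozen; the sketch only shows how per-image prompts might be assembled and matched.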

Keyword：

Fine-tuning; Prompt tuning; Image emotion classification

Author Affiliations：

[1] [Deng, Sinuo] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[2] [Shi, Ge] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[3] [Wu, Lifang] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[4] [Xing, Lehao] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[5] [Hu, Wenjin] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[6] [Zhang, Heng] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[7] [Xiang, Ye] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China

Email：

lfwu@bjut.edu.cn

Source ：

DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT III

ISSN： 0302-9743

Year： 2022

Page： 222-229

Cited Count：

WoS CC Cited Count： 7

SCOPUS Cited Count： 12

ESI Highly Cited Papers on the List： 0
