Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Bera, Asish

DSpace Home
→
BITS Faculty Publications
→
Department of Computer Science and Information Systems
→
View Item

dc.contributor.author	Bera, Asish
dc.date.accessioned	2023-01-12T10:31:30Z
dc.date.available	2023-01-12T10:31:30Z
dc.date.issued	2021
dc.identifier.uri	https://ojs.aaai.org/index.php/AAAI/article/view/16176
dc.identifier.uri	http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/8476
dc.description.abstract	Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative object pose and parts information for image recognition. For fine-grained recognition, context-aware rich feature representation of object/scene plays a key role since it exhibits a significant variance in the same subcategory and subtle variance among different subcategories. Finding the subtle variance that fully characterizes the object/scene is not straightforward. To address this, we propose a novel context-aware attentional pooling (CAP) that effectively captures subtle changes via sub-pixel gradients, and learns to attend informative integral regions and their importance in discriminating different subcategories without requiring the bounding-box and/or distinguishable part annotations. We also introduce a novel feature encoding by considering the intrinsic consistency between the informativeness of the integral regions and their spatial structures to capture the semantic correlation among them. Our approach is simple yet extremely effective and can be easily applied on top of a standard classification backbone network. We evaluate our approach using six state-of-the-art (SotA) backbone networks and eight benchmark datasets. Our method significantly outperforms the SotA approaches on six datasets and is very competitive with the remaining two	en_US
dc.language.iso	en	en_US
dc.publisher	Association for the Advancement of Artificial Intelligence	en_US
dc.subject	Computer Science	en_US
dc.subject	Scene Analysis & Understanding	en_US
dc.subject	Applications	en_US
dc.subject	Image and Video Retrieval	en_US
dc.subject	Object Detection & Categorization	en_US
dc.title	Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification	en_US
dc.type	Article	en_US

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Department of Computer Science and Information Systems [1099]

Show simple item record

Search DSpace

Advanced Search

Browse

All of DSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects

Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification

Files in this item

This item appears in the following Collection(s)

Search DSpace

Browse

All of DSpace

This Collection

My Account