Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/8476
Full metadata record
DC Field | Value | Language
dc.contributor.author | Bera, Asish | -
dc.date.accessioned | 2023-01-12T10:31:30Z | -
dc.date.available | 2023-01-12T10:31:30Z | -
dc.date.issued | 2021 | -
dc.identifier.uri | https://ojs.aaai.org/index.php/AAAI/article/view/16176 | -
dc.identifier.uri | http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/8476 | -
dc.description.abstract | Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative object pose and parts information for image recognition. For fine-grained recognition, context-aware rich feature representation of object/scene plays a key role since it exhibits a significant variance in the same subcategory and subtle variance among different subcategories. Finding the subtle variance that fully characterizes the object/scene is not straightforward. To address this, we propose a novel context-aware attentional pooling (CAP) that effectively captures subtle changes via sub-pixel gradients, and learns to attend informative integral regions and their importance in discriminating different subcategories without requiring the bounding-box and/or distinguishable part annotations. We also introduce a novel feature encoding by considering the intrinsic consistency between the informativeness of the integral regions and their spatial structures to capture the semantic correlation among them. Our approach is simple yet extremely effective and can be easily applied on top of a standard classification backbone network. We evaluate our approach using six state-of-the-art (SotA) backbone networks and eight benchmark datasets. Our method significantly outperforms the SotA approaches on six datasets and is very competitive with the remaining two. | en_US
dc.language.iso | en | en_US
dc.publisher | Association for the Advancement of Artificial Intelligence | en_US
dc.subject | Computer Science | en_US
dc.subject | Scene Analysis & Understanding | en_US
dc.subject | Applications | en_US
dc.subject | Image and Video Retrieval | en_US
dc.subject | Object Detection & Categorization | en_US
dc.title | Context-aware Attentional Pooling (CAP) for Fine-grained Visual Classification | en_US
dc.type | Article | en_US
Appears in Collections: Department of Computer Science and Information Systems
