
Please use this identifier to cite or link to this item:
http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/19225
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Sinha, Yash | - |
dc.date.accessioned | 2025-08-25T09:30:26Z | - |
dc.date.available | 2025-08-25T09:30:26Z | - |
dc.date.issued | 2024-10 | - |
dc.identifier.uri | https://arxiv.org/abs/2410.17050 | - |
dc.identifier.uri | http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/19225 | - |
dc.description.abstract | The key components of machine learning are data samples for training, model for learning patterns, and loss function for optimizing accuracy. Analogously, unlearning can potentially be achieved through anti-data samples (or anti-samples), unlearning method, and reversed loss function. While prior research has explored unlearning methods and reversed loss functions, the potential of anti-samples remains largely untapped. In this paper, we introduce UnSTAR: Unlearning with Self-Taught Anti-Sample Reasoning for large language models (LLMs). Our contributions are threefold; first, we propose a novel concept of anti-sample-induced unlearning; second, we generate anti-samples by leveraging misleading rationales, which help reverse learned associations and accelerate the unlearning process; and third, we enable fine-grained targeted unlearning, allowing for the selective removal of specific associations without impacting related knowledge - something not achievable by previous works. Results demonstrate that anti-samples offer an efficient, targeted unlearning strategy for LLMs, opening new avenues for privacy-preserving machine learning and model modification. | en_US |
dc.language.iso | en | en_US |
dc.subject | Computer Science | en_US |
dc.subject | Machine unlearning | en_US |
dc.subject | Anti-samples | en_US |
dc.subject | Large language models (LLMs) | en_US |
dc.subject | Privacy-preserving machine learning | en_US |
dc.title | Unstar: unlearning with self-taught anti-sample reasoning for ILMS | en_US |
dc.type | Plan or blueprint | en_US |
Appears in Collections: | Department of Computer Science and Information Systems |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.