On-Device Generative AI: The Need, Architectures, and Challenges

dc.contributor.authorChamola, Vinay
dc.date.accessioned2025-01-03T04:32:57Z
dc.date.available2025-01-03T04:32:57Z
dc.date.issued2024-12
dc.description.abstractThe area of Generative Artificial Intelligence (GenAI) is rapidly expanding, as seen by the regular release of new models and applications every few months. While these GenAI models have impressive capabilities, their computational intensity has presented issues, especially in applications demanding low latency. Hence, substantial research is being conducted to develop ways to scale down these models so that they may be used for on-device computing on edge devices. Examining successful examples of GenAI models implemented on mobile devices with minimum latency becomes critical in understanding the practical consequences of these breakthroughs. Notable instances, such as the deployment of Diffusion-based GenAI models on flagship smartphones like Samsung S23 Ultra and iPhone 14, demonstrate the possibility and promise of bringing GenAI applications to consumers' fingertips. We further analyze and find out the approaches and strategies that make these on-device deployments successful.en_US
dc.identifier.urihttps://ieeexplore.ieee.org/abstract/document/10804065
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16683
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.subjectEEEen_US
dc.subjectComputational modelingen_US
dc.subjectData modelsen_US
dc.subjectAdaptation modelsen_US
dc.subjectOptimizationen_US
dc.subjectTrainingen_US
dc.titleOn-Device Generative AI: The Need, Architectures, and Challengesen_US
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: