DSpace Repository

Which LLM should I use?": Evaluating LLMs for tasks performed by Undergraduate Computer Science Students

Show simple item record

dc.contributor.author Kumar, Dhruv
dc.date.accessioned 2024-08-12T11:18:53Z
dc.date.available 2024-08-12T11:18:53Z
dc.date.issued 2024-04
dc.identifier.uri https://arxiv.org/abs/2402.01687
dc.identifier.uri http://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/15213
dc.description.abstract This study evaluates the effectiveness of various large language models (LLMs) in performing tasks common among undergraduate computer science students. Although a number of research studies in the computing education community have explored the possibility of using LLMs for a variety of tasks, there is a lack of comprehensive research comparing different LLMs and evaluating which LLMs are most effective for different tasks. Our research systematically assesses some of the publicly available LLMs such as Google Bard, ChatGPT(3.5), GitHub Copilot Chat, and Microsoft Copilot across diverse tasks commonly encountered by undergraduate computer science students in India. These tasks include code explanation and documentation, solving class assignments, technical interview preparation, learning new concepts and frameworks, and email writing. Evaluation for these tasks was carried out by pre-final year and final year undergraduate computer science students and provides insights into the models' strengths and limitations. This study aims to guide students as well as instructors in selecting suitable LLMs for any specific task and offers valuable insights on how LLMs can be used constructively by students and instructors. en_US
dc.language.iso en en_US
dc.subject Computer Science en_US
dc.subject Large Language Models (LLMs) en_US
dc.subject ChatGPT(3.5) en_US
dc.title Which LLM should I use?": Evaluating LLMs for tasks performed by Undergraduate Computer Science Students en_US
dc.type Preprint en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account