A Comparative Performance Analysis of Locally Deployed Large Language Models through a Retrieval-Augmented Generation Educational Assistant Application for Textual Data Extraction
Background: Rapid advancements in large language models (LLMs) have significantly enhanced Retrieval-Augmented Generation (RAG) techniques, leading to more accurate and context-aware information retrieval systems.
Methods: This article presents a RAG-based chatbot tailored to university course catalogs, designed to answer queries about course details and other essential academic information, and investigates its performance across several locally deployed large language models. By leveraging multiple LLM architectures, we evaluate the models under test in terms of context length, embedding size, computational efficiency, and response relevance.
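To make the methods concrete, the sketch below shows one minimal way such a catalog RAG pipeline over locally deployed models could be assembled. It is an illustration under stated assumptions, not the study's actual implementation: the embedding model ("all-MiniLM-L6-v2"), the Ollama endpoint and model tag ("llama3"), and the catalog snippets are all hypothetical placeholders.

    import requests
    import numpy as np
    from sentence_transformers import SentenceTransformer

    # Hypothetical course-catalog snippets standing in for the parsed catalog text.
    CATALOG_CHUNKS = [
        "CS 101: Introduction to Programming. 3 credits. Offered fall and spring.",
        "CS 310: Databases. Prerequisite: CS 101. 4 credits. Offered spring only.",
        "Registration deadline for fall semester courses is August 15.",
    ]

    # Assumed embedding model; the paper's actual embedding choice may differ.
    embedder = SentenceTransformer("all-MiniLM-L6-v2")
    chunk_vecs = embedder.encode(CATALOG_CHUNKS, normalize_embeddings=True)

    def retrieve(query, k=2):
        """Return the k catalog chunks most similar to the query."""
        q = embedder.encode([query], normalize_embeddings=True)[0]
        scores = chunk_vecs @ q  # cosine similarity (vectors are normalized)
        return [CATALOG_CHUNKS[i] for i in np.argsort(scores)[::-1][:k]]

    def answer(query, model="llama3"):
        """Ask a locally served Ollama model, grounded in retrieved context."""
        context = "\n".join(retrieve(query))
        prompt = (
            f"Answer using only this catalog excerpt:\n{context}\n\n"
            f"Question: {query}"
        )
        r = requests.post(
            "http://localhost:11434/api/generate",  # default local Ollama endpoint
            json={"model": model, "prompt": prompt, "stream": False},
            timeout=120,
        )
        return r.json()["response"]

    print(answer("What are the prerequisites for the databases course?"))

Swapping the model tag passed to answer() is what allows the same pipeline to be rerun against each locally deployed LLM under comparison.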
Results: The experimental analysis, which builds on recent comparative studies, reveals that while larger models achieve higher relevance scores, they incur longer response times than smaller, more efficient models.
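The kind of latency comparison summarized above can be reproduced in outline with a simple timing harness wrapping the hypothetical answer() helper from the earlier sketch; the model tags and query are placeholders, and mean wall-clock seconds per query is only a crude proxy for the efficiency metrics used in the study.

    import time

    def benchmark(models, queries):
        """Print mean wall-clock response time per model over a fixed query set."""
        for model in models:
            start = time.perf_counter()
            for q in queries:
                answer(q, model=model)  # answer() from the retrieval sketch above
            mean_s = (time.perf_counter() - start) / len(queries)
            print(f"{model}: {mean_s:.2f} s/query")

    # Placeholder model tags and query; larger models would be expected to show
    # higher latency, mirroring the trade-off reported in the results.
    benchmark(["llama3:70b", "llama3:8b", "phi3:mini"], ["When is CS 101 offered?"])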
Conclusions: The findings underscore the importance of balancing accuracy and efficiency for real-time educational applications. Overall, this work contributes to the field by offering insights into optimal RAG configurations and practical guidelines for deploying AI-powered educational assistants.