Improving large language model (LLM) performance with retrieval augmented generation (RAG): development of a transparent generative artificial intelligence (GEN AI) university support system for educational purposes

Nishitha Chidipothu; Jim Samuel; Julia Esguerra; Rick Anderson; Alexander Pelaez; Md Nurul Hoque

doi:10.54116/jbdai.v3i1.50

Back

Improving large language model (LLM) performance with retrieval augmented generation (RAG): development of a transparent generative artificial intelligence (GEN AI) university support system for educational purposes

Preprint

Open access

Improving large language model (LLM) performance with retrieval augmented generation (RAG): development of a transparent generative artificial intelligence (GEN AI) university support system for educational purposes

Nishitha Chidipothu, Jim Samuel, Julia Esguerra, Rick Anderson, Alexander Pelaez and Md Nurul Hoque

12/01/2024

DOI: https://doi.org/10.54116/jbdai.v3i1.50

Abstract

This study works on the development of a generative artificial intelligence (AI) university support system (GenAI-USS) by improvising retrieval augmented generation (RAG) architecture to improve the performance of large language models (LLM) in a way that supports stepwise transparency. We aim to achieve better transparency and flexibility, and improved accuracy of responses to queries based on university data assimilated from university webpages and knowledge sources. We use RAG to develop a plug-and-play mechanism, along with prompt selection to boost LLM accuracy. One of the key components in our GenAI-USS is the capture and integration of real-time information via live retrieval into the generative AI process. This domain-specific knowledge assimilation with real-time updates to capture changes and new information serves as a specialized dynamic expert knowledge database for RAG. Our RAG mechanism pulls in relevant, up-to-date information from the dynamic database, which pulls real-time data from targeted predetermined knowledge sources. The other key component in our GenAI-USS design is the deliberately designed information processing visibility at each stage of the process to ensure full transparency, and this includes the following: overview, data collection, storage encoding, testing, chatbot interaction, and search. The testing module allows for interactive viewing of generated responses and their sources. Our strategy is expected to lead to higher-quality AI-generated output via targeted information retrieval, hallucination mitigation, accuracy improvement, and timely data updates. Essentially, on the submission of a query, the RAG-dependent GenAI-USS first identifies the most relevant information from the specialized expert knowledge database and then factors this into the generative AI response development process. This results in a successful implementation of our primary objectives of a transparent and flexible user-choice–driven RAG-based generative AI system, which also provided heuristically notable improvements in the quality of output produced.

Files and links (3)

pdf

JBDai-3-1-Chidipothu-Gen-AI-University-RAG-LLM-202516.09 MBDownload View

Version of Record (VoR) Journal of big data and artificial intelligence Open Access

url

https://doi.org/10.54116/jbdai.v3i1.50View

Version of Record (VoR) Journal of big data and artificial intelligence Open

url

Report an accessibility issueView

Please complete a content remediation request to report an accessibility issue with a library electronic resource, website, or service.

Metrics

6640 File downloads

307 Record Views

Details

Title: Subtitle: Improving large language model (LLM) performance with retrieval augmented generation (RAG): development of a transparent generative artificial intelligence (GEN AI) university support system for educational purposes
Creators: Nishitha Chidipothu (Author) - Rutgers, The State University of New Jersey
Jim Samuel (Author) - Rutgers University, Bloustein School of Planning and Public Policy - Faculty Affairs
Julia Esguerra (Author)
Rick Anderson (Author)
Alexander Pelaez (Author)
Md Nurul Hoque (Author)
Date published: 12/01/2024
Academic Unit: Bloustein School of Planning and Public Policy - Faculty Affairs
Language: English
Resource Type: Preprint
Identifiers: 991032165917804646