Retrieval Augmented Generation for Ukrainian Government Services: A Comparative Evaluation of the Approaches
| dc.contributor.advisor | Курочкін, Андрій | uk_UA |
| dc.contributor.author | Маринич, Антон | uk_UA |
| dc.date.accessioned | 2025-09-05T06:53:49Z | |
| dc.date.available | 2025-09-05T06:53:49Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Retrieval Augmented Generation or RAG is a method that is used to improve the quality of retrieval for LLMs, to avoid hallucinations and be aware of all the changes in the data. This approach integrates LLMs with external data source by building a vector index. This thesis presents a comprehensive study on how different RAG approaches perform in Ukrainian Governmental Services domain. I establish a non-RAG baseline using GPT-4.1-mini model and iteratively perform tests on different configurations of RAG approaches. I have also created a dataset with 500 open questions about Ukrainian Governmental Services using GPT-o4-minihigh model. My best results comparing to the baseline are 13.25% improvement in LLM Judge Score using CRAG with Hypothetical Document Embedding and Reranking and 10% improvement on Factual Correctness using CRAG with Reranking. | en_US |
| dc.identifier.uri | https://ekmair.ukma.edu.ua/handle/123456789/36460 | |
| dc.language.iso | en_US | en_US |
| dc.status | first published | en_US |
| dc.subject | Retrieval Augmented Generation (RAG) | en_US |
| dc.subject | Large language models (LLMs) | en_US |
| dc.subject | hallucinations | en_US |
| dc.subject | Ukrainian Governmental Services domain | en_US |
| dc.subject | bachelor`s thesis | en_US |
| dc.title | Retrieval Augmented Generation for Ukrainian Government Services: A Comparative Evaluation of the Approaches | en_US |
| dc.type | Other | en_US |
Files
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: