As one of the most advanced techniques in AI, Retrieval-Augmented Generation (RAG) can provide reliable and up-to-date external knowledge, offering significant convenience for numerous tasks. Particularly in the era of AI-generated content (AIGC), RAG's powerful retrieval capacity provides additional knowledge, enabling retrieval-augmented generation to assist existing generative AI in producing high-quality outputs. Recently, Large Foundation Models (LFMs) have demonstrated revolutionary abilities in language understanding and generation, while still facing inherent limitations such as hallucinations and out-of-date internal knowledge. Given RAG's ability to provide the latest and most helpful auxiliary information, retrieval-augmented large foundation models have emerged to harness external, authoritative knowledge bases rather than solely relying on a model's internal knowledge to improve generation quality.
In this tutorial, we comprehensively review existing research studies in retrieval-augmented large foundation models (RA-LFMs), covering three primary technical perspectives: architectures, training strategies, and applications. As preliminary knowledge, we briefly introduce the foundations and recent advances of LFMs. Then, to illustrate the practical significance of RAG for LFMs, we categorize mainstream relevant work by application area, detailing the challenges of each and the corresponding capabilities of RA-LFMs. Finally, to deliver deeper insights, we discuss current limitations and several promising directions for future research.
Related prior tutorials: KDD 2024 and ICDE 2025
Slides: PAKDD 2026 Tutorial Slides
This tutorial targets college students, academic researchers, and industrial practitioners interested in Retrieval-Augmented Generation (RAG) and Large Foundation Models (LFMs). Participants are expected to have basic knowledge of artificial intelligence, language models, and retrieval techniques. The tutorial is designed to be accessible to junior/senior undergraduate level audiences while remaining valuable to experienced researchers and engineers who want a structured entry point. After attending, participants are expected to gain a comprehensive understanding of RA-LFMs and learn how to design solutions for customized problems.
PAKDD 2026 Tutorial Session (Date and time: June 9, 2026, 13:30-17:00)
The topics of this tutorial include (but are not limited to) the following:
The tutorial outline is shown below:
PhD Student
The Hong Kong Polytechnic University
Research Assistant Professor
The Hong Kong Polytechnic University
PhD Student
The Hong Kong Polytechnic University
PhD Student
The Hong Kong Polytechnic University
MPhil Student
The Hong Kong Polytechnic University
Assistant Professor
National University of Defense Technology
Assistant Professor
The Hong Kong Polytechnic University
Chair Professor and Head
The Hong Kong Polytechnic University