Unlocking the Untapped Potential of Retrieval-Augmented Era (RAG) Pipelines | by Saleh Alkhalifa | Dec, 2024

Important metrics and strategies to reinforce efficiency throughout retrieval, era, and end-to-end pipelines

Introduction

Once we consider among the commonest functions of Generative AI, Retrieval-Augmented Era (RAG) has definitely surfaced to develop into of the commonest matters of dialogue inside this area. Not like conventional engines like google that relied on optimizing retrieval mechanisms utilizing key phrase searches to seek out related data for a given question, RAG goes a step additional in producing a well-rounded reply for a given query utilizing the retrieved content material.

The determine under illustrates a graphical illustration of RAG wherein paperwork of curiosity are encoded utilizing an embedding mannequin, and are then listed and saved in a vector retailer. When a question is submitted, it’s usually embedded in an analogous method, adopted by two steps (1) the retrieval step that searches for comparable paperwork, after which (2) a generative step that makes use of the retrieved content material to synthesize a response.