Adaptive RAG 🤹‍♂️
Adaptive RAG is a technique that combines query analysis with self-corrective RAG.
For query analysis, it uses a small classifier (an LLM) to judge the query's complexity. Based on this analysis, the query is routed to one of several retrieval strategies: No retrieval, Single-shot RAG, or Iterative RAG.
Here's a code snippet for query analysis:
from typing import Literal

from langchain_core.prompts import ChatPromptTemplate
from langchain_core.pydantic_v1 import BaseModel, Field
from langchain_openai import ChatOpenAI
class RouteQuery(BaseModel):
    """Route a user query to the most relevant datasource."""

    datasource: Literal["vectorstore", "web_search"] = Field(
        ...,
        description="Given a user question choose to route it to web search or a vectorstore.",
    )
# LLM with function call
llm = ChatOpenAI(model="gpt-3.5-turbo-0125", temperature=0)
structured_llm_router = llm.with_structured_output(RouteQuery)
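To actually get a routing decision, the structured router can be chained behind a prompt. This is a minimal usage sketch, not part of the original snippet; the system prompt wording is an assumption you should adapt to your own data:
# Assumed system prompt: adjust the vectorstore description to match your documents
system = (
    "You are an expert at routing a user question to a vectorstore or web search. "
    "The vectorstore contains documents about RAG techniques. "
    "Use the vectorstore for questions on these topics; otherwise use web_search."
)
route_prompt = ChatPromptTemplate.from_messages(
    [("system", system), ("human", "{question}")]
)

# Chain the prompt into the structured router defined above
question_router = route_prompt | structured_llm_router
route = question_router.invoke({"question": "How does adaptive RAG work?"})
print(route.datasource)  # expected to be "vectorstore" for an on-topic question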
For defining and querying the retriever:
# add documents to LanceDB
from langchain_community.vectorstores import LanceDB
from langchain_openai import OpenAIEmbeddings

# doc_splits is assumed to be a list of pre-chunked Document objects
vectorstore = LanceDB.from_documents(
    documents=doc_splits,
    embedding=OpenAIEmbeddings(),
)
retriever = vectorstore.as_retriever()
# query using defined retriever
question = "How adaptive RAG works"
docs = retriever.get_relevant_documents(question)
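Putting the two pieces together, the routing decision can select between the retrieval strategies mentioned above. The following is a rough sketch under stated assumptions: it reuses the question_router chain from the query-analysis snippet, treats the web-search branch as a placeholder, and omits the self-corrective grading loop for brevity:
# Route the question, then pick a retrieval strategy based on the decision
route = question_router.invoke({"question": question})

if route.datasource == "vectorstore":
    # Single-shot RAG: answer over the context retrieved from LanceDB
    context = "\n\n".join(d.page_content for d in docs)
    answer = llm.invoke(
        f"Answer the question using only this context:\n\n{context}\n\nQuestion: {question}"
    )
else:
    # Placeholder for the web-search branch (e.g. call a search tool, then answer)
    answer = llm.invoke(question)

print(answer.content)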