RAG LLM Blueprint

This blueprint is a one click to deploy a RAG pipeline for inference using LLM and updater webapp connected to MinIO / S3 storage solution. User needs to have three external services online before using this blueprint i.e a Large language model, an Elastic Search service and MinIO / S3 storage. User will use the RAG endpoint for inference, which in turn will connect with Elastic Search Service to retrieve latest documents and LLM to generate relevant answers. The Elastic Search document index will be updated using the webapp that will be deployed along with the RAG endpoint. User needs to update their latest documents in the storage bucket and it will be directly updated in the elastic search document index.

For more information about this blueprint you can refer to this blog.

Technical pre-requisite

Setup ElasticSearch
Retrieve the access key either from the llm model hosted on the cnvrg platform or obtain it from OpenAI / Hugging Face.
Setup a MinIO / S3 bucket.
If the user sets-up an S3 bucket, they also need to configure the SNS queue to capture the object created event.

Walkthrough: Configuring a bucket for notifications
- Step 1: Create an Amazon SQS queue
- Step 2: Create an Amazon SNS topic
- Step 3: Add a notification configuration to your bucket
- Step 4: Test the setup
Refer to the link for more information.

Flow

Click on Use Blueprint button.
You will be redirected to your blueprint flow page.
Go to the project settings section and update the environment variables with relevant information that will be used by the RAG endpoint.

For more info see the component documentations
- RAG endpoint documentation
- Listener/Updator documentation

You will have to update the task 'updator' to provide relevant information regarding your storage solution and elastic search service.
Click on the ‘Run Flow’ button.
In a few minutes you will have a RAG endpoint and a webapp deployed to update your elastic search service.
Go to the ‘Serving’ tab in the project and look for your endpoint.
You can use the Try it Live section to query the RAG endpoint and generate relevant answers with LLM connected.
You can also integrate your API with your code using the integration panel at the bottom of the page.

Note: A slim version of RAG is also available as a blueprint that enables the use of RAG locally on cnvrg, without the use of Minio / S3 and elasticsearch. Check it out FastRAG slim blueprint for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
app		app
predict		predict
README.md		README.md
blueprint.yaml		blueprint.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG LLM Blueprint

Technical pre-requisite

Flow

About

Releases

Packages

Languages

cnvrg/FastRAG

Folders and files

Latest commit

History

Repository files navigation

RAG LLM Blueprint

Technical pre-requisite

Flow

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages