Skip to content

cnvrg/FastRAG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

RAG LLM Blueprint

This blueprint is a one click to deploy a RAG pipeline for inference using LLM and updater webapp connected to MinIO / S3 storage solution. User needs to have three external services online before using this blueprint i.e a Large language model, an Elastic Search service and MinIO / S3 storage. User will use the RAG endpoint for inference, which in turn will connect with Elastic Search Service to retrieve latest documents and LLM to generate relevant answers. The Elastic Search document index will be updated using the webapp that will be deployed along with the RAG endpoint. User needs to update their latest documents in the storage bucket and it will be directly updated in the elastic search document index.

For more information about this blueprint you can refer to this blog.

Technical pre-requisite

  1. Setup ElasticSearch
  2. Retrieve the access key either from the llm model hosted on the cnvrg platform or obtain it from OpenAI / Hugging Face.
  3. Setup a MinIO / S3 bucket.
  4. If the user sets-up an S3 bucket, they also need to configure the SNS queue to capture the object created event.
  • Walkthrough: Configuring a bucket for notifications

    • Step 1: Create an Amazon SQS queue
    • Step 2: Create an Amazon SNS topic
    • Step 3: Add a notification configuration to your bucket
    • Step 4: Test the setup
  • Refer to the link for more information.

Flow

  1. Click on Use Blueprint button.
  2. You will be redirected to your blueprint flow page.
  3. Go to the project settings section and update the environment variables with relevant information that will be used by the RAG endpoint.
  1. You will have to update the task 'updator' to provide relevant information regarding your storage solution and elastic search service.
  2. Click on the ‘Run Flow’ button.
  3. In a few minutes you will have a RAG endpoint and a webapp deployed to update your elastic search service.
  4. Go to the ‘Serving’ tab in the project and look for your endpoint.
  5. You can use the Try it Live section to query the RAG endpoint and generate relevant answers with LLM connected.
  6. You can also integrate your API with your code using the integration panel at the bottom of the page.

Note: A slim version of RAG is also available as a blueprint that enables the use of RAG locally on cnvrg, without the use of Minio / S3 and elasticsearch. Check it out FastRAG slim blueprint for more information.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published