The Simple Principle Behind Retrieval Augmented Generation in…

Author(s): Krupesh Raikar

Originally published on Towards AI.

Understand RAG intuitively and implement a chat pipeline with your documents using LangChain and Llamma v2
Photo by Mika Baumeister on Unsplash

In a timeframe that can only be best described as a blink of an eye, large language models have exploded in the general public consciousness.

Even if you have nothing to do with tech, you (and your grandma) have heard of ChatGPT!

It’s easy, accessible, and mighty — even on the free tier.

ChatGPT is pretty good at tackling general questions like:

What is the speed of a rock in free fall from a height of 10 meters?

But what if you want it to calculate something proprietary, like:

What is the exact trajectory of landing a SpaceX rocket?

In the first case, it gets the answer correct (14 m/s in case you were wondering), but in the second case…

IT FAILS.

Were you a SpaceX employee, you wouldn’t want to risk inputting your proprietary data into an external LLM — that could be a big security risk for the organization!

What do you do in such a case where proprietary documents are involved?Wouldn’t it be wonderful if you could pose questions to your documents too?

Well, one of the methods is to train an LLM with your data.

If you wish to do that, I hope you have a billion dollars in the bank, or you… Read the full blog for free on Medium.

Published via Towards AI

Source link

AI Gumbo

The Simple Principle Behind Retrieval Augmented Generation in… – Towards AI

Author(s): Krupesh Raikar

About The Author

AIGumbo.crew

Leave a Reply Cancel reply

Author(s): Krupesh Raikar

You may also like

About The Author

Leave a Reply Cancel reply