RAG (Retrieval augmented generation) is a technique to search for relevant documents before asking the LLM to generate a natural answer.
For what I understand:
- load the documents (from files, scraping, sql queries, etc)
- split into chunks of text
- generate synthetic questions for each chunk
- embed the questions (e.g., using OpenAI embeddings)
- save to a vector database
Then when the user sends a message, we embed the question (user message) and search the embeddings (e.g., cosine similarity, knn).
So I heard about Semantic Search, apparently it understands the context of what the user is asking, can find the perfect amount of context to answer. I searched about it, I found this research on github: Semantic Graphs for Generating Deep Questions and found Graph Databases and several articles about Semantic Search using Knowledge Graphs.
I'm assuming that semantic search is some kind of search in graph database that knows how to retrieve the right amount of context.
I tried to find some existing services for semantic search and AWS came up: Semantic search in Amazon OpenSearch Service. But it seems to be just a vector search with extra sauce, not really a semantic search using knowledge graph in a graph database.
Now I'm confused if semantic search is actually the same as vector search. And I have no idea on how to use a Graph Database and why they're not using for semantic search in AWS
Starting from version 2.9+, OpenSearch does support semantic searches using models deployed either within the cluster, pre-trained models, and models running outside the cluster such as Amazon SageMaker, Amazon Bedrock, etc. To use this feature, the first step is registering the model that will augment the search with a semantic (meaning oriented) value, which will be used during the search execution to find which documents match the search criteria, using the vectors from the text embedding.
For this to work, you don't need to use graph-oriented databases. Graph databases are useful for finding data over complex relationships on highly connected datasets, like trees and heaps. Any dataset provided by the model would work. With this said, you can surely use a graph database to enrich your model with complex relationships.
For more information about the support of semantic search in OpenSearch:
https://opensearch.org/docs/latest/search-plugins/semantic-search/