Vector search
Advanced Managed Search supports high-dimensional embedding–based similarity search by using the vector search capability of OpenSearch. Beyond traditional keyword search, it enables proximity-based search using embedding vectors derived from text, images, audio, and other data.
Vector search can be used in various machine learning and AI–based services such as recommendation systems, semantic search, and natural language query processing.
Overview of vector search
What is a vector?
A vector is a high-dimensional numeric array created using machine learning models or embedding techniques.
These vectors contain information about semantic similarity or distance between data, enabling similar data retrieval based on those relationships.
Characteristics of vector search
With vector search in Advanced Managed Search, you can use the following capabilities.
-
Similarity search
- Finds the closest results by calculating distance or similarity between a query vector and stored data vectors.
-
High-dimensional space exploration
- Supports searching using various distance or similarity metrics such as Euclidean distance and cosine similarity.
-
AI and machine learning integration
- Semantic search using NLP and image embeddings
- Candidate recommendation based on user–item similarity in recommendation systems
Vector search workflow
Vector search generally follows this workflow.
-
Generate data embeddings
- Convert text, image, or audio data into vectors
- Example: Convert a sentence into a 768-dimensional vector using an NLP model
-
Index vectors
- Index documents containing embedding vectors
- Map the relevant field as a vector type for search
-
Run vector query
- Generate a query vector used as the search reference
- Search for documents with the most similar vectors
-
Return and post-process results
- Perform post-processing such as sorting and filtering
Vector search use cases
- Semantic text search
- Similar image search based on image embeddings
- Recommendation filtering based on user behavior or preference similarity
- Query expansion or natural language–based search
Detailed usage of vector search
For more detailed usage methods and examples such as writing vector queries, mapping settings, and index structures, refer to the OpenSearch official documentation.