SQL-Based Vector Stores

SQL-based vector store components provide vector storage capabilities using traditional SQL databases with vector extensions, combining familiar SQL operations with modern vector search.

PGVector

PGVector provides vector storage and similarity search capabilities using PostgreSQL with the pgvector extension.

Usage

PGVector features:

  • PostgreSQL integration

  • SQL-based vector operations

  • ACID transactions

  • Complex queries

  • Mature ecosystem

Inputs

Name
Display Name
Info

connection_string

Connection String

PostgreSQL connection string

table_name

Table Name

Table for vector storage

vector_column

Vector Column

Column name for vector data

text_column

Text Column

Column name for text content

metadata_columns

Metadata Columns

Additional metadata columns

Outputs

Name
Display Name
Info

vector_store

Vector Store

Configured PGVector vector store

Redis

Redis provides vector search capabilities through its RediSearch module with vector similarity search.

Usage

Redis vector features:

  • In-memory performance

  • Real-time search

  • Hybrid data structures

  • Pub/sub messaging

  • Cluster support

Inputs

Name
Display Name
Info

host

Host

Redis server host

port

Port

Redis port (default: 6379)

password

Password

Redis authentication password

index_name

Index Name

Vector index name

key_prefix

Key Prefix

Prefix for Redis keys

Outputs

Name
Display Name
Info

vector_store

Vector Store

Configured Redis vector store

Vector Operations

PGVector Capabilities

Distance Functions:

  • <-> Euclidean distance (L2)

  • <#> Negative inner product

  • <=> Cosine distance

  • Custom distance functions

Index Types:

  • IVFFlat: Inverted file with flat compression

  • HNSW: Hierarchical Navigable Small World graphs

  • Custom Indexes: Application-specific indexing

Query Examples:

-- Similarity search
SELECT * FROM documents 
ORDER BY embedding <-> '[0.1,0.2,0.3]' 
LIMIT 10;

-- Threshold filtering
SELECT * FROM documents 
WHERE embedding <-> '[0.1,0.2,0.3]' < 0.5;

-- Hybrid search with metadata
SELECT * FROM documents 
WHERE category = 'technical' 
ORDER BY embedding <-> '[0.1,0.2,0.3]' 
LIMIT 5;

Redis Vector Features

Search Capabilities:

  • Vector Similarity: KNN and range queries

  • Hybrid Search: Combine vector and traditional search

  • Filtering: Pre and post-filtering options

  • Aggregation: Search result aggregation

Data Types:

  • FLOAT32: Standard floating-point vectors

  • FLOAT64: Double precision vectors

  • Binary: Binary vector representations

  • Sparse: Sparse vector support

Advanced Features

PostgreSQL Integration Benefits

ACID Compliance:

  • Atomicity: All-or-nothing transactions

  • Consistency: Data integrity constraints

  • Isolation: Concurrent transaction handling

  • Durability: Persistent data storage

SQL Capabilities:

  • Complex Queries: JOIN operations with vector search

  • Window Functions: Advanced analytical queries

  • CTEs: Common table expressions for complex logic

  • Views: Create vector-enabled views

Ecosystem Integration:

  • ORM Support: Integration with popular ORMs

  • Backup/Recovery: Standard PostgreSQL tools

  • Replication: Streaming and logical replication

  • Extensions: Rich extension ecosystem

Redis Performance Benefits

In-Memory Speed:

  • Sub-millisecond: Ultra-fast query response

  • High Throughput: Handle thousands of queries/second

  • Real-time: Instant updates and searches

  • Low Latency: Consistent performance

Scalability:

  • Cluster Mode: Distributed vector storage

  • Sharding: Automatic data distribution

  • Replication: Master-slave replication

  • Persistence: Configurable persistence options

Use Cases

PGVector Applications

  • Document Management: Full-text and semantic search

  • E-commerce: Product recommendation systems

  • Analytics: Complex analytical queries with vectors

  • Data Warehousing: Integration with existing data infrastructure

Redis Applications

  • Real-time Recommendations: Instant recommendation engines

  • Session Storage: User session and preference vectors

  • Caching: Vector result caching for performance

  • Pub/Sub Systems: Real-time vector updates

Deployment Options

PostgreSQL Deployment

  • Self-hosted: On-premise PostgreSQL instances

  • Cloud Managed: AWS RDS, Google Cloud SQL, Azure Database

  • Container: Docker and Kubernetes deployments

  • Serverless: Aurora Serverless and similar services

Redis Deployment

  • Redis Cloud: Managed Redis service

  • Self-hosted: On-premise Redis instances

  • Container: Docker and Kubernetes deployments

  • Edge: Edge computing deployments

Usage Notes

  • SQL Familiarity: Leverage existing SQL knowledge and tools

  • Transaction Support: ACID compliance for critical applications

  • Performance: Balance between consistency and speed

  • Integration: Easy integration with existing database infrastructure

  • Tooling: Use familiar database administration tools

  • Migration: Smooth migration from traditional databases

Last updated