Commit 4a0c4de

Feat/search method study (#24)

Add simple search method functionality to the package.

1 parent ed99a49 · commit 4a0c4de

18 files changed: +3861 −1386 lines
README.md

Lines changed: 68 additions & 1 deletion

@@ -69,13 +69,14 @@ For complete code examples, see the following notebooks:

| Basic grid study | [00_grid_study.ipynb](https://github.com/redis-applied-ai/redis-retrieval-optimizer/blob/main/docs/examples/grid_study/00_grid_study.ipynb) |
| Custom grid study | [01_custom_grid_study.ipynb](https://github.com/redis-applied-ai/redis-retrieval-optimizer/blob/main/docs/examples/grid_study/01_custom_grid_study.ipynb) |
| Bayesian Optimization | [00_bayes_study.ipynb](https://github.com/redis-applied-ai/redis-retrieval-optimizer/blob/main/docs/examples/bayesian_optimization/00_bayes_study.ipynb) |
+| Search study | [00_search_study.ipynb](https://github.com/redis-applied-ai/redis-retrieval-optimizer/blob/main/docs/examples/search_study/00_search_study.ipynb) |
| Embedding model comparison | [00_comparison.ipynb](https://github.com/redis-applied-ai/redis-retrieval-optimizer/blob/main/docs/examples/comparison/00_comparison.ipynb) |

---

## 🚀 Quick Start

-The Retrieval Optimizer supports two *study* types: **Grid** and **Bayesian Optimization**. Each is suited to a different stage of building a high-quality search system.
+The Retrieval Optimizer supports three *study* types: **Grid**, **Bayesian Optimization**, and **Search Study**. Each is suited to a different stage of building a high-quality search system.

### Grid
@@ -85,6 +86,10 @@ Use a grid study to explore the impact of different **embedding models** and **r

Once you've identified a solid starting point, use Bayesian optimization to **fine-tune your index configuration**. This mode intelligently selects the most promising combinations to test instead of exhaustively testing every option, which is time-consuming. Bayesian optimization mode is especially useful for balancing **cost, speed, and latency** as you work toward a production-ready solution.

+### Search Study
+
+Use a search study when you have an **existing Redis index** and want to quickly test different search methods against it without recreating the index or data. This is ideal for A/B testing search strategies or evaluating custom search methods on production data.

## Running a Grid study

#### Study config
@@ -242,6 +247,68 @@ metrics = run_bayes_study(

---

## Running a Search study

A search study runs a set of search methods against an **existing Redis index**, without recreating the index or reloading its data. This makes it ideal for A/B testing search strategies or evaluating custom search methods on production data.

### Search study config
```yaml
embedding_model:
  dim: 768
  dtype: float32
  embedding_cache_name: vec-cache
  model: sentence-transformers/all-mpnet-base-v2
  type: hf
index_name: cars
queries: "../resources/cars/car_queries.json"
qrels: "../resources/cars/car_qrels.json"
ret_k: 6
search_methods:
  - base_vector
  - pre_filter_vector
study_id: test-search-study
```

### Code
```python
import os

from redis_retrieval_optimizer.search_study import run_search_study
from dotenv import load_dotenv

# load environment variables containing necessary credentials
load_dotenv()

redis_url = os.environ.get("REDIS_URL", "redis://localhost:6379/0")

# Define custom search methods
def gather_vector_results(search_method_input):
    # Your vector search implementation
    pass

def gather_pre_filter_results(search_method_input):
    # Your pre-filtered search implementation
    pass

CUSTOM_SEARCH_METHOD_MAP = {
    "base_vector": gather_vector_results,
    "pre_filter_vector": gather_pre_filter_results
}

metrics = run_search_study(
    config_path="search_study_config.yaml",
    redis_url=redis_url,
    search_method_map=CUSTOM_SEARCH_METHOD_MAP
)
```

### Example output

| search_method     | avg_query_time | recall | precision | ndcg     |
|-------------------|----------------|--------|-----------|----------|
| base_vector       | 0.002605       | 0.9    | 0.23      | 0.717676 |
| pre_filter_vector | 0.001177       | 1.0    | 0.25      | 0.914903 |
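
The search methods registered in the code above are stubs. As a rough sketch only, here is one way `gather_vector_results` could be filled in using redisvl's `VectorQuery`; the attributes assumed on `search_method_input` (`index`, `emb_model`, `query`, `num_results`) and the returned value are illustrative assumptions, not the package's documented interface.

```python
from redisvl.query import VectorQuery

def gather_vector_results(search_method_input):
    # Hypothetical sketch: the attribute names on `search_method_input`
    # are assumptions, not the documented redis-retrieval-optimizer API.
    query_vector = search_method_input.emb_model.embed(
        search_method_input.query, as_buffer=True
    )
    vector_query = VectorQuery(
        vector=query_vector,
        vector_field_name="vector",         # assumed schema field name
        return_fields=["item_id", "text"],  # assumed return fields
        num_results=search_method_input.num_results,
    )
    # Run the KNN query against the existing index and return the raw hits
    return search_method_input.index.query(vector_query)
```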

---

## 🔍 Search Methods

Below is a comprehensive table documenting the built-in search methods available in the Retrieval Optimizer:

RELEASE_NOTES_0.3.0.md

Lines changed: 180 additions & 0 deletions

@@ -0,0 +1,180 @@
# Redis Retrieval Optimizer v0.3.0 Release Notes

## 🎉 What's New in v0.3.0

Redis Retrieval Optimizer v0.3.0 introduces several major features that make it easier than ever to build and optimize high-performance search systems with Redis. This release focuses on enhanced optimization capabilities, improved search methods, and better integration with modern embedding models.

## 🚀 Major New Features

### 🎛️ Threshold Optimization
**NEW**: Automatically tune semantic cache and router thresholds for optimal performance.

The new threshold optimization feature helps you maximize the performance of RedisVL's Semantic Cache and Semantic Router by automatically finding the best distance thresholds. It supports multiple evaluation metrics, including F1 score, precision, and recall.

**Key capabilities:**
- **Cache Threshold Optimization**: Optimize thresholds for semantic caches to improve cache hit rates and relevance
- **Router Threshold Optimization**: Fine-tune route thresholds for semantic routers to improve routing accuracy
- **Multiple Evaluation Metrics**: Support for F1 score, precision, and recall optimization
- **Easy Integration**: Works seamlessly with existing RedisVL SemanticCache and SemanticRouter instances

**Example usage:**
```python
from redis_retrieval_optimizer.threshold_optimization import (
    CacheThresholdOptimizer,
    RouterThresholdOptimizer,  # assumed to live in the same module
)

# Optimize cache threshold (cache: an existing redisvl SemanticCache)
optimizer = CacheThresholdOptimizer(cache, test_data)
optimizer.optimize()

# Optimize router thresholds (router: an existing redisvl SemanticRouter)
optimizer = RouterThresholdOptimizer(router, test_data)
optimizer.optimize(max_iterations=20, search_step=0.1)
```

### 🔄 Weighted Reciprocal Rank Fusion (Weighted RRF)
**NEW**: Advanced search method that combines multiple retrieval strategies with configurable weighting.

Weighted RRF lets you blend BM25 and vector search results with controlled weighting parameters. It is particularly effective when the two strategies have complementary strengths.

**Features:**
- Configurable weighting between BM25 and vector search
- Parameter `k` controls how quickly ranking contributions decay (see the sketch below)
- Handles cases where methods have complementary strengths
- Improved relevance through intelligent result fusion
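
For intuition, weighted RRF scores each document as a weighted sum of reciprocal ranks, `score(d) = Σ_i w_i / (k + rank_i(d))`, where a larger `k` flattens the contribution of lower-ranked results. The sketch below is a generic illustration of that formula, not the package's implementation; the function name and data shapes are made up for the example.

```python
def weighted_rrf(rankings, weights, k=60):
    """Illustrative weighted RRF: `rankings` maps method name -> ordered doc ids,
    `weights` maps method name -> weight. Not the package's implementation."""
    scores = {}
    for method, ranked_docs in rankings.items():
        weight = weights.get(method, 1.0)
        for rank, doc_id in enumerate(ranked_docs, start=1):
            # Each method contributes weight / (k + rank); larger k flattens the decay.
            scores[doc_id] = scores.get(doc_id, 0.0) + weight / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)


# Example: blend BM25 and vector rankings, favoring vector results 70/30
fused = weighted_rrf(
    rankings={"bm25": ["d3", "d1", "d2"], "vector": ["d1", "d2", "d4"]},
    weights={"bm25": 0.3, "vector": 0.7},
)
```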

### 🧠 Enhanced Vector Data Type Support
**NEW**: Support for multiple vector data types, including float16 and float32.

You can now test different vector data types to find the optimal balance between memory usage and precision. This is especially useful for production deployments where memory efficiency is crucial.

**Supported data types:**
- `float16`: Reduced memory usage with acceptable precision
- `float32`: Standard precision (default)

**Configuration:**
```yaml
vector_data_types: ["float16", "float32"]
```
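
As a quick, illustrative back-of-envelope check (not part of the package): halving the per-dimension width halves a vector's memory footprint. For a 768-dimensional embedding such as all-mpnet-base-v2:

```python
import numpy as np

dim = 768  # e.g. sentence-transformers/all-mpnet-base-v2
print(np.zeros(dim, dtype=np.float16).nbytes)  # 1536 bytes per vector
print(np.zeros(dim, dtype=np.float32).nbytes)  # 3072 bytes per vector
```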

### 🤖 OpenAI Embedding Model Support
**NEW**: Native support for OpenAI's text-embedding-3-small model.

The optimizer now supports OpenAI's latest embedding models, allowing you to compare their performance against HuggingFace models in your studies.

**Supported models:**
- `text-embedding-3-small` (1536 dimensions)
- All existing HuggingFace models

**Example configuration:**
```yaml
embedding_models:
  - type: "openai"
    model: "text-embedding-3-small"
    dim: 1536
    embedding_cache_name: "openai-small-vec-cache"
```

## 🔧 Improvements & Enhancements

### 📊 Enhanced Search Methods
- **Improved BM25**: Better handling of edge cases and error recovery
- **Enhanced Hybrid Search**: More robust combination of lexical and semantic search
- **Optimized Reranking**: Improved cross-encoder integration with better error handling
- **Better Vector Search**: Enhanced distance metric support and query optimization

### 🛠️ Developer Experience
- **Better Error Handling**: More graceful error recovery across all search methods
- **Improved Logging**: Enhanced logging for debugging and monitoring
- **Type Safety**: Better type hints and validation throughout the codebase
- **Documentation**: Comprehensive examples and API documentation

### 🔌 Extensibility
- **Custom Search Methods**: Easier creation of domain-specific search strategies
- **Flexible Corpus Processors**: Support for custom data formats and processing
- **Modular Architecture**: Better separation of concerns for easier extension

## 📚 New Documentation & Examples

### 📖 Comprehensive Examples
- **Threshold Optimization**: Complete notebook showing cache and router optimization
- **Model Comparison**: Side-by-side comparison of different embedding models
- **Custom Grid Study**: Advanced example with domain-specific search methods
- **Bayesian Optimization**: Detailed guide for fine-tuning index configurations

### 🔍 API Documentation
- Complete API reference for all new features
- Detailed configuration guides
- Best practices and performance tips

## 🐛 Bug Fixes

- Fixed issue with embedding cache name collisions
- Improved handling of empty search results
- Better error messages for configuration issues
- Fixed memory leaks in long-running studies
- Resolved issues with Redis connection handling

## 🔄 Breaking Changes

**None**: This release maintains full backward compatibility with v0.2.x.

## 📦 Installation

```bash
pip install redis-retrieval-optimizer==0.3.0
```

## 🎯 Migration Guide

No migration required! All existing configurations and code will work without changes. New features are opt-in and can be added to your existing studies.

## 🚀 Quick Start with New Features

### Threshold Optimization
```python
from redis_retrieval_optimizer.threshold_optimization import CacheThresholdOptimizer

# Create test data
test_data = [
    {"query": "What's the capital of France?", "query_match": "paris_key"},
    {"query": "What's the capital of Britain?", "query_match": ""},
]

# Optimize cache threshold (cache: an existing redisvl SemanticCache)
optimizer = CacheThresholdOptimizer(cache, test_data)
optimizer.optimize()
```

### Weighted RRF
```yaml
search_methods: ["bm25", "vector", "hybrid", "rerank", "weighted_rrf"]
```

### Vector Data Types
```yaml
vector_data_types: ["float16", "float32"]
```

### OpenAI Embeddings
```yaml
embedding_models:
  - type: "openai"
    model: "text-embedding-3-small"
    dim: 1536
    embedding_cache_name: "openai-cache"
```

## 🙏 Acknowledgments

Thank you to all contributors who helped make this release possible! Special thanks to the Redis community for feedback and testing.

## 📞 Support

- **Documentation**: [GitHub Wiki](https://github.com/redis-applied-ai/redis-retrieval-optimizer)
- **Issues**: [GitHub Issues](https://github.com/redis-applied-ai/redis-retrieval-optimizer/issues)
- **Discussions**: [GitHub Discussions](https://github.com/redis-applied-ai/redis-retrieval-optimizer/discussions)

---

**Stop guessing. Start measuring.** 📊

Transform your retrieval system from *"looks good to me"* to *"proven to perform"* with Redis Retrieval Optimizer v0.3.0!
