MCP Qdrant

TL;DR

  • Run as a server
  • Use an LLM + an MCP client + mcp-qdrant to find content by semantic meaning
  • Make sure the embedding model you configure for the MCP server is the same FastEmbed model you used to create the embeddings!

Tip

If you experience connection errors, make sure you have a .env file in the project root.

Run source .env to load it into your shell.

Also ensure that Qdrant is actually running.

The Flow

1. Client makes HTTP request
   ↓
2. Axum server receives TCP connection on port 8766
   ↓
3. Axum parses HTTP request
   ↓
4. Router checks path: "/mcp" → forward to service
   ↓
5. StreamableHttpService handles MCP protocol
   ↓
6. Your QdrantMCPServer executes the tool
   ↓
7. Response bubbles back up through service → router → server
   ↓
8. Axum server sends HTTP response over TCP
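The flow above can be sketched as a stripped-down, stdlib-only toy. The real server uses Axum and StreamableHttpService; the routing and handler names here are illustrative only:

```rust
use std::io::{Read, Write};
use std::net::{TcpListener, TcpStream};
use std::thread;

// Steps 4-6 condensed: check the path and dispatch to a handler.
fn route(path: &str) -> (u16, &'static str) {
    match path {
        "/mcp" => (200, "handled by MCP service"), // forward to service
        _ => (404, "not found"),
    }
}

fn main() -> std::io::Result<()> {
    // Step 2: accept a TCP connection (port 0 = any free port; the real server uses 8766)
    let listener = TcpListener::bind("127.0.0.1:0")?;
    let addr = listener.local_addr()?;

    let server = thread::spawn(move || {
        let (mut stream, _) = listener.accept().unwrap();
        let mut buf = [0u8; 1024];
        let n = stream.read(&mut buf).unwrap();
        // Step 3: parse the request line, e.g. "GET /mcp HTTP/1.1"
        let req = String::from_utf8_lossy(&buf[..n]).to_string();
        let path = req.split_whitespace().nth(1).unwrap_or("/").to_string();
        let (status, body) = route(&path);
        // Steps 7-8: the response bubbles back out over TCP
        let resp = format!(
            "HTTP/1.1 {} X\r\nContent-Length: {}\r\n\r\n{}",
            status, body.len(), body
        );
        stream.write_all(resp.as_bytes()).unwrap();
    });

    // Step 1: the client makes an HTTP request
    let mut client = TcpStream::connect(addr)?;
    client.write_all(b"GET /mcp HTTP/1.1\r\nHost: x\r\n\r\n")?;
    let mut out = String::new();
    client.read_to_string(&mut out)?;
    assert!(out.contains("handled by MCP service"));
    server.join().unwrap();
    Ok(())
}
```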

Testing

Test the server with the MCP Inspector: https://modelcontextprotocol.io/docs/tools/inspector

If you use Goose:

mcp-qdrant:
    enabled: true
    type: streamable_http
    name: mcp-qdrant
    description: mcp-qdrant
    uri: http://localhost:8766/mcp
    headers:
      Authorization: "Bearer mFC+GCGRI7uknvKuVNa8lGeOlZgUY8UxdJxIq7HqSOs="
    envs: {}
    env_keys: []
    timeout: 120
    bundled: null
    available_tools: []

Qdrant MCP Server

Enables semantic search capabilities through Qdrant vector database. Connect Claude or any MCP-compatible client to your vector data for intelligent, context-aware search and retrieval.

Features

  • Semantic Text Search - Natural language queries automatically embedded and searched
  • Filtered Search - Combine semantic search with metadata filtering (username, filename, etc.)
  • Keyword Search - Find semantically similar content that contains specific keywords
  • Vector Operations - Direct vector search, scrolling, counting, and collection management
  • Local Embeddings - Fast, private text embedding using FastEmbed (no external API calls)
  • Remote Access - Serve over HTTP for connection from Claude.ai and other MCP clients
  • Easy Configuration - Simple .env file setup with sensible defaults

Installation

Prerequisites

  • Rust 1.75 or higher
  • Qdrant instance running (local or remote)
  • Basic familiarity with vector databases

Build from Source

# Clone the repository
git clone <repository-url>
cd qdrant-mcp-server

# Build the server
cargo build --release

# Run the server
cargo run --release

On first run, the server will automatically create a .env file with default configuration.
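The "create .env if missing" behavior can be sketched in a few lines of stdlib Rust (the defaults and function name here are illustrative; the real server writes its own template):

```rust
use std::fs;
use std::path::Path;

// Illustrative defaults mirroring the configuration shown later in this README.
const DEFAULT_ENV: &str = "HOST=127.0.0.1\nPORT=8766\nQDRANT_URL=http://localhost:6334\nQDRANT_COLLECTION=qc1\n";

// Create `.env` with defaults only if it does not already exist.
fn ensure_env_file(path: &Path) -> std::io::Result<bool> {
    if path.exists() {
        return Ok(false); // keep the user's existing configuration
    }
    fs::write(path, DEFAULT_ENV)?;
    Ok(true)
}

fn main() -> std::io::Result<()> {
    let path = std::env::temp_dir().join("mcp_qdrant_demo.env");
    let _ = fs::remove_file(&path);
    assert!(ensure_env_file(&path)?);  // first run: file created
    assert!(!ensure_env_file(&path)?); // second run: left untouched
    fs::remove_file(&path)?;
    Ok(())
}
```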

Quick Start

1. Start Qdrant

If you don't have Qdrant running, start it with Docker:

docker run -p 6334:6334 qdrant/qdrant

2. Configure the Server

The server will create a .env file on first run. Edit it to match your setup:

# Server binding (use 0.0.0.0 for remote access)
HOST=127.0.0.1
PORT=8766

# Qdrant connection
QDRANT_URL=http://localhost:6334
QDRANT_COLLECTION=qc1

# Embedding model
EMBEDDING_MODEL=BAAI/bge-small-en-v1.5

# Logging
RUST_LOG=info,mcp_qdrant=debug
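The server most likely loads this file with a dotenv-style crate; for illustration, a minimal hand-rolled KEY=VALUE parse looks like this (names are assumptions, not the project's API):

```rust
use std::collections::HashMap;

// Parse simple KEY=VALUE lines, skipping blanks and `#` comments.
fn parse_env(contents: &str) -> HashMap<String, String> {
    contents
        .lines()
        .map(str::trim)
        .filter(|l| !l.is_empty() && !l.starts_with('#'))
        .filter_map(|l| l.split_once('='))
        .map(|(k, v)| (k.trim().to_string(), v.trim().to_string()))
        .collect()
}

fn main() {
    let env = parse_env("# Qdrant connection\nQDRANT_URL=http://localhost:6334\nPORT=8766\n");
    assert_eq!(env["QDRANT_URL"], "http://localhost:6334");
    assert_eq!(env["PORT"], "8766");
}
```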

Tip

Note that when running the server in a Docker container, we set HOST=0.0.0.0 so it listens on all network interfaces; this is necessary for the container's port mapping to reach it.

3. Start the Server

cargo run --release

You should see:

✅ Server is running!

📋 Available tools:
  • search_text - Natural language semantic search
  • filter_search - Search with metadata filters
  • keyword_search - Semantic search with keyword filtering
  ...

🔗 MCP endpoint: http://127.0.0.1:8766/mcp

Configuration

Embedding Models

Choose from several pre-trained FastEmbed models:

Model                                   Size    Speed     Quality  Use Case
BAAI/bge-small-en-v1.5                  Small   Fast      Good     Default, balanced
BAAI/bge-base-en-v1.5                   Medium  Moderate  Better   Higher quality
BAAI/bge-large-en-v1.5                  Large   Slower    Best     Maximum accuracy
sentence-transformers/all-MiniLM-L6-v2  Small   Fast      Good     Alternative option

And many more; the list grows as FastEmbed adds models. Kudos to FastEmbed!

Set your preferred model in .env:

EMBEDDING_MODEL=BAAI/bge-base-en-v1.5

Network Configuration

For local-only access (Claude Desktop, local clients):

HOST=127.0.0.1

For remote access (Claude.ai, network clients):

HOST=0.0.0.0

Security Warning: When exposing to the network, ensure proper firewall rules and consider adding authentication, or run behind a proxy, e.g., Nginx.

Connecting to Claude

Option 1: Claude Desktop (Local)

Add to your Claude Desktop configuration file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json

Windows: %APPDATA%\Claude\claude_desktop_config.json

Linux: ~/.config/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "qdrant": {
      "command": "/path/to/qdrant-mcp-server",
      "env": {
        "QDRANT_URL": "http://localhost:6334",
        "QDRANT_COLLECTION": "qc1"
      }
    }
  }
}

Windows Note: Use double backslashes (\\) or forward slashes (/) in paths:

"command": "C:\\path\\to\\qdrant-mcp-server.exe"

Option 2: Claude.ai (Remote via Custom Connector)

  1. Configure for remote access in .env:

    HOST=0.0.0.0
    PORT=8766
  2. Start the server:

    cargo run --release
  3. Add Custom Connector in Claude.ai:

    • Open Claude.ai
    • Go to Settings → Connectors
    • Click "Add custom connector"
    • Enter your server URL: http://your-server-ip:8766/mcp
    • Complete any authentication if configured
  4. Use the connector:

    • Click the paperclip icon in any conversation
    • Select resources/prompts from your Qdrant server
    • Ask Claude to search your vector database

For detailed instructions, see the MCP Remote Servers documentation.

Available Tools

Core Search Tools

search_text

Natural language semantic search - use this first!

Search for: "machine learning papers about transformers"

Parameters:

  • query (string) - Your natural language search query
  • limit (number, default: 10) - Maximum results to return
  • with_payload (boolean, default: true) - Include metadata

filter_search

Semantic search with metadata filtering

Search for: "project updates"
Filter by: username = "alice"

Parameters:

  • query (string) - Search query
  • filter_field (string) - Metadata field name (e.g., "username", "filename")
  • filter_value (string) - Value to match
  • limit (number, default: 10)

keyword_search

Semantic search that must contain specific keywords

Query: "database performance"
Must contain: "postgresql indexing"

Parameters:

  • query (string) - Semantic search query
  • must_contain_keywords (string) - Space-separated keywords that must appear
  • limit (number, default: 10)
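The "must appear" semantics can be sketched in a few lines: split must_contain_keywords on whitespace and require every keyword in the hit's text (the case-insensitive matching here is an assumption, not necessarily the server's exact behavior):

```rust
// A hit passes only if it contains every space-separated keyword.
fn contains_all_keywords(text: &str, must_contain_keywords: &str) -> bool {
    let haystack = text.to_lowercase();
    must_contain_keywords
        .split_whitespace()
        .all(|kw| haystack.contains(&kw.to_lowercase()))
}

fn main() {
    assert!(contains_all_keywords(
        "Tuning PostgreSQL indexing for faster lookups",
        "postgresql indexing"
    ));
    assert!(!contains_all_keywords("MySQL tips", "postgresql indexing"));
}
```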

Utility Tools

scroll_points

Paginate through all points in the collection

count_points

Get the total number of points

search_vectors

Advanced: Search using pre-computed embedding vectors

get_collection_info

View collection statistics and configuration

Resources

qdrant://collection

Provides information about the connected Qdrant collection

Prompts

vector_search_assistant

Get guidance on using the vector search capabilities

Architecture

┌─────────────────────┐
│   MCP Client        │
│  (Claude, etc.)     │
└──────────┬──────────┘
           │ HTTP/MCP Protocol
           │
┌──────────▼──────────┐
│  Qdrant MCP Server  │
│  ┌───────────────┐  │
│  │  HTTP Server  │  │
│  │  (Axum)       │  │
│  └───────┬───────┘  │
│          │          │
│  ┌───────▼───────┐  │
│  │ MCP Handler   │  │
│  │(Tools/Prompts)│  │
│  └───────┬───────┘  │
│          │          │
│  ┌───────▼───────┐  │
│  │   FastEmbed   │  │
│  │  (Local Model)│  │
│  └───────────────┘  │
└──────────┬──────────┘
           │ gRPC
           │
┌──────────▼──────────┐
│  Qdrant Database    │
│  (Vector Storage)   │
└─────────────────────┘

Key Components:

  • Axum HTTP Server: Serves the MCP endpoint
  • MCP Handler: Implements MCP protocol (tools, resources, prompts)
  • FastEmbed: Local text embedding (no external API calls)
  • Qdrant Client: Communicates with Qdrant vector database
  • Thread-Safe Design: Arc<Mutex<>> ensures safe concurrent access
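The Arc&lt;Mutex&lt;&gt;&gt; pattern can be illustrated with a stand-in for the embedding model (the struct and method names are hypothetical; only the sharing pattern mirrors the design described above):

```rust
use std::sync::{Arc, Mutex};
use std::thread;

// Stand-in for the real FastEmbed model.
struct EmbeddingModel;

impl EmbeddingModel {
    fn embed(&self, _text: &str) -> Vec<f32> {
        vec![0.0; 384] // bge-small-en-v1.5 produces 384-dimensional vectors
    }
}

fn main() {
    // Arc shares ownership across threads; Mutex serializes access.
    let model = Arc::new(Mutex::new(EmbeddingModel));
    let handles: Vec<_> = (0..4)
        .map(|_| {
            let model = Arc::clone(&model);
            thread::spawn(move || {
                // The lock ensures one embedding runs at a time.
                model.lock().unwrap().embed("query").len()
            })
        })
        .collect();
    for h in handles {
        assert_eq!(h.join().unwrap(), 384);
    }
}
```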

Development

Project Structure

src/
├── config/
│   └── mod.rs          # Configuration management
├── mcp_server/
│   ├── handler.rs      # MCP protocol handlers
│   ├── server.rs       # Core server logic and tools
│   ├── types.rs        # Request/response types
│   └── mod.rs
└── main.rs             # Entry point

Adding New Tools

  1. Define the argument struct in types.rs:
#[derive(Debug, Deserialize, JsonSchema)]
pub struct MyToolArgs {
    pub param: String,
}
  2. Add the tool method in server.rs inside the #[tool_router] impl block:
#[tool(description = "My new tool")]
pub async fn my_tool(
    &self,
    Parameters(args): Parameters<MyToolArgs>,
) -> Result<CallToolResult, McpError> {
    // Implementation
    Ok(CallToolResult::success(vec![Content::text("result")]))
}

Running Tests

cargo test

Logging

Control log levels via RUST_LOG:

# In .env file
RUST_LOG=info,mcp_qdrant=debug

# Or via environment variable
RUST_LOG=debug cargo run

Troubleshooting

Server won't start

"Failed to bind to address"

  • Port 8766 might be in use
  • Change PORT in .env file
  • Check with: lsof -i :8766 (macOS/Linux) or netstat -ano | findstr :8766 (Windows)
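The bind failure is easy to reproduce in stdlib Rust: a second bind to an address that is already being listened on fails with "address already in use" (the helper name here is illustrative):

```rust
use std::net::TcpListener;

// Returns true if `addr` is already taken, reproducing the error above.
fn port_in_use(addr: std::net::SocketAddr) -> bool {
    TcpListener::bind(addr).is_err()
}

fn main() {
    // Port 0 asks the OS for any free port; the real server uses 8766.
    let first = TcpListener::bind("127.0.0.1:0").unwrap();
    let addr = first.local_addr().unwrap();
    assert!(port_in_use(addr)); // second bind fails while `first` holds the port
    drop(first);
}
```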

"Failed to connect to Qdrant"

  • Ensure Qdrant is running: docker ps
  • Check QDRANT_URL in .env
  • Verify network connectivity

Model download fails

"Failed to download embedding model"

  • Check internet connection (first run only)
  • Models are cached in ~/.cache/fastembed
  • Try a different model in .env

Search returns no results

Empty results from search

  • Verify collection exists: Use get_collection_info tool
  • Check collection name in .env matches your data
  • Ensure points have been indexed in Qdrant
  • Try increasing limit parameter

Claude can't connect

Remote connection fails

  • Verify HOST=0.0.0.0 for network access
  • Check firewall rules allow port 8766
  • Confirm server is accessible: curl http://your-ip:8766/mcp
  • Review server logs for errors

Performance issues

Slow embedding generation

  • Embeddings are generated sequentially (thread-safe)
  • Consider using a smaller model (bge-small)
  • For high concurrency, implement model pooling
  • Check Qdrant query performance separately
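Model pooling, suggested above for high concurrency, can be sketched with a Mutex-guarded Vec of instances; this is an assumption about one possible design, not the shipped implementation:

```rust
use std::sync::{Arc, Mutex};
use std::thread;

// Stand-in for a loaded FastEmbed model instance.
struct Model(usize);

// Take a model from the pool, "use" it, and put it back.
fn checkout_and_return(pool: &Mutex<Vec<Model>>) -> usize {
    loop {
        if let Some(model) = pool.lock().unwrap().pop() {
            let id = model.0; // embedding would run here, outside the lock
            pool.lock().unwrap().push(model);
            return id;
        }
        thread::yield_now(); // pool empty: wait for a model to be returned
    }
}

fn main() {
    // Two model instances shared by four worker threads.
    let pool = Arc::new(Mutex::new(vec![Model(0), Model(1)]));
    let handles: Vec<_> = (0..4)
        .map(|_| {
            let pool = Arc::clone(&pool);
            thread::spawn(move || checkout_and_return(&pool))
        })
        .collect();
    for h in handles {
        assert!(h.join().unwrap() < 2);
    }
}
```

With a pool, only checkout and return are serialized; the embedding work itself can proceed in parallel across instances.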

Security Considerations

When exposing the server remotely:

  1. Use HTTPS: Place behind a reverse proxy (nginx, Caddy) with TLS
  2. Add Authentication: Implement API keys or OAuth
  3. Firewall Rules: Restrict access to known IP addresses
  4. Rate Limiting: Prevent abuse of embedding/search endpoints
  5. Monitor Logs: Watch for suspicious activity

Example nginx configuration:

server {
    listen 443 ssl;
    server_name your-domain.com;
    
    ssl_certificate /path/to/cert.pem;
    ssl_certificate_key /path/to/key.pem;
    
    location /mcp {
        proxy_pass http://127.0.0.1:8766;
        proxy_http_version 1.1;
        proxy_set_header Upgrade $http_upgrade;
        proxy_set_header Connection 'upgrade';
    }
}

Performance Tips

  1. Choose the right model: Smaller models are faster but less accurate
  2. Optimize limits: Don't fetch more results than needed
  3. Use filters: Narrow searches with metadata filters
  4. Batch operations: Group related queries when possible
  5. Monitor Qdrant: Ensure proper indexing and resource allocation

Contributing

Contributions are welcome! Please:

  1. Fork the repository
  2. Create a feature branch
  3. Add tests for new functionality
  4. Ensure cargo fmt and cargo clippy pass
  5. Submit a pull request

License

MIT

Acknowledgments

Support

