🤖 MCP Server Usage Guide¶
Connect any AI agent to Spector's search engine in minutes.
This guide covers practical setup for Claude Desktop, Cursor IDE, and custom MCP clients.
Quick Start (3 Steps)¶
1. Build the Distribution JAR¶
The fat JAR is produced at spector-dist/target/spector.jar.
2. Configure Your AI Agent¶
Add the following to your agent's MCP configuration (see per-agent sections below):
{
"mcpServers": {
"spector": {
"command": "java",
"args": [
"--add-modules", "jdk.incubator.vector",
"--enable-native-access=ALL-UNNAMED",
"--enable-preview",
"-jar", "/path/to/spector-dist/target/spector.jar",
"--config", "/path/to/spector.yml"
]
}
}
}
3. Start Using¶
Your AI agent now has access to up to 13 tools. With cognitive memory enabled (spector.memory.enabled: true), all 13 tools are registered. Otherwise, the 6 search tools are available:
- "Search for documents about SIMD acceleration" →
semantic_search - "Find articles mentioning 'Panama' and related to memory management" →
hybrid_search - "What does the codebase say about quantization?" →
rag_query - "Add this document to the index: ..." →
ingest_document - "Remember that the user prefers dark mode" →
core_memory_append - "What do you remember about the user's preferences?" →
recall_context
CLI Options¶
| Flag | Default | Description |
|---|---|---|
--config <FILE> |
(none) | Explicit config file (YAML or .properties) |
--profile <NAME> |
(none) | Configuration profile (loads spector-{profile}.yml) |
--dims <N> |
384 | Vector dimensionality (must match your embedding model) |
--capacity <N> |
100,000 | Maximum document capacity |
--data-dir <DIR> |
(none) | Persistence directory (auto-enables DISK mode) |
--ollama-url <URL> |
(none) | Ollama embedding server URL (e.g., http://localhost:11434) |
--ollama-model <NAME> |
(none) | Ollama embedding model name (e.g., nomic-embed-text) |
--help, -h |
— | Show help message |
Tip
Recommended approach: Use a spector.yml config file rather than CLI flags. CLI flags override values from the config file.
Configuration File¶
All settings can be specified in a spector.yml file:
spector:
engine:
dimensions: 768
capacity: 100000
persistence-mode: DISK
data-directory: .spector/index
embedding:
model: nomic-embed-text
base-url: http://localhost:11434
memory:
enabled: true # Enable cognitive memory tools
persistence-path: .spector/memory
See the Configuration Guide for the complete list of settings.
Choosing Dimensions¶
The --dims flag must match your embedding model's output dimensionality:
| Model | Dimensions | Flag |
|---|---|---|
nomic-embed-text |
768 | --dims 768 |
all-minilm |
384 | --dims 384 |
mxbai-embed-large |
1024 | --dims 1024 |
qwen3-embedding |
4096 | --dims 4096 |
Agent Configuration¶
Claude Desktop¶
Edit your claude_desktop_config.json:
Configuration:
{
"mcpServers": {
"spector": {
"command": "java",
"args": [
"--add-modules", "jdk.incubator.vector",
"--enable-native-access=ALL-UNNAMED",
"--enable-preview",
"-jar", "/absolute/path/to/spector.jar",
"--config", "/absolute/path/to/spector.yml"
]
}
}
}
Tip
Use absolute paths for the JAR file. Relative paths may not resolve correctly from Claude Desktop's working directory.
Cursor IDE¶
Add to your Cursor MCP settings (.cursor/mcp.json in your project, or global settings):
{
"mcpServers": {
"spector": {
"command": "java",
"args": [
"--add-modules", "jdk.incubator.vector",
"--enable-native-access=ALL-UNNAMED",
"--enable-preview",
"-jar", "/absolute/path/to/spector.jar",
"--config", "/absolute/path/to/spector.yml"
]
}
}
}
Custom MCP Clients¶
Any application implementing the MCP client specification can connect to Spector. The server communicates via JSON-RPC 2.0 over stdio (stdin/stdout).
Key requirements:
- Spawn the Java process with the correct JVM flags
- Write JSON-RPC messages to the process's stdin
- Read JSON-RPC responses from the process's stdout
- All logging goes to stderr (stdout is reserved for protocol messages)
Example initialization sequence:
// Client → Server
{"jsonrpc": "2.0", "id": 1, "method": "initialize", "params": {"protocolVersion": "2025-03-26", "capabilities": {}, "clientInfo": {"name": "my-app", "version": "1.0"}}}
// Server → Client
{"jsonrpc": "2.0", "id": 1, "result": {"protocolVersion": "2025-03-26", "capabilities": {"tools": {}}, "serverInfo": {"name": "spector-mcp", "version": "0.1.0"}}}
// Client → Server
{"jsonrpc": "2.0", "method": "notifications/initialized"}
MCP Tools Overview¶
Once connected, your agent has access to these tools:
Search Tools (always available)¶
| Tool | Description | Requires Embedding |
|---|---|---|
semantic_search |
Vector similarity search | ✅ |
hybrid_search |
Keyword + vector with RRF fusion | Partial (keyword mode works without) |
rag_query |
Retrieval-Augmented Generation context | ✅ |
ingest_document |
Add documents to the index | ✅ (for auto-embedding) |
delete_document |
Remove documents by ID | ❌ |
engine_status |
Engine capabilities and stats | ❌ |
Cognitive Memory Tools (enabled via spector.memory.enabled: true)¶
| Tool | Description |
|---|---|
core_memory_append |
Store a semantic memory with tags and source |
recall_context |
Cognitive recall with fused scoring across tiers |
memory_status |
Memory tier counts and persistence info |
memory_reinforce |
Report positive/negative outcome for a memory |
memory_forget |
Tombstone a memory by ID |
memory_introspect |
Metamemory self-analysis on a topic |
working_memory_scratchpad |
Quick-write to working memory |
Note
For full tool schemas and parameter details, see the MCP Integration Architecture page.
Troubleshooting¶
Agent can't find or start the server¶
- Check the JAR path — Use absolute paths, not relative
- Check Java version — Spector requires JDK 25+. Run
java -versionto verify - Check JVM flags —
--add-modules jdk.incubator.vectoris required
"Embedding provider not configured" errors¶
The semantic_search and rag_query tools require an embedding provider. Ensure:
- Ollama is running:
ollama serve - The model is pulled:
ollama pull nomic-embed-text - Both
--ollama-urland--ollama-modelare specified in the args
Stdout corruption / garbled output¶
Spector redirects all logging to stderr. If you see garbled output:
- Check that nothing else is writing to stdout
- Verify the logback configuration routes to stderr
- Check for print statements in any custom code
Performance issues¶
- High latency on first query — The HNSW index is built lazily. First query triggers graph construction. Subsequent queries are fast.
- Memory usage — Vectors are stored off-heap. Monitor with
-XX:NativeMemoryTracking=summaryandjcmd <pid> VM.native_memory summary
Adding a New Tool¶
To extend the MCP server with a custom tool:
- Create a new class extending
McpToolHandler:
public final class MyCustomTool extends McpToolHandler {
@Override public String name() { return "my_custom_tool"; }
@Override public String description() { return "Does something useful."; }
@Override public Map<String, Object> inputSchema() {
return ToolSchemaBuilder.object()
.requiredString("input", "The input parameter.")
.build();
}
@Override public CallToolResult execute(SpectorEngine engine, Map<String, Object> args) {
String input = requireString(args, "input");
// Your logic here
return textResult("Result: " + input);
}
}
- Register it in
SpectorToolRegistry.handlers():
That's it — the tool is automatically available to all connected agents.
See Also¶
- MCP Integration Architecture — Module structure, data flow, and performance analysis
- Architecture Overview — Full system architecture
- REST API Reference — Alternative HTTP interface