Memory Settings - Configuration Viewer
Display the current configuration of the AI Memory Module, including collections, types, thresholds, token budgets, and service endpoints.
Usage
bash
# Show all memory settings /memory-settings # Show specific section /memory-settings --section collections /memory-settings --section types /memory-settings --section thresholds /memory-settings --section services /memory-settings --section agents
Configuration Sections
Collections
Shows the 3 Memory System V2.0 collections:
- •code-patterns - Project-specific implementation patterns
- •conventions - Cross-project shared conventions
- •discussions - Decision context and session summaries
Memory Types (14 total)
Organized by collection:
- •code-patterns:
implementation,error_fix,refactor,file_pattern - •conventions:
rule,guideline,port,naming,structure - •discussions:
decision,session,blocker,preference,context
Thresholds
- •similarity_threshold - Minimum relevance score for search results (default: 0.7)
- •dedup_threshold - Similarity threshold for duplicate detection (default: 0.95)
- •max_retrievals - Maximum memories per search (default: 5)
- •token_budget - Maximum tokens for context injection (default: 2000)
Services
Shows connection details for:
- •Qdrant - Vector database (default: localhost:26350)
- •Embedding Service - Jina AI embeddings (default: localhost:28080)
- •Monitoring API - Health checks and metrics (default: localhost:28000)
- •Streamlit Dashboard - Web UI (default: localhost:28501)
- •Grafana - Metrics visualization (default: localhost:23000)
- •Prometheus - Metrics storage (default: localhost:29090)
- •Pushgateway - Metrics push gateway (default: localhost:29091)
Agent Token Budgets
Shows token allocation per BMAD agent:
- •architect: 1500 tokens
- •analyst: 1200 tokens
- •pm: 1200 tokens
- •developer/dev: 1200 tokens
- •solo-dev: 1500 tokens
- •quick-flow-solo-dev: 1500 tokens
- •ux-designer: 1000 tokens
- •qa: 1000 tokens
- •tea: 1000 tokens
- •code-review/code-reviewer: 1200 tokens
- •scrum-master/sm: 800 tokens
- •tech-writer: 800 tokens
- •default: 1000 tokens
Logging
- •log_level - Logging verbosity (default: INFO)
- •log_format - Log output format (json or text, default: json)
Collection Size Limits
- •Warning threshold - 10,000 points
- •Critical threshold - 50,000 points
Examples
bash
# View complete configuration /memory-settings # Check current thresholds /memory-settings --section thresholds # View service endpoints /memory-settings --section services # Check agent token budgets /memory-settings --section agents # View all memory types /memory-settings --section types
Python Configuration Reference
Configuration is managed by src/memory/config.py:
python
from src.memory.config import get_config, AGENT_TOKEN_BUDGETS, get_agent_token_budget
# Get configuration singleton
config = get_config()
# Access settings
print(f"Qdrant: {config.qdrant_host}:{config.qdrant_port}")
print(f"Similarity threshold: {config.similarity_threshold}")
print(f"Max retrievals: {config.max_retrievals}")
# Get agent-specific token budget
budget = get_agent_token_budget("architect") # Returns 1500
Environment Variables
Configuration can be customized via environment variables or .env file:
bash
# Core thresholds SIMILARITY_THRESHOLD=0.7 # Retrieval relevance cutoff DEDUP_THRESHOLD=0.95 # Duplicate detection sensitivity MAX_RETRIEVALS=5 # Results per search TOKEN_BUDGET=2000 # Context injection limit # Service endpoints QDRANT_HOST=localhost QDRANT_PORT=26350 EMBEDDING_HOST=localhost EMBEDDING_PORT=28080 MONITORING_HOST=localhost MONITORING_PORT=28000 # Logging LOG_LEVEL=INFO # DEBUG, INFO, WARNING, ERROR, CRITICAL LOG_FORMAT=json # json or text # Collection size limits COLLECTION_SIZE_WARNING=10000 COLLECTION_SIZE_CRITICAL=50000
Configuration Precedence
Settings are loaded in this order (highest priority first):
- •Environment variables
- •
.envfile in project root - •Default values
Output Format
The skill displays configuration in organized sections with:
- •Current values
- •Default values (if different)
- •Validation ranges (for thresholds)
- •Service URLs with ports
- •Memory type mappings to collections
Technical Details
- •Type Safety: Uses pydantic-settings v2.6+ for validation
- •Immutable: Configuration is frozen (thread-safe)
- •Singleton: Single config instance per process (lru_cache)
- •Validation: All thresholds validated on load
Related Skills
- •
/search-memory- Use these settings for memory search - •
/memory-status- Check system health and statistics
Notes
- •Configuration is loaded once at startup
- •Changes to .env require service restart
- •All ports use 2XXXX prefix to avoid conflicts
- •Token budgets optimized per agent role (architects need more context than scrum masters)