# Configuration Guide
## Overview

DeepCritical uses Pydantic Settings for centralized configuration management. All settings are defined in `src/utils/config.py` and can be configured via environment variables or a `.env` file.
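To make the pattern concrete, here is a minimal standard-library sketch of what a settings singleton like this does. The real class extends `pydantic_settings.BaseSettings`; the class body below is illustrative, though the field names (`llm_provider`, `max_iterations`, `openai_api_key`) match the settings documented in this guide.

```python
import os


class Settings:
    """Illustrative stand-in for the real pydantic-settings class:
    reads typed values from environment variables, falling back to defaults."""

    def __init__(self) -> None:
        # Uppercase the keys so lookups are case-insensitive,
        # mirroring pydantic-settings behavior.
        env = {k.upper(): v for k, v in os.environ.items()}
        self.llm_provider: str = env.get("LLM_PROVIDER", "openai")
        self.max_iterations: int = int(env.get("MAX_ITERATIONS", "10"))
        self.openai_api_key: str | None = env.get("OPENAI_API_KEY")

    @property
    def has_openai_key(self) -> bool:
        return bool(self.openai_api_key)


# Module-level singleton, as exposed by src.utils.config
settings = Settings()
```

The real class also loads a `.env` file before reading the process environment; that step is omitted here.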
## Quick Start

- Copy the example environment file (if available) or create a `.env` file in the project root
- Set at least one LLM API key (`OPENAI_API_KEY` or `ANTHROPIC_API_KEY`)
- Optionally configure other services as needed
## Configuration System

### How It Works

- **Settings Class**: The `Settings` class in `src/utils/config.py` extends `BaseSettings` from `pydantic_settings`
- **Environment File**: Automatically loads from a `.env` file (if present)
- **Environment Variables**: Reads from environment variables (case-insensitive)
- **Type Safety**: Strongly typed fields with validation
- **Singleton Pattern**: Global `settings` instance for easy access
Usage
from src.utils.config import settings
# Check if API keys are available
if settings.has_openai_key:
# Use OpenAI
pass
# Access configuration values
max_iterations = settings.max_iterations
web_search_provider = settings.web_search_provider
## Required Configuration

### At Least One LLM Provider

You must configure at least one LLM provider.

**OpenAI:**

```bash
LLM_PROVIDER=openai
OPENAI_API_KEY=your_openai_api_key_here
OPENAI_MODEL=gpt-5.1
```

**Anthropic:**

```bash
LLM_PROVIDER=anthropic
ANTHROPIC_API_KEY=your_anthropic_api_key_here
ANTHROPIC_MODEL=claude-sonnet-4-5-20250929
```
## Optional Configuration

### Embedding Configuration

```bash
# Embedding Provider: "openai", "local", or "huggingface"
EMBEDDING_PROVIDER=local

# OpenAI Embedding Model (used by LlamaIndex RAG)
OPENAI_EMBEDDING_MODEL=text-embedding-3-small

# Local Embedding Model (sentence-transformers)
LOCAL_EMBEDDING_MODEL=all-MiniLM-L6-v2

# HuggingFace Embedding Model
HUGGINGFACE_EMBEDDING_MODEL=sentence-transformers/all-MiniLM-L6-v2
```
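Since each provider has its own model setting, selection logic along these lines maps `EMBEDDING_PROVIDER` to the matching model. The function name is hypothetical; the defaults are the ones shown above.

```python
def resolve_embedding_model(provider: str) -> str:
    """Return the default embedding model for a given provider.

    Illustrative helper; defaults match the env examples in this guide.
    """
    models = {
        "openai": "text-embedding-3-small",
        "local": "all-MiniLM-L6-v2",
        "huggingface": "sentence-transformers/all-MiniLM-L6-v2",
    }
    try:
        return models[provider]
    except KeyError:
        raise ValueError(f"Unknown embedding provider: {provider!r}")
```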
### HuggingFace Configuration

```bash
# HuggingFace API Token (for inference API)
# HF_TOKEN is accepted as an alternative name
HUGGINGFACE_API_KEY=your_huggingface_api_key_here

# Default HuggingFace Model ID
HUGGINGFACE_MODEL=meta-llama/Llama-3.1-8B-Instruct
```
### Web Search Configuration

```bash
# Web Search Provider: "serper", "searchxng", "brave", "tavily", or "duckduckgo"
# Default: "duckduckgo" (no API key required)
WEB_SEARCH_PROVIDER=duckduckgo

# Serper API Key (for Google search via Serper)
SERPER_API_KEY=your_serper_api_key_here

# SearchXNG Host URL
SEARCHXNG_HOST=http://localhost:8080

# Brave Search API Key
BRAVE_API_KEY=your_brave_api_key_here

# Tavily API Key
TAVILY_API_KEY=your_tavily_api_key_here
```
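Because only DuckDuckGo works without credentials, provider selection naturally falls back to it when the configured provider's key or host is missing. This is an illustrative sketch of that logic, not the project's actual code; the setting names mirror the env vars above.

```python
# Setting each provider needs before it can be used (None = no key required).
REQUIRED_SETTING = {
    "serper": "SERPER_API_KEY",
    "searchxng": "SEARCHXNG_HOST",
    "brave": "BRAVE_API_KEY",
    "tavily": "TAVILY_API_KEY",
    "duckduckgo": None,
}


def effective_provider(configured: str, env: dict[str, str]) -> str:
    """Return the configured provider, or fall back to duckduckgo when its
    required credential/host is absent or the provider is unknown."""
    if configured not in REQUIRED_SETTING:
        return "duckduckgo"
    required = REQUIRED_SETTING[configured]
    if required is None or env.get(required):
        return configured
    return "duckduckgo"
```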
### PubMed Configuration

```bash
# NCBI API Key (optional, for higher rate limits: 10 req/sec vs 3 req/sec)
NCBI_API_KEY=your_ncbi_api_key_here
```
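The rate-limit difference translates directly into the minimum delay a client should leave between PubMed requests, roughly as follows (a sketch; the function name is illustrative):

```python
def min_request_interval(has_ncbi_key: bool) -> float:
    """Seconds to wait between PubMed requests: the NCBI E-utilities allow
    about 10 requests/sec with an API key and about 3/sec without one."""
    return 1 / 10 if has_ncbi_key else 1 / 3
```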
### Agent Configuration

```bash
# Maximum iterations per research loop
MAX_ITERATIONS=10

# Search timeout in seconds
SEARCH_TIMEOUT=30

# Use graph-based execution for research flows
USE_GRAPH_EXECUTION=false
```
### Budget & Rate Limiting Configuration

```bash
# Default token budget per research loop
DEFAULT_TOKEN_LIMIT=100000

# Default time limit per research loop (minutes)
DEFAULT_TIME_LIMIT_MINUTES=10

# Default iterations limit per research loop
DEFAULT_ITERATIONS_LIMIT=10
```
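A research loop bounded by these three settings can be sketched as a small tracker that stops when any limit is hit. This is an illustrative model under the defaults above, not DeepCritical's actual enforcement code:

```python
import time
from dataclasses import dataclass, field


@dataclass
class BudgetTracker:
    """Stops a research loop when tokens, wall-clock time, or iterations
    exceed the configured budget (defaults match this guide)."""
    token_limit: int = 100_000
    time_limit_minutes: int = 10
    iterations_limit: int = 10
    tokens_used: int = 0
    iterations_done: int = 0
    started_at: float = field(default_factory=time.monotonic)

    def exceeded(self) -> bool:
        elapsed_minutes = (time.monotonic() - self.started_at) / 60
        return (
            self.tokens_used >= self.token_limit
            or self.iterations_done >= self.iterations_limit
            or elapsed_minutes >= self.time_limit_minutes
        )
```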
### RAG Service Configuration

```bash
# ChromaDB collection name for RAG
RAG_COLLECTION_NAME=deepcritical_evidence

# Number of top results to retrieve from RAG
RAG_SIMILARITY_TOP_K=5

# Automatically ingest evidence into RAG
RAG_AUTO_INGEST=true
```
### ChromaDB Configuration

```bash
# ChromaDB storage path
CHROMA_DB_PATH=./chroma_db

# Whether to persist ChromaDB to disk
CHROMA_DB_PERSIST=true

# ChromaDB server host (for remote ChromaDB, optional)
# CHROMA_DB_HOST=localhost

# ChromaDB server port (for remote ChromaDB, optional)
# CHROMA_DB_PORT=8000
```
### External Services

```bash
# Modal Token ID (for Modal sandbox execution)
MODAL_TOKEN_ID=your_modal_token_id_here

# Modal Token Secret
MODAL_TOKEN_SECRET=your_modal_token_secret_here
```
### Logging Configuration

```bash
# Log Level: "DEBUG", "INFO", "WARNING", or "ERROR"
LOG_LEVEL=INFO
```
## Configuration Properties

The `Settings` class provides helpful properties for checking configuration:

```python
from src.utils.config import settings

# Check API key availability
settings.has_openai_key        # bool
settings.has_anthropic_key     # bool
settings.has_huggingface_key   # bool
settings.has_any_llm_key       # bool

# Check service availability
settings.modal_available       # bool
settings.web_search_available  # bool
```
## Environment Variables Reference

### Required (at least one LLM)

- `OPENAI_API_KEY` or `ANTHROPIC_API_KEY` - At least one LLM provider key

### Optional LLM Providers

- `DEEPSEEK_API_KEY` (Phase 2)
- `OPENROUTER_API_KEY` (Phase 2)
- `GEMINI_API_KEY` (Phase 2)
- `PERPLEXITY_API_KEY` (Phase 2)
- `HUGGINGFACE_API_KEY` or `HF_TOKEN`
- `AZURE_OPENAI_ENDPOINT` (Phase 2)
- `AZURE_OPENAI_DEPLOYMENT` (Phase 2)
- `AZURE_OPENAI_API_KEY` (Phase 2)
- `AZURE_OPENAI_API_VERSION` (Phase 2)
- `LOCAL_MODEL_URL` (Phase 2)
### Web Search

- `WEB_SEARCH_PROVIDER` (default: `"duckduckgo"`)
- `SERPER_API_KEY`
- `SEARCHXNG_HOST`
- `BRAVE_API_KEY`
- `TAVILY_API_KEY`
### Embeddings

- `EMBEDDING_PROVIDER` (default: `"local"`)
- `HUGGINGFACE_EMBEDDING_MODEL` (optional)
### RAG

- `RAG_COLLECTION_NAME` (default: `"deepcritical_evidence"`)
- `RAG_SIMILARITY_TOP_K` (default: 5)
- `RAG_AUTO_INGEST` (default: true)
### ChromaDB

- `CHROMA_DB_PATH` (default: `"./chroma_db"`)
- `CHROMA_DB_PERSIST` (default: true)
- `CHROMA_DB_HOST` (optional)
- `CHROMA_DB_PORT` (optional)
### Budget

- `DEFAULT_TOKEN_LIMIT` (default: 100000)
- `DEFAULT_TIME_LIMIT_MINUTES` (default: 10)
- `DEFAULT_ITERATIONS_LIMIT` (default: 10)
### Other

- `LLM_PROVIDER` (default: `"openai"`)
- `NCBI_API_KEY` (optional)
- `MODAL_TOKEN_ID` (optional)
- `MODAL_TOKEN_SECRET` (optional)
- `MAX_ITERATIONS` (default: 10)
- `LOG_LEVEL` (default: `"INFO"`)
- `USE_GRAPH_EXECUTION` (default: false)
## Validation

Settings are validated on load using Pydantic validation:

- **Type checking**: All fields are strongly typed
- **Range validation**: Numeric fields have min/max constraints
- **Literal validation**: Enum fields only accept specific values
- **Required fields**: API keys are checked when accessed via `get_api_key()`
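Conceptually, these checks behave like the hand-rolled validation below. This is a standard-library sketch for illustration; the real class declares the same constraints with Pydantic `Field(...)` bounds and `Literal` types, and the numeric bounds shown here are assumptions.

```python
def validate_settings(values: dict) -> dict:
    """Stdlib mimic of the checks Pydantic performs when settings load."""
    # Type checking / coercion: env vars arrive as strings.
    max_iterations = int(values.get("MAX_ITERATIONS", 10))

    # Range validation (illustrative bounds).
    if not 1 <= max_iterations <= 100:
        raise ValueError("MAX_ITERATIONS must be between 1 and 100")

    # Literal validation: only the documented log levels are accepted.
    log_level = values.get("LOG_LEVEL", "INFO")
    if log_level not in {"DEBUG", "INFO", "WARNING", "ERROR"}:
        raise ValueError(f"Invalid LOG_LEVEL: {log_level!r}")

    return {"max_iterations": max_iterations, "log_level": log_level}
```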
## Error Handling

Configuration errors raise `ConfigurationError`:

```python
from src.utils.config import settings
from src.utils.exceptions import ConfigurationError

try:
    api_key = settings.get_api_key()
except ConfigurationError as e:
    print(f"Configuration error: {e}")
```
## Future Enhancements (Phase 2)

The following configurations are planned for Phase 2:

- **Additional LLM Providers**: DeepSeek, OpenRouter, Gemini, Perplexity, Azure OpenAI, local models
- **Model Selection**: Reasoning/main/fast model configuration
- **Service Integration**: Migrate `folder/llm_config.py` to the centralized config

See `CONFIGURATION_ANALYSIS.md` for the complete implementation plan.