# AGENTS.md

Beta Development Guidelines. Local-only deployment: each user runs their own instance.
## Core Principles
- No backwards compatibility - remove deprecated code immediately
- Detailed errors over graceful failures - we want to identify and fix issues fast
- Break things to improve them - beta is for rapid iteration
## Error Handling
Core principle: In beta, decide intelligently when to fail hard and fast so issues surface quickly, and when to let critical services complete despite failures. Read the guidance below carefully and make the call case by case.
### When to Fail Fast and Loud (Let it Crash!)
These errors should stop execution and bubble up immediately (except in crawling flows):
- Service startup failures - If credentials, database, or any service can't initialize, the system should crash with a clear error
- Missing configuration - Missing environment variables or invalid settings should stop the system
- Database connection failures - Don't hide connection issues, expose them
- Authentication/authorization failures - Security errors must be visible and halt the operation
- Data corruption or validation errors - Never silently accept bad data, Pydantic should raise
- Critical dependencies unavailable - If a required service is down, fail immediately
- Invalid data that would corrupt state - Never store zero embeddings, null foreign keys, or malformed JSON
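The fail-fast rules above can be sketched as a startup check. This is a minimal illustration, not the codebase's actual startup code; `validate_startup_config` and `REQUIRED_VARS` are hypothetical names, reusing the environment variables listed later in this document:

```python
import os

# Hypothetical startup check; REQUIRED_VARS mirrors the "Environment Variables" section.
REQUIRED_VARS = ["SUPABASE_URL", "SUPABASE_SERVICE_KEY"]

def validate_startup_config() -> dict[str, str]:
    """Fail fast and loud: crash with a clear error if required config is missing."""
    missing = [name for name in REQUIRED_VARS if not os.environ.get(name)]
    if missing:
        # Name exactly what is missing instead of limping along with defaults.
        raise RuntimeError(
            f"Startup aborted: missing required environment variables: {', '.join(missing)}"
        )
    return {name: os.environ[name] for name in REQUIRED_VARS}
```

Calling this at the top of `main` means a misconfigured instance dies immediately with an actionable message instead of failing later in a confusing way.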
### When to Complete but Log Detailed Errors
These operations should continue but track and report failures clearly:
- Batch processing - When crawling websites or processing documents, complete what you can and report detailed failures for each item
- Background tasks - Embedding generation, async jobs should finish the queue but log failures
- WebSocket events - Don't crash on a single event failure, log it and continue serving other clients
- Optional features - If projects/tasks are disabled, log and skip rather than crash
- External API calls - Retry with exponential backoff, then fail with a clear message about what service failed and why
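The last bullet (retry with exponential backoff, then fail with a clear message) can be sketched like this. The helper name and parameters are illustrative, not an existing utility in this codebase:

```python
import logging
import time

logger = logging.getLogger(__name__)

def call_with_backoff(fn, *, service_name: str, max_attempts: int = 3, base_delay: float = 1.0):
    """Retry an external call with exponential backoff, then fail loudly.

    Illustrative helper: says exactly which service failed and why.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except Exception as e:
            if attempt == max_attempts:
                # Final failure: name the service and preserve the cause chain.
                raise RuntimeError(
                    f"{service_name} failed after {max_attempts} attempts: {e}"
                ) from e
            delay = base_delay * 2 ** (attempt - 1)  # 1s, 2s, 4s, ...
            logger.warning(
                "%s attempt %d failed (%s); retrying in %.1fs",
                service_name, attempt, e, delay,
            )
            time.sleep(delay)
```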
### Critical Nuance: Never Accept Corrupted Data
When a process should continue despite failures, it must skip the failed item entirely rather than storing corrupted data:
❌ WRONG - Silent Corruption:

```python
try:
    embedding = create_embedding(text)
except Exception as e:
    embedding = [0.0] * 1536  # NEVER DO THIS - corrupts database
store_document(doc, embedding)
```
✅ CORRECT - Skip Failed Items:

```python
try:
    embedding = create_embedding(text)
    store_document(doc, embedding)  # Only store on success
except Exception as e:
    failed_items.append({'doc': doc, 'error': str(e)})
    logger.error(f"Skipping document {doc.id}: {e}")
    # Continue with next document, don't store anything
```
✅ CORRECT - Batch Processing with Failure Tracking:

```python
import traceback

def process_batch(items):
    results = {'succeeded': [], 'failed': []}
    for item in items:
        try:
            result = process_item(item)
            results['succeeded'].append(result)
        except Exception as e:
            results['failed'].append({
                'item': item,
                'error': str(e),
                'traceback': traceback.format_exc()
            })
            logger.error(f"Failed to process {item.id}: {e}")
    # Always return both successes and failures
    return results
```
### Error Message Guidelines
- Include context about what was being attempted when the error occurred
- Preserve full stack traces with `exc_info=True` in Python logging
- Use specific exception types, not generic `Exception` catching
- Include relevant IDs, URLs, or data that helps debug the issue
- Never return None/null to indicate failure - raise an exception with details
- For batch operations, always report both success count and detailed failure list
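Put together, these guidelines might look like this in practice. `DocumentProcessingError` and `fetch_and_chunk` are hypothetical names for illustration only:

```python
import logging

logger = logging.getLogger(__name__)

class DocumentProcessingError(Exception):
    """Specific exception type -- catch this, never bare Exception."""

def fetch_and_chunk(doc_id: str, url: str) -> list[str]:
    # Placeholder for a real chunking step (illustrative only).
    raise ValueError("unsupported content type")

def process_document(doc_id: str, url: str) -> list[str]:
    try:
        return fetch_and_chunk(doc_id, url)
    except ValueError as e:
        # Preserve the full stack trace and include the IDs/URLs that help debugging.
        logger.error("Failed to chunk document %s from %s", doc_id, url, exc_info=True)
        # Never return None on failure: raise with full context instead.
        raise DocumentProcessingError(
            f"Chunking failed for document {doc_id} ({url}): {e}"
        ) from e
```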
## Code Quality
- Remove dead code immediately rather than maintaining it - no backward compatibility or legacy functions
- Prioritize functionality over production-ready patterns
- Focus on user experience and feature completeness
- When updating code, don't reference what is changing (avoid keywords like LEGACY, CHANGED, REMOVED); instead, write comments that document only the code's current functionality
## Architecture Overview
Archon V2 Beta is a microservices-based knowledge management system with MCP (Model Context Protocol) integration:
- Frontend (port 3737): React + TypeScript + Vite + TailwindCSS
  - UI Strategy: Radix UI primitives in `/features`, custom components in legacy `/components`
  - State Management: TanStack Query for all data fetching in `/features`
  - Styling: Tron-inspired glassmorphism with Tailwind CSS
- Main Server (port 8181): FastAPI with HTTP polling for updates
- MCP Server (port 8051): Lightweight HTTP-based MCP protocol server
- Agents Service (port 8052): PydanticAI agents for AI/ML operations
- Database: Supabase (PostgreSQL + pgvector for embeddings)
## Development Commands

### Frontend (archon-ui-main/)

```bash
npm run dev               # Start development server on port 3737
npm run build             # Build for production
npm run lint              # Run ESLint
npm run test              # Run Vitest tests
npm run test:coverage     # Run tests with coverage report
```
## Biome Linter Guide for AI Assistants

### Overview

This project uses Biome for linting and formatting the `/src/features` directory. Biome provides fast, machine-readable feedback that AI assistants can use to improve code quality.

### Configuration

Biome is configured in `biome.json`:

- Scope: Only checks the `/src/features/**` directory
- Formatting: 2 spaces, 80 char line width
- Linting: Recommended rules enabled
- Import Organization: Automatically sorts and groups imports
### AI Assistant Workflow in the new /features directory

1. Check Issues: Run `npm run biome:ai` to get JSON output
2. Parse Output: Extract error locations and types
3. Apply Fixes:
   - Run `npm run biome:ai-fix` for auto-fixable issues
   - Manually fix remaining issues based on patterns above
4. Verify: Run `npm run biome:ai` again to confirm fixes
### JSON Output Format

When using `biome:ai`, the output is structured JSON:

```json
{
  "diagnostics": [
    {
      "file": "path/to/file.tsx",
      "line": 10,
      "column": 5,
      "severity": "error",
      "message": "Description of the issue",
      "rule": "lint/a11y/useButtonType"
    }
  ]
}
```
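As a sketch, an assistant could group those diagnostics by severity like this (the `summarize_biome_output` helper is illustrative, not part of the project):

```python
import json

def summarize_biome_output(raw: str) -> dict[str, list[str]]:
    """Group Biome diagnostics by severity into 'file:line:col [rule] message' strings."""
    diagnostics = json.loads(raw).get("diagnostics", [])
    summary: dict[str, list[str]] = {}
    for d in diagnostics:
        entry = f"{d['file']}:{d['line']}:{d['column']} [{d['rule']}] {d['message']}"
        summary.setdefault(d["severity"], []).append(entry)
    return summary
```

Errors can then be fixed first, warnings second, with the file/line info fed straight back into edits.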
### Backend (python/)

```bash
# Using uv package manager
uv sync                          # Install/update dependencies
uv run pytest                    # Run tests
uv run python -m src.server.main # Run server locally

# With Docker
docker-compose up --build -d     # Start all services
docker-compose logs -f           # View logs
docker-compose restart           # Restart services
```
### Testing

```bash
# Frontend tests (from archon-ui-main/)
npm run test:coverage:stream  # Run with streaming output
npm run test:ui               # Run with Vitest UI

# Backend tests (from python/)
uv run pytest tests/test_api_essentials.py -v
uv run pytest tests/test_service_integration.py -v
```
## Key API Endpoints

### Knowledge Base

- `POST /api/knowledge/crawl` - Crawl a website
- `POST /api/knowledge/upload` - Upload documents (PDF, DOCX, MD)
- `GET /api/knowledge/items` - List knowledge items
- `POST /api/knowledge/search` - RAG search
### MCP Integration

- `GET /api/mcp/health` - MCP server status
- `POST /api/mcp/tools/{tool_name}` - Execute MCP tool
- `GET /api/mcp/tools` - List available tools
### Projects & Tasks (when enabled)

- `GET /api/projects` - List all projects
- `POST /api/projects` - Create project
- `GET /api/projects/{id}` - Get single project
- `PUT /api/projects/{id}` - Update project
- `DELETE /api/projects/{id}` - Delete project
- `GET /api/projects/{id}/tasks` - Get tasks for project (use this, not getTasks)
- `POST /api/tasks` - Create task
- `PUT /api/tasks/{id}` - Update task
- `DELETE /api/tasks/{id}` - Delete task
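As an illustration, a RAG search call against the Main Server (port 8181) could be built like this with the standard library. The `query` payload field is an assumption; check the actual API schema before relying on it:

```python
import json
import urllib.request

BASE_URL = "http://localhost:8181"  # Main Server port from the architecture overview

def build_search_request(query: str) -> urllib.request.Request:
    """Build a POST /api/knowledge/search request (payload shape is an assumption)."""
    payload = json.dumps({"query": query}).encode()
    return urllib.request.Request(
        f"{BASE_URL}/api/knowledge/search",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Sending it is then a matter of `urllib.request.urlopen(build_search_request("..."))`, with `raise_for_status`-style checks per the error-handling guidelines.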
## Polling Architecture

### HTTP Polling (replaced Socket.IO)

- Polling intervals: 1-2s for active operations, 5-10s for background data
- ETag caching: Reduces bandwidth by ~70% via 304 Not Modified responses
- Smart pausing: Stops polling when browser tab is inactive
- Progress endpoints: `/api/progress/crawl`, `/api/progress/project-creation`
### Key Polling Hooks

- `usePolling` - Generic polling with ETag support
- `useDatabaseMutation` - Optimistic updates with rollback
- `useProjectMutation` - Project-specific operations
## Environment Variables

Required in `.env`:

```bash
SUPABASE_URL=https://your-project.supabase.co
SUPABASE_SERVICE_KEY=your-service-key-here
```

Optional:

```bash
OPENAI_API_KEY=your-openai-key   # Can be set via UI
LOGFIRE_TOKEN=your-logfire-token # For observability
LOG_LEVEL=INFO                   # DEBUG, INFO, WARNING, ERROR
```
## File Organization

### Frontend Structure

- `src/components/` - Legacy UI components (custom-built)
- `src/features/` - Modern vertical slice architecture with Radix UI
  - `ui/primitives/` - Radix UI primitives with Tron glassmorphism
  - `projects/` - Project management feature
    - `tasks/` - Task management sub-feature
- `src/pages/` - Main application pages
- `src/services/` - API communication and business logic
- `src/hooks/` - Custom React hooks
- `src/contexts/` - React context providers
### UI Libraries

- Radix UI (`@radix-ui/react-*`) - Unstyled, accessible primitives for `/features`
- TanStack Query - Data fetching and caching for `/features`
- React DnD - Drag and drop for Kanban boards
- Tailwind CSS - Utility-first styling with Tron-inspired glassmorphism
- Framer Motion - Animations (minimal usage)
### Theme Management

- ThemeContext - Manages light/dark theme state
- Tailwind dark mode - Uses the `dark:` prefix with the selector strategy
- Automatic switching - All components respect theme via Tailwind classes
- Persistent - Theme choice saved in localStorage
- Tron aesthetic - Stronger neon glows in dark mode, subtle in light mode

We're migrating to a vertical slice architecture where each feature is self-contained. Features are organized by domain hierarchy: main features contain their sub-features. For example, tasks are a sub-feature of projects, so they live at `features/projects/tasks/` rather than as separate siblings. Each feature level has its own `components`, `hooks`, `types`, and `services` folders. This keeps related code together and makes the codebase more maintainable as it scales.
### Backend Structure

- `src/server/` - Main FastAPI application
- `src/server/api_routes/` - API route handlers
- `src/server/services/` - Business logic services
- `src/mcp/` - MCP server implementation
- `src/agents/` - PydanticAI agent implementations
## Database Schema

Key tables in Supabase:

- `sources` - Crawled websites and uploaded documents
- `documents` - Processed document chunks with embeddings
- `projects` - Project management (optional feature)
- `tasks` - Task tracking linked to projects
- `code_examples` - Extracted code snippets
## API Naming Conventions

### Task Status Values

Use database values directly (no UI mapping): `todo`, `doing`, `review`, `done`
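A minimal sketch of enforcing these values on the Python side, consistent with the "never silently accept bad data" rule (`validate_status` is an illustrative helper, not an existing function):

```python
from typing import Literal

# Status values come straight from the database -- no UI mapping layer.
TaskStatus = Literal["todo", "doing", "review", "done"]
VALID_STATUSES: tuple[str, ...] = ("todo", "doing", "review", "done")

def validate_status(value: str) -> str:
    """Reject unknown statuses loudly instead of silently coercing them."""
    if value not in VALID_STATUSES:
        raise ValueError(f"Invalid task status {value!r}; expected one of {VALID_STATUSES}")
    return value
```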
### Service Method Patterns

- `get[Resource]sByProject(projectId)` - Scoped queries
- `get[Resource](id)` - Single resource
- `create[Resource](data)` - Create operations
- `update[Resource](id, updates)` - Updates
- `delete[Resource](id)` - Soft deletes
### State Naming

- `is[Action]ing` - Loading states (e.g., `isSwitchingProject`)
- `[resource]Error` - Error messages
- `selected[Resource]` - Current selection
## Common Development Tasks

### Add a new API endpoint

1. Create route handler in `python/src/server/api_routes/`
2. Add service logic in `python/src/server/services/`
3. Include router in `python/src/server/main.py`
4. Update frontend service in `archon-ui-main/src/services/`
### Add a new UI component

For the features directory (preferred for new components):

1. Use Radix UI primitives from `src/features/ui/primitives/`
2. Create the component in the relevant feature folder under `src/features/`
3. Use TanStack Query for data fetching
4. Apply Tron-inspired glassmorphism styling with Tailwind

For legacy components:

1. Create the component in `archon-ui-main/src/components/`
2. Add it to a page in `archon-ui-main/src/pages/`
3. Include any new API calls in services
4. Add tests in `archon-ui-main/test/`
### Debug MCP connection issues

1. Check MCP health: `curl http://localhost:8051/health`
2. View MCP logs: `docker-compose logs archon-mcp`
3. Test tool execution via the UI MCP page
4. Verify Supabase connection and credentials
## Code Quality Standards
We enforce code quality through automated linting and type checking:
- Python 3.12 with 120 character line length
- Ruff for linting - checks for errors, warnings, unused imports, and code style
- Mypy for type checking - ensures type safety across the codebase
- Auto-formatting on save in IDEs to maintain consistent style
- Run `uv run ruff check` and `uv run mypy src/` locally before committing
## MCP Tools Available

When connected to Cursor/Windsurf:

- `archon:perform_rag_query` - Search knowledge base
- `archon:search_code_examples` - Find code snippets
- `archon:manage_project` - Project operations
- `archon:manage_task` - Task management
- `archon:get_available_sources` - List knowledge sources
## Important Notes

- Projects feature is optional - toggle it in the Settings UI
- All services communicate via HTTP, not gRPC
- HTTP polling handles all updates (Socket.IO removed)
- Frontend uses the Vite proxy for API calls in development
- Python backend uses `uv` for dependency management
- Docker Compose handles service orchestration
- We use TanStack Query everywhere - no prop drilling! Refactoring in progress