Cortex Distributed Agent: Project TODO List
This document tracks prioritized tasks, technical debt, and future implementations for the Cortex project.
High Priority (Infrastructure)
[ ] Persistent Sub-Worker Bridges (CDP/LSP) - CURRENT FOCUS
- Description: Implement long-running "Skill Bridges" for Browser automation (CDP) and Code Intelligence (LSP).
- Goal: Support the Antigravity Browser Skill by maintaining a persistent browser session rather than spawning a new process per task.
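Sketch (illustrative only): the reuse-one-session idea, shown with a generic long-lived child process that speaks one line per request over stdin/stdout. `SkillBridge` and `ECHO_WORKER` are hypothetical names, and the echo worker is a stand-in; a real CDP bridge would hold a browser connection and a real LSP bridge would speak JSON-RPC, neither of which is modeled here.

```python
import subprocess
import sys
from typing import List, Optional


class SkillBridge:
    """Hypothetical bridge: keeps one long-lived worker and reuses it across requests."""

    def __init__(self, argv: List[str]):
        self._argv = argv
        self._proc: Optional[subprocess.Popen] = None

    @property
    def pid(self) -> Optional[int]:
        return self._proc.pid if self._proc is not None else None

    def _ensure_started(self) -> subprocess.Popen:
        # Start lazily, and restart only if the previous worker has exited.
        if self._proc is None or self._proc.poll() is not None:
            self._proc = subprocess.Popen(
                self._argv,
                stdin=subprocess.PIPE,
                stdout=subprocess.PIPE,
                text=True,
                bufsize=1,
            )
        return self._proc

    def request(self, line: str) -> str:
        """Send one request line and block until one response line arrives."""
        proc = self._ensure_started()
        proc.stdin.write(line + "\n")
        proc.stdin.flush()
        return proc.stdout.readline().rstrip("\n")

    def close(self) -> None:
        if self._proc is None:
            return
        self._proc.stdin.close()  # EOF ends the worker's read loop
        try:
            self._proc.wait(timeout=5)
        except subprocess.TimeoutExpired:
            self._proc.kill()
            self._proc.wait()
        self._proc.stdout.close()
        self._proc = None


# Stand-in worker (assumption): uppercases each input line and echoes it back.
ECHO_WORKER = [
    sys.executable, "-u", "-c",
    "import sys\nfor line in sys.stdin:\n    print(line.strip().upper(), flush=True)",
]
```

Two consecutive `request()` calls on the same `SkillBridge(ECHO_WORKER)` are served by the same process ID, which is the property the bridge is meant to guarantee.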
[ ] Multi-Tenancy & Resource Isolation
- Description: Isolate node groups by user/tenant and enforce hardware quotas.
- Why: Lets the Main AI (Antigravity) manage resource usage and force-cancel hung or orphaned (zombie) tasks, keeping nodes healthy.
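Sketch (illustrative only): per-tenant admission plus heartbeat-based zombie reaping. A simple concurrent-task cap stands in for real hardware quotas (CPU, memory, GPU), and `TenantScheduler`, `max_tasks_per_tenant`, and `heartbeat_timeout` are hypothetical names and defaults, not existing Cortex settings.

```python
import time
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Task:
    task_id: str
    tenant: str
    last_heartbeat: float


@dataclass
class TenantScheduler:
    max_tasks_per_tenant: int = 2    # assumed quota: concurrent tasks per tenant
    heartbeat_timeout: float = 30.0  # assumed seconds of silence before a task is a zombie
    tasks: Dict[str, Task] = field(default_factory=dict)

    def admit(self, task_id: str, tenant: str, now: Optional[float] = None) -> bool:
        """Accept the task only if the tenant is still under its quota."""
        now = time.monotonic() if now is None else now
        running = sum(1 for t in self.tasks.values() if t.tenant == tenant)
        if running >= self.max_tasks_per_tenant:
            return False
        self.tasks[task_id] = Task(task_id, tenant, now)
        return True

    def heartbeat(self, task_id: str, now: Optional[float] = None) -> None:
        """Record that a task is still alive."""
        now = time.monotonic() if now is None else now
        if task_id in self.tasks:
            self.tasks[task_id].last_heartbeat = now

    def reap_zombies(self, now: Optional[float] = None) -> List[str]:
        """Drop tasks with no recent heartbeat and return their IDs."""
        now = time.monotonic() if now is None else now
        dead = sorted(
            tid for tid, t in self.tasks.items()
            if now - t.last_heartbeat > self.heartbeat_timeout
        )
        for tid in dead:
            del self.tasks[tid]  # a real system would also cancel the task on the node
        return dead
```

Reaping frees the tenant's quota slot, so a tenant blocked by a hung task can submit again once the zombie is removed.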
[ ] Binary Artifact & Large Data Handling (Chunking)
- Description: Implement gRPC stream-based chunking for large artifacts.
- Requirement: Transparency. The Main AI should just see a "File" result; reassembly happens at the server layer.
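Sketch (illustrative only): chunking and server-side reassembly in plain Python, with no gRPC dependency. `Chunk`, `split_into_chunks`, `reassemble`, and the 64 KiB chunk size are assumptions for illustration; in a real service the generator would feed a gRPC streaming call and `Chunk` would be a protobuf message.

```python
import hashlib
from dataclasses import dataclass
from typing import Iterable, Iterator

CHUNK_SIZE = 64 * 1024  # assumed chunk size; tune for the transport


@dataclass(frozen=True)
class Chunk:
    """Hypothetical wire message: one slice of a larger artifact."""
    artifact_id: str
    index: int
    data: bytes
    is_last: bool
    sha256: str = ""  # digest of the whole artifact, set on the last chunk only


def split_into_chunks(artifact_id: str, payload: bytes,
                      chunk_size: int = CHUNK_SIZE) -> Iterator[Chunk]:
    """Yield ordered chunks; the final chunk carries the full-payload digest."""
    digest = hashlib.sha256(payload).hexdigest()
    total = max(1, -(-len(payload) // chunk_size))  # ceiling division, at least one chunk
    for i in range(total):
        part = payload[i * chunk_size:(i + 1) * chunk_size]
        last = i == total - 1
        yield Chunk(artifact_id, i, part, last, digest if last else "")


def reassemble(chunks: Iterable[Chunk]) -> bytes:
    """Server-side reassembly: check ordering and integrity, return the file bytes."""
    buf = bytearray()
    expected = 0
    final = None
    for chunk in chunks:
        if chunk.index != expected:
            raise ValueError(f"out-of-order chunk: got {chunk.index}, expected {expected}")
        buf.extend(chunk.data)
        expected += 1
        if chunk.is_last:
            final = chunk
            break
    if final is None:
        raise ValueError("stream ended before the last chunk")
    if hashlib.sha256(buf).hexdigest() != final.sha256:
        raise ValueError("checksum mismatch after reassembly")
    return bytes(buf)
```

The caller of `reassemble()` only ever sees the complete bytes, which matches the transparency requirement: the Main AI receives a single "File" result.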
[ ] Graceful Shutdown & Local Task Persistence (Built-in)
- Description: Handle node interrupts (SIGTERM/SIGINT) to allow workers to finish or checkpoint. Store a local task_history.json on the node to recover state after crash/restart.
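Sketch (illustrative only): a signal handler that only sets a flag, plus atomic writes to task_history.json so a crash never leaves a half-written file. The history layout (task ID mapped to "done" or "pending") and the function names are assumptions; the real node may track richer state.

```python
import json
import os
import signal
import tempfile
import threading
from pathlib import Path

HISTORY_PATH = Path("task_history.json")  # file name taken from the TODO item
shutdown_requested = threading.Event()


def request_shutdown(signum, frame):
    """Signal handler: only set a flag; workers checkpoint at a safe point."""
    shutdown_requested.set()


def install_signal_handlers():
    # signal.signal must be called from the main thread.
    signal.signal(signal.SIGTERM, request_shutdown)
    signal.signal(signal.SIGINT, request_shutdown)


def save_history(history: dict, path: Path = HISTORY_PATH) -> None:
    """Write atomically: temp file in the same directory, then os.replace."""
    fd, tmp = tempfile.mkstemp(dir=str(path.parent), suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        json.dump(history, f)
        f.flush()
        os.fsync(f.fileno())
    os.replace(tmp, path)


def load_history(path: Path = HISTORY_PATH) -> dict:
    """Return the saved history, or an empty one on first start."""
    try:
        return json.loads(path.read_text(encoding="utf-8"))
    except FileNotFoundError:
        return {}


def run_tasks(task_ids, history: dict, path: Path = HISTORY_PATH) -> dict:
    """Run tasks in order; after a shutdown request, mark the rest as pending."""
    for task_id in task_ids:
        if history.get(task_id) == "done":
            continue  # finished before a previous restart
        if shutdown_requested.is_set():
            history[task_id] = "pending"
            continue
        history[task_id] = "done"  # placeholder for the real work
        save_history(history, path)
    save_history(history, path)
    return history
```

On restart, a node would call `load_history()` and pass the result back into `run_tasks()`, which skips tasks already marked "done".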
[ ] Server-Side Registry & Task Persistence
- Description: Migrate NodeRegistry and WorkPool from in-memory to a persistent backend (Postgres/Redis).
- Priority: Deferred until Full System Integration phase.
[ ] Workspace Mirroring & Efficient File Sync
- Description: Maintain a server-side mirror of node workspaces for near-zero-latency AI perception (reads hit the mirror, not the node).
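Sketch (illustrative only): one common way to keep a mirror cheap to update is to compare content-hash manifests and transfer only files that differ. `build_manifest` and `diff_manifests` are hypothetical helpers; the TODO item does not specify a sync algorithm, so this is one possible approach and not the planned design.

```python
import hashlib
from pathlib import Path
from typing import Dict, List, Tuple


def build_manifest(root: Path) -> Dict[str, str]:
    """Map each file's POSIX-style relative path to the SHA-256 of its contents."""
    manifest = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            rel = path.relative_to(root).as_posix()
            manifest[rel] = hashlib.sha256(path.read_bytes()).hexdigest()
    return manifest


def diff_manifests(node: Dict[str, str],
                   mirror: Dict[str, str]) -> Tuple[List[str], List[str]]:
    """Return (paths the mirror must fetch, paths the mirror must delete)."""
    to_fetch = sorted(p for p, digest in node.items() if mirror.get(p) != digest)
    to_delete = sorted(p for p in mirror if p not in node)
    return to_fetch, to_delete
```

The node would send its manifest, and the server would fetch only the paths in `to_fetch` and remove the paths in `to_delete`, leaving unchanged files untouched.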
[ ] Real-time gRPC Log Streaming
- Description: Bidirectional stream for live stdout/stderr.
Low Priority / Observation
[ ] OS-Level Isolation (Firecracker/VNC)
- Description: Lightweight virtualization (microVMs) for worker execution.
- Status: Monitored.
[ ] Node Lifecycle: Auto-Updates
- Description: Mechanism for nodes to self-update.
[ ] Vertical & Horizontal Scalability
- Description: Migrate to a stateless server design with load balancing.
Future Roadmap (Strategic)
[ ] Advanced Scheduling & Capability Routing
- Description: Sophisticated scheduler to match complex constraints (GPU, Region, Priority).
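Sketch (illustrative only): constraint matching on GPU count and region, with higher-priority tasks placed first and ties broken by current load. The `Node` and `TaskSpec` fields and the least-loaded tie-break are assumptions made for the example, not the project's actual data model.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Node:
    node_id: str
    region: str
    gpus: int
    active_tasks: int


@dataclass(frozen=True)
class TaskSpec:
    task_id: str
    priority: int = 0
    min_gpus: int = 0
    region: Optional[str] = None  # None means any region is acceptable


def eligible(node: Node, task: TaskSpec) -> bool:
    """Hard constraints: enough GPUs and, if requested, a matching region."""
    return node.gpus >= task.min_gpus and (task.region is None or node.region == task.region)


def schedule(nodes: List[Node], tasks: List[TaskSpec]) -> Dict[str, Optional[str]]:
    """Place higher-priority tasks first; returns task_id -> node_id, or None if unplaceable."""
    load = {n.node_id: n.active_tasks for n in nodes}
    plan: Dict[str, Optional[str]] = {}
    for task in sorted(tasks, key=lambda t: -t.priority):
        candidates = [n for n in nodes if eligible(n, task)]
        if not candidates:
            plan[task.task_id] = None
            continue
        # Prefer the least-loaded eligible node; node_id makes the choice deterministic.
        best = min(candidates, key=lambda n: (load[n.node_id], n.node_id))
        load[best.node_id] += 1
        plan[task.task_id] = best.node_id
    return plan
```

A task whose constraints no node satisfies maps to `None`, so the caller can queue or reject it explicitly.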
[ ] mTLS Certificate Lifecycle Management
- Description: Automated renewal, revocation, and rotation of node certificates.
[ ] Immutable Audit & Compliance
- Description: Cryptographically signed records of every TaskRequest and TaskResponse for forensics.
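Sketch (illustrative only): a tamper-evident log in which each entry's tag covers the previous entry's tag. It uses HMAC-SHA256 from the standard library as a stand-in for real signatures; HMAC is a shared-secret MAC, so a production design would likely swap in asymmetric signatures and protected key storage. The entry layout and function names are assumptions.

```python
import hashlib
import hmac
import json
from typing import List

GENESIS = "0" * 64  # assumed sentinel for the "previous tag" of the first entry


def _tag(key: bytes, prev_sig: str, record: dict) -> str:
    """HMAC over the previous tag plus a canonical JSON encoding of the record."""
    body = json.dumps(record, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hmac.new(key, prev_sig.encode("ascii") + body, hashlib.sha256).hexdigest()


def append_entry(log: List[dict], key: bytes, record: dict) -> dict:
    """Append a record (e.g. a TaskRequest or TaskResponse summary) chained to the last entry."""
    prev = log[-1]["sig"] if log else GENESIS
    entry = {"record": record, "prev": prev, "sig": _tag(key, prev, record)}
    log.append(entry)
    return entry


def verify_log(log: List[dict], key: bytes) -> bool:
    """Recompute every tag in order; any edit, removal, or reorder breaks the chain."""
    prev = GENESIS
    for entry in log:
        if entry["prev"] != prev:
            return False
        if not hmac.compare_digest(_tag(key, prev, entry["record"]), entry["sig"]):
            return False
        prev = entry["sig"]
    return True
```

Because each tag depends on the one before it, changing any earlier record invalidates every later entry, which is what makes the log useful for forensics.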