Autonomous Development – What Actually Works Today

Status: This page describes what works right now in CORE, and clearly separates shipped behaviour from planned extensions.

CORE can already take a natural‑language goal, turn it into a governed "crate" of work, and drive it through:

All of this is performed under the Mind–Body–Will architecture and enforced by the .intent/ constitution.

1. Quick demo (what actually runs)

From your project root:

# Start your environment
poetry install

# Example: ask CORE to implement a small feature
poetry run core-admin develop feature "Add health endpoint"

In the background, CORE will:

Create a crate describing your request (intent, context, constraints).
Use the Autonomous Developer pipeline to generate code, tests, and any supporting artifacts.
Run the validation pipeline (formatters, linters, tests) over the crate outputs.
Run constitutional audits (Mind) before any change is considered acceptable.

The result is a governed feature crate that is either:

The exact CLI flags and options are documented in the CLI Reference and CLI Workflows.

This is the high‑level autonomous development pipeline as implemented in CORE today.

Stage	Description	Primary components	Status
1. Intent capture	You describe the goal in natural language via CLI.	`core-admin develop` (Typer app under `src/body/cli`), `will/cli_logic/run.py` / `develop.py` wiring	Shipped
2. Crate creation	The request is turned into a structured crate: metadata, constraints, files to touch, and context.	`src/body/services/crate_creation_service.py`	Shipped
3. Context building	Relevant code, docs, and manifest metadata are collected into an LLM‑ready context.	`src/services/context/builder.py`, `src/services/context/providers/*`	Shipped
4. Autonomous coding	Agents generate or modify code, tests, and docs inside the crate.	`src/features/autonomy/autonomous_developer.py`, `src/will/agents/coder_agent.py`, `src/will/orchestration/prompt_pipeline.py`	Shipped
5. Local validation	Code is formatted, linted, and tests are executed against the crate outputs.	`src/services/validation/*` (Black, Ruff, syntax, pytest runner)	Shipped
6. Constitutional audits	Mind runs policy and safety checks over the proposed changes.	`src/mind/governance/`, especially `audit_context.py`, `auditor.py`, `checks/`	Shipped
7. Decision & feedback	Crate is marked as accepted or rejected; findings are recorded for inspection.	`src/body/services/crate_processing_service.py`, governance services	Shipped (manual integration)

The key point: every autonomous change is validated and audited before you ever merge it.

Some parts of the original design are still aspirational. This section makes that explicit.

These pieces exist in src/ and are used by the CLI:

Autonomous Developer core
src/features/autonomy/autonomous_developer.py
src/will/agents/coder_agent.py
src/will/orchestration/cognitive_service.py
src/will/orchestration/prompt_pipeline.py
src/will/orchestration/validation_pipeline.py
Crate lifecycle (without long‑running daemon)
src/body/services/crate_creation_service.py
src/body/services/crate_processing_service.py
Self‑healing & coverage tooling
src/features/self_healing/* – coverage analyzers, test generators, remediation services.
src/mind/governance/checks/* – coverage, style, ID hygiene, knowledge, security checks.
Validation tools
src/services/validation/black_formatter.py
src/services/validation/ruff_linter.py
src/services/validation/test_runner.py
plus syntax and YAML validators.

These are described in the vision and planning docs, but do not yet exist as working commands or services in src/:

Crate daemon
Background worker that continuously picks up crates and processes them.
CLI group like core-admin daemon ... and corresponding systemd unit.
Rich crate management CLI
Commands such as core-admin crate status, crate list, crate show, crate retry.
A structured namespace for inspecting, filtering, and replaying crates.
Debt scanner & auto‑refactorer
Commands such as core-admin develop scan for automated technical debt surveys.
Dedicated refactorer services beyond the current self‑healing and coverage tooling.
Peer review CLI group
Commands like core-admin review export / review constitution to assist human reviewers.

Whenever you see these in the docs, treat them as Phase 2+ capabilities rather than currently‑shipped behaviour.

Autonomous development is not a separate subsystem; it is the coordinated use of Mind, Body, and Will.

Mind (src/mind/)
Defines and enforces the rules: policies, checks, auditors, constitutional gates.
Validates that any autonomous change is safe, compliant, and within scope.
Body (src/body/, src/features/, src/services/)
Executes the work: crate services, validation tools, persistence, context builders.
Exposes CLI commands via core-admin (Typer apps in src/body/cli).
Will (src/will/)
Hosts the agents and orchestration glue that actually decide what to generate.
Uses LLMs (via src/services/llm/*) under the constraints provided by Mind.

When you run core-admin develop feature ..., you are effectively triggering a Mind‑governed Will acting through Body.

To keep expectations realistic, here are the most important current limitations:

No always‑on daemon yet Crates are processed when you invoke CLI commands; there is no long‑running daemon that automatically drains crate queues.
Manual integration step CORE can prepare governed crates; you still decide when and how to merge them into your main branch or deployment pipeline.
Limited refactor automation Self‑healing and refactoring capabilities exist, but the full "debt scanner" vision is not yet implemented as a single develop scan command.
Some CLI docs describe future commands If a command referenced in the docs is missing from core-admin --help, consider it planned rather than a bug.

CORE is deliberately conservative:

Autonomy is always governed – Mind (policies, audits, constitution) can veto any generated change.
Crates are the unit of work – everything autonomous happens inside a crate with clear boundaries.
Self‑healing is first‑class – coverage, style, IDs, and knowledge alignment are treated as autonomous tasks, not chores.

When you extend CORE (new features, new agents, new policies), treat this page as the contract:

If something is described as shipped, keep the implementation in sync.
If you add a new autonomous capability, add it to the pipeline and explicitly mark its status here.

That way, this document remains a truthful, high‑signal description of what CORE can actually do today.