Autonomous Development – What Actually Works Today
Status: This page describes what works right now in CORE, and clearly separates shipped behaviour from planned extensions.
CORE can already take a natural‑language goal, turn it into a governed "crate" of work, and drive it through:
- Autonomous code generation
- Validation (formatting, linting, tests)
- Constitutional audits
All of this is performed under the Mind–Body–Will architecture and enforced by the .intent/ constitution.
1. Quick demo (what actually runs)
From your project root:
# Start your environment
poetry install
# Example: ask CORE to implement a small feature
poetry run core-admin develop feature "Add health endpoint"
In the background, CORE will:
- Create a crate describing your request (intent, context, constraints).
- Use the Autonomous Developer pipeline to generate code, tests, and any supporting artifacts.
- Run the validation pipeline (formatters, linters, tests) over the crate outputs.
- Run constitutional audits (Mind) before any change is considered acceptable.
The result is a governed feature crate that is either:
- ✅ Accepted – passes validation and audits, ready to be integrated, or
- ❌ Rejected – captured with findings so you can inspect what failed.
The exact CLI flags and options are documented in the CLI Reference and CLI Workflows.
2. End‑to‑end pipeline
This is the high‑level autonomous development pipeline as implemented in CORE today.
| Stage | Description | Primary components | Status |
|---|---|---|---|
| 1. Intent capture | You describe the goal in natural language via CLI. | core-admin develop (Typer app under src/body/cli), will/cli_logic/run.py / develop.py wiring |
Shipped |
| 2. Crate creation | The request is turned into a structured crate: metadata, constraints, files to touch, and context. | src/body/services/crate_creation_service.py |
Shipped |
| 3. Context building | Relevant code, docs, and manifest metadata are collected into an LLM‑ready context. | src/services/context/builder.py, src/services/context/providers/* |
Shipped |
| 4. Autonomous coding | Agents generate or modify code, tests, and docs inside the crate. | src/features/autonomy/autonomous_developer.py, src/will/agents/coder_agent.py, src/will/orchestration/prompt_pipeline.py |
Shipped |
| 5. Local validation | Code is formatted, linted, and tests are executed against the crate outputs. | src/services/validation/* (Black, Ruff, syntax, pytest runner) |
Shipped |
| 6. Constitutional audits | Mind runs policy and safety checks over the proposed changes. | src/mind/governance/*, especially audit_context.py, auditor.py, checks/* |
Shipped |
| 7. Decision & feedback | Crate is marked as accepted or rejected; findings are recorded for inspection. | src/body/services/crate_processing_service.py, governance services |
Shipped (manual integration) |
The key point: every autonomous change is validated and audited before you ever merge it.
3. What is shipped vs planned
Some parts of the original design are still aspirational. This section makes that explicit.
3.1. Shipped today
These pieces exist in src/ and are used by the CLI:
-
Autonomous Developer core
-
src/features/autonomy/autonomous_developer.py src/will/agents/coder_agent.pysrc/will/orchestration/cognitive_service.pysrc/will/orchestration/prompt_pipeline.py-
src/will/orchestration/validation_pipeline.py -
Crate lifecycle (without long‑running daemon)
-
src/body/services/crate_creation_service.py -
src/body/services/crate_processing_service.py -
Self‑healing & coverage tooling
-
src/features/self_healing/*– coverage analyzers, test generators, remediation services. -
src/mind/governance/checks/*– coverage, style, ID hygiene, knowledge, security checks. -
Validation tools
-
src/services/validation/black_formatter.py src/services/validation/ruff_linter.pysrc/services/validation/test_runner.py- plus syntax and YAML validators.
3.2. Planned / not yet implemented
These are described in the vision and planning docs, but do not yet exist as working commands or services in src/:
-
Crate daemon
-
Background worker that continuously picks up crates and processes them.
-
CLI group like
core-admin daemon ...and corresponding systemd unit. -
Rich crate management CLI
-
Commands such as
core-admin crate status,crate list,crate show,crate retry. -
A structured namespace for inspecting, filtering, and replaying crates.
-
Debt scanner & auto‑refactorer
-
Commands such as
core-admin develop scanfor automated technical debt surveys. -
Dedicated refactorer services beyond the current self‑healing and coverage tooling.
-
Peer review CLI group
-
Commands like
core-admin review export/review constitutionto assist human reviewers.
Whenever you see these in the docs, treat them as Phase 2+ capabilities rather than currently‑shipped behaviour.
4. How this maps to Mind–Body–Will
Autonomous development is not a separate subsystem; it is the coordinated use of Mind, Body, and Will.
-
Mind (
src/mind/) -
Defines and enforces the rules: policies, checks, auditors, constitutional gates.
-
Validates that any autonomous change is safe, compliant, and within scope.
-
Body (
src/body/,src/features/,src/services/) -
Executes the work: crate services, validation tools, persistence, context builders.
-
Exposes CLI commands via
core-admin(Typer apps insrc/body/cli). -
Will (
src/will/) -
Hosts the agents and orchestration glue that actually decide what to generate.
- Uses LLMs (via
src/services/llm/*) under the constraints provided by Mind.
When you run core-admin develop feature ..., you are effectively triggering a Mind‑governed Will acting through Body.
5. Current limitations
To keep expectations realistic, here are the most important current limitations:
-
No always‑on daemon yet Crates are processed when you invoke CLI commands; there is no long‑running daemon that automatically drains crate queues.
-
Manual integration step CORE can prepare governed crates; you still decide when and how to merge them into your main branch or deployment pipeline.
-
Limited refactor automation Self‑healing and refactoring capabilities exist, but the full "debt scanner" vision is not yet implemented as a single
develop scancommand. -
Some CLI docs describe future commands If a command referenced in the docs is missing from
core-admin --help, consider it planned rather than a bug.
6. How to think about autonomous development in CORE
CORE is deliberately conservative:
- Autonomy is always governed – Mind (policies, audits, constitution) can veto any generated change.
- Crates are the unit of work – everything autonomous happens inside a crate with clear boundaries.
- Self‑healing is first‑class – coverage, style, IDs, and knowledge alignment are treated as autonomous tasks, not chores.
When you extend CORE (new features, new agents, new policies), treat this page as the contract:
- If something is described as shipped, keep the implementation in sync.
- If you add a new autonomous capability, add it to the pipeline and explicitly mark its status here.
That way, this document remains a truthful, high‑signal description of what CORE can actually do today.