A Control Framework for LLM-Assisted Research
Governing Research Definition Under Weak Supervision
The central problem is not that LLM-assisted research produces errors. It is that certain errors harden into structure while appearing to improve the work.
Most writing about LLM-assisted research focuses on output quality: better prompts, better retrieval, better model choice, better fact checking. Those matter, but they do not solve the deeper governance problem. A solo researcher working iteratively with an LLM can produce cleaner documents, tighter scope, and more coherent prose while still drifting away from the actual project they meant to define.
The danger is recursive. A false or weak claim gets accepted as a working premise. That premise shapes later framing. Later framing becomes a design constraint. Eventually the design constraint is treated as if it had always belonged to the project. By the time anyone notices the problem, the artifact set itself may encode the error as settled structure.
The governance problem
Traditional research environments impose external pressure on project definition. Advisors, peer reviewers, collaborators, and disciplinary norms all create friction between what feels coherent and what is actually warranted. A solo researcher using an LLM for extended knowledge work has much less of that friction. The tool can critique, revise, and restate — but it does so inside the same probabilistic channel that produced the original reasoning.
That means the usual revision loop becomes a closed loop. The same system helps draft the project, diagnose the project, repair the project, and evaluate whether the repair worked. Even when prompted to be adversarial, it is still reasoning from within a context shaped by the same premises, omissions, and framing decisions that created the draft in the first place.
This is why ordinary “improvement” prompts are insufficient. They can make documents better in surface terms without creating any mechanism that distinguishes real convergence from circular refinement.
A control-theory framing
The cleanest way to understand the problem is as a control problem. A functioning control system has a reference signal, a current state, a way to measure error, a bounded corrective mechanism, and a way to decide whether the system is converging or merely oscillating.
Reference signal
The frozen statement of what the project is fundamentally about and what would count as drift.
Governed object
The current proposal / pre-registration bundle that is being audited and revised.
Sensors
An adversarial audit layer that surfaces defect structure rather than merely polishing prose.
Controller
A bounded patch layer that revises the bundle in response to tracked defects rather than freeform “improvement.”
Convergence judge
A trajectory layer that decides whether the loop is converging, oscillating, drifting, or done.
What matters here is not the analogy for its own sake. It is that the analogy makes the missing pieces visible. Most LLM-assisted workflows have drafting and revision, but not a stable reference signal, persistent defect memory, or a convergence judgment that operates above individual revisions.
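The loop's roles can be sketched as interfaces. This is a hypothetical illustration, not a tool: every name here (IdentityAnchor, Bundle, audit, patch) is an assumption invented for the sketch, and a real sensor layer would be an adversarial audit pass, not the trivial substring check used below as a stand-in.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class IdentityAnchor:
    """Reference signal: frozen statement of what the project is about."""
    core_question: str
    invariants: tuple[str, ...]      # what must survive every revision
    drift_markers: tuple[str, ...]   # changes that would count as drift

@dataclass
class Bundle:
    """Governed object: the current proposal / pre-registration bundle."""
    text: str
    cycle: int = 0

def audit(bundle: Bundle, anchor: IdentityAnchor) -> list[str]:
    """Sensor: return named defects rather than polished prose.

    Placeholder logic only; a real audit layer would evaluate coherence,
    feasibility, identity preservation, and inferential discipline.
    """
    defects: list[str] = []
    for invariant in anchor.invariants:
        if invariant not in bundle.text:
            defects.append(f"identity: invariant missing: {invariant!r}")
    return defects

def patch(bundle: Bundle, defects: list[str]) -> Bundle:
    """Controller: a bounded revision in which every change names a defect."""
    notes = "\n".join(f"[addresses {d}]" for d in defects)
    return Bundle(text=bundle.text + "\n" + notes, cycle=bundle.cycle + 1)
```

The point of the sketch is the shape of the interfaces: the anchor is frozen, the audit emits named defects, and the patch is keyed to those names rather than to open-ended improvement.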
Why standard revision loops fail
Drift
The project quietly becomes a different project. Each revision is locally defensible, but the cumulative trajectory leads away from the original object.
Polish mistaken for progress
Documents become more elegant, more formal, and more structured without becoming more scientifically or methodologically sound.
Endless revision
The bundle keeps changing and critiques keep appearing, but there is no way to judge whether blocking defects are actually being exhausted.
These are not separate problems. They reinforce one another. A drifted project can still look polished. A polished project can still remain stuck in revision. And a team or individual without cross-cycle memory can mistake that activity for genuine progress.
What the framework requires
A workable governance framework needs more than a single audit prompt. It needs a small stack of cooperating constraints.
Identity preservation
A frozen identity anchor that records what must remain invariant across revision and what kinds of narrowing are legitimate.
Authority control
An explicit distinction between current governing artifacts and historical calibration or lineage artifacts.
Adversarial audit
A staged diagnostic layer that evaluates the bundle as a system: coherence, feasibility, identity preservation, and inferential discipline.
Persistent defect memory
A defect register that tracks defects as stable objects across cycles rather than letting each pass begin from scratch.
Bounded revision
A patch layer in which changes map to named defects instead of being justified as generic improvement.
Re-audit and convergence
A post-revision check and a trajectory layer capable of saying whether the system is progressing, circling, drifting, or ready to stop.
Taken together, these requirements make revision accountable. They force every cycle to answer not just “what changed?” but also “what problem was that change supposed to solve?” and “how do we know the system is closer to a valid terminal state than before?”
Five valid decision states
A governance framework without stopping conditions is not a governance framework. It is just a refinement engine. The convergence layer should be able to output only a small number of valid decision states.
Continue
Blocking defects remain and a plausible revision path still exists.
Commit
Major defects are exhausted and the bundle is ready to bind.
Narrow
A reduced scope still preserves the project’s identity anchor.
Pause
Key design commitments remain too unresolved for further revision to mean much.
Kill
The current form cannot become commitment-ready without unacceptable drift.
The presence of a “kill” state matters. LLM-assisted workflows are very good at producing another pass, another framing, another cleaner version. A real governance system must be able to conclude that continued revision would produce a better-looking artifact rather than a better-founded one.
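The five states can be made concrete as a small decision function over register state. The inputs and precedence order below are assumptions chosen for illustration; the framework only requires that some such rule exist outside the model's own outputs.

```python
from enum import Enum

class Decision(Enum):
    CONTINUE = "continue"
    COMMIT = "commit"
    NARROW = "narrow"
    PAUSE = "pause"
    KILL = "kill"

def decide(blocking_open: int,
           plausible_path: bool,
           identity_preserved_when_narrowed: bool,
           commitments_resolved: bool) -> Decision:
    """One hedged example of a convergence rule; not a canonical definition."""
    if blocking_open == 0:
        return Decision.COMMIT          # major defects exhausted: ready to bind
    if not commitments_resolved:
        return Decision.PAUSE           # further revision would not mean much
    if plausible_path:
        return Decision.CONTINUE        # defects remain, but repair is viable
    if identity_preserved_when_narrowed:
        return Decision.NARROW          # reduced scope keeps the anchor intact
    return Decision.KILL                # unreachable without unacceptable drift
```

Note that KILL is the fall-through case: it is reached only when every other exit has been ruled out, which is exactly the judgment an unaided revision loop never forces.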
What this framework does not yet solve
The most important limitation is that the framework, as presented here, is a specification rather than a working system: its layers are described, not built, and it has not yet been exercised across real multi-cycle projects. That limitation is not incidental. It is part of the intellectual honesty the framework itself requires. A governance architecture that cannot distinguish built components from proposed ones is already violating its own standards of authority control and state clarity.
Why this matters
The audience for this framework is broader than academic research in the narrow sense. Any long-lived knowledge project using LLMs across multiple cycles of revision — policy design, technical methodology, documentation strategy, product reasoning, or independent structured inquiry — encounters some version of the same problem. The issue is not merely hallucination. It is governance under weak supervision.
The practical value of the framework is that it relocates the problem. Instead of asking only how to get better answers from the model, it asks how to structure the work so that project identity, defect memory, revision discipline, and stopping conditions exist outside the immediate fluency of the model’s outputs.