docs: add LoRA-per-task design document by abrichr · Pull Request #54 · OpenAdaptAI/openadapt-ml

abrichr · 2026-03-16T22:57:54Z

Summary

Literature-backed design for task-specific LoRA adapters with runtime routing
Covers architecture (registry, router, PolicyAgent with dynamic LoRA), training pipeline, tiered data collection strategy
Identifies correction flywheel (PR openadapt-evals#116) as natural training data source: corrections → SFT data → LoRA
Compares LoRA-per-task with demo-conditioning, including a hybrid strategy
30+ paper references (ShowUI-Aloha, SeeClick, LoRAHub, S-LoRA, etc.)
Positioned as one experiment track within the broader experimentation framework
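
To make the registry and router pieces listed above concrete, here is a minimal sketch of the idea. It is not taken from the design document: every name (`AdapterEntry`, `AdapterRegistry`, `route`, the adapter paths) is hypothetical, and the keyword-overlap scoring is a stand-in for whatever routing method the document actually specifies.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class AdapterEntry:
    """Hypothetical registry record for one task-specific LoRA adapter."""
    task_id: str
    adapter_path: str          # where the adapter weights would live (assumed layout)
    keywords: Tuple[str, ...]  # lowercase routing hints (assumed mechanism)


class AdapterRegistry:
    """In-memory mapping from task id to adapter entry."""

    def __init__(self) -> None:
        self._entries: Dict[str, AdapterEntry] = {}

    def register(self, entry: AdapterEntry) -> None:
        self._entries[entry.task_id] = entry

    def entries(self) -> List[AdapterEntry]:
        return list(self._entries.values())


def route(instruction: str, registry: AdapterRegistry) -> Optional[AdapterEntry]:
    """Pick the adapter whose keywords overlap most with the instruction.

    Returns None when nothing matches, so the caller can decide what to do
    without a task-specific adapter.
    """
    words = set(instruction.lower().split())
    best: Optional[AdapterEntry] = None
    best_score = 0
    for entry in registry.entries():
        score = len(words & set(entry.keywords))
        if score > best_score:
            best, best_score = entry, score
    return best


registry = AdapterRegistry()
registry.register(
    AdapterEntry("invoice_export", "adapters/invoice_export", ("invoice", "export"))
)
registry.register(
    AdapterEntry("calendar_invite", "adapters/calendar_invite", ("calendar", "meeting"))
)

match = route("Export the invoice as PDF", registry)
print(match.task_id if match else "no adapter matched")
```

Returning `None` rather than a default adapter keeps the no-match decision in the caller, which is one place a hybrid with demo-conditioning could plug in.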

Test plan

Review doc for accuracy and completeness
Validate against current codebase (existing infra referenced)

🤖 Generated with Claude Code

Literature-backed design for task-specific LoRA adapters with runtime routing. Covers architecture, training pipeline, data collection (including correction flywheel as training data source), update economics, and validation plan. Positioned as one experiment track within the broader OpenAdapt experimentation framework.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

abrichr merged commit d6d63b6 into main Mar 17, 2026
4 checks passed

abrichr merged 1 commit into main from docs/lora-per-task-design
