Skip to content

docs: add LoRA-per-task design document#54

Merged
abrichr merged 1 commit intomainfrom
docs/lora-per-task-design
Mar 17, 2026
Merged

docs: add LoRA-per-task design document#54
abrichr merged 1 commit intomainfrom
docs/lora-per-task-design

Conversation

@abrichr
Copy link
Member

@abrichr abrichr commented Mar 16, 2026

Summary

  • Literature-backed design for task-specific LoRA adapters with runtime routing
  • Covers architecture (registry, router, PolicyAgent with dynamic LoRA), training pipeline, tiered data collection strategy
  • Identifies correction flywheel (PR openadapt-evals#116) as natural training data source: corrections → SFT data → LoRA
  • Compares LoRA-per-task vs demo-conditioning with hybrid strategy
  • 30+ paper references (ShowUI-Aloha, SeeClick, LoRAHub, S-LoRA, etc.)
  • Positioned as one experiment track within the broader experimentation framework

Test plan

  • Review doc for accuracy and completeness
  • Validate against current codebase (existing infra referenced)

🤖 Generated with Claude Code

Literature-backed design for task-specific LoRA adapters with runtime
routing. Covers architecture, training pipeline, data collection
(including correction flywheel as training data source), update
economics, and validation plan. Positioned as one experiment track
within the broader OpenAdapt experimentation framework.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@abrichr abrichr merged commit d6d63b6 into main Mar 17, 2026
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant