Search before asking
Description
Following up on discussions in #418 and #169, and recent conversations regarding the roadmap for Keypoint support.
I understand that full pre-training for a pose model is computationally heavy and currently a lower priority than OBB (#56). However, I would like to contribute by implementing the architectural components as of right now, so the model is "pose-ready" when the team decides to allocate compute resources for training.
Proposed Implementation Plan:
- Model Architecture: Extend the Transformer Decoder to support a Keypoint Head (likely an MLP predicting
K points per query).
- Matcher: Update the Hungarian Matcher to include Keypoint cost (using OKS - Object Keypoint Similarity).
- Loss Function: Implement the OKS Loss calculation for the training loop.
- Data Pipeline: Ensure the dataloader structure can accept COCO-Keypoint format annotations.
Use case
The Keypoint support is on developers' roadmap
Additional
Questions for the Team
- Do you prefer I mirror the implementation style of
RT-DETR / YOLOv8-Pose for the head?
- Should this live in a specific feature branch (e.g.,
feat/keypoints)?
I am happy to open a PR with the structural changes + unit tests verifying the forward pass/loss calculation works on dummy data.
Are you willing to submit a PR?
Search before asking
Description
Following up on discussions in #418 and #169, and recent conversations regarding the roadmap for Keypoint support.
I understand that full pre-training for a pose model is computationally heavy and currently a lower priority than OBB (#56). However, I would like to contribute by implementing the architectural components as of right now, so the model is "pose-ready" when the team decides to allocate compute resources for training.
Proposed Implementation Plan:
Kpoints per query).Use case
The Keypoint support is on developers' roadmap
Additional
Questions for the Team
RT-DETR/YOLOv8-Posefor the head?feat/keypoints)?I am happy to open a PR with the structural changes + unit tests verifying the forward pass/loss calculation works on dummy data.
Are you willing to submit a PR?