Task 1: Enhanced Pose Control (Priority)
Objective: Implement superior pose estimation and tracking using Meta Sapiens model.
Implementation Steps:
- Sapiens Integration:
- Deploy Sapiens model as custom ComfyUI nodes
- Ensure compatibility with DWPose / OpenPose rig format
- Replace current DWPose preprocessor
- Performance Remapping:
- Handle dimensional mismatches between reference image and base video
- Map key points between reference and target performances
- Ensure pose consistency across different aspect ratios/scales
Task 2: Trajectory Control System
Objective: Track and control significant objects/elements throughout video sequences.
Implementation Approach:
- Use SAM2 for bounding box detection and tracking
- Identify significant objects in initial frames
- Maintain consistent tracking across entire video sequence
- Python + ComfyUI prototype development
Deliverables:
- Bounding box detection system
- Format bounding box information as VACE preprocessor
- Multi-object tracking capability
- Integration with existing video pipeline
Task 3: Camera Path Control