Language-conditioned manipulation with NVIDIA GR00T N1.5.
Speak the task ('pick up the red block, put it in the bowl'); a vision-language-action policy plans the grasp, executes the trajectory, and reports success. Runs on Jetson AGX Thor with TensorRT-optimized inference. Reference implementation: SIGRobotics' Matcha Bot, embodied-AI hackathon winner.
Inspired by
Jetson AI Lab — Matcha Bot ↗