clawhub

tao-generate-referring-expressions

Four-step image referring-expression pipeline: turns images plus KITTI bounding-box labels into region descriptions, scene captions, grounded referring expressions, and (optionally) verified expressions via VLM distillation. Use when the user wants to generate referring-expression annotations from images with KITTI labels, build region descriptions, produce grouped grounding phrases tied to bboxes, run a double-check verification pass on grounding expressions, auto-label traffic / scene images for referring datasets, or run the image_referring_expression pipeline. Triggers include 'referring expression', 'region description', 'KITTI labels', 'spatial relationship annotation', 'auto-label image referring expression', 'image_referring_expression'.
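As an illustration of the input side described above, here is a minimal sketch of reading KITTI bounding-box labels in Python. It assumes the standard KITTI object label layout (class name in field 0, 2D box as left, top, right, bottom in fields 4 to 7). The names `KittiBox`, `parse_kitti_line`, and `parse_kitti_labels` are hypothetical and are not taken from the skill itself, whose actual loader is not shown on this page.

```python
from dataclasses import dataclass


@dataclass
class KittiBox:
    """One 2D bounding box from a KITTI label line (hypothetical helper type)."""
    label: str
    left: float
    top: float
    right: float
    bottom: float


def parse_kitti_line(line: str) -> KittiBox:
    """Parse a single whitespace-separated KITTI label line.

    Assumes the standard KITTI object layout:
    type truncated occluded alpha left top right bottom ... (3D fields follow).
    """
    fields = line.split()
    if len(fields) < 8:
        raise ValueError(f"expected at least 8 fields, got {len(fields)}: {line!r}")
    left, top, right, bottom = (float(v) for v in fields[4:8])
    return KittiBox(fields[0], left, top, right, bottom)


def parse_kitti_labels(text: str) -> list[KittiBox]:
    """Parse the contents of one KITTI label file, skipping blank lines."""
    return [parse_kitti_line(line) for line in text.splitlines() if line.strip()]
```

Each parsed box carries the class name and pixel coordinates that a region-description or grounding step would need in order to tie a phrase to a specific bbox.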

The full SKILL.md lives at the source. Only the metadata for this skill is indexed here; the canonical SKILL.md is on clawhub.
