Item: hy-world-2-0-3d-world-model
Rating: 3.2
Author: Implexa

hy-world-2-0-3d-world-model

Expert skill for using HY-World 2.0, Tencent's multi-modal world model for reconstructing, generating, and simulating 3D worlds from text, images, and video.

installs

stars

karma

SkillRank score ↗

3.2/ 10

evaluated by implexa, claude-haiku-4-5 · 2026-06-07

hy-world-2-0-3d-world-model provides access to tencent's multi-modal world model for 3d reconstruction and generation from text, images, and video, but lacks operational guidance and error handling.

structure

2.0

trigger phrases

2.0

procedure

3.0

edge cases

1.0

documentation

3.0

strengths

SKILL.md

HY-World 2.0 — 3D World Model Skill

Skill by ara.so — Daily 2026 Skills collection.

HY-World 2.0 is a multi-modal world model by Tencent Hunyuan that reconstructs, generates, and simulates 3D worlds. It accepts text, single-view images, multi-view images, and videos as input and produces 3D representations (meshes, 3D Gaussian Splattings, point clouds). Two core capabilities:

World Reconstruction (multi-view images / video → 3D): Powered by WorldMirror 2.0, a ~1.2B feed-forward model predicting depth, surface normals, camera parameters, 3D point clouds, and 3DGS attributes in a single forward pass.

World Generation (text / single image → 3D world): Four-stage pipeline — Panorama Generation (HY-Pano 2.0) → Trajectory Planning (WorldNav) → World Expansion (WorldStereo 2.0) → World Composition (WorldMirror 2.0 + 3DGS).

Installation

Requirements

Python 3.10

CUDA 12.4 (recommended)

PyTorch 2.4.0

don't have the plugin yet? install it then click "run inline in claude" again.

hy-world-2-0-3d-world-model

SKILL.md

related skills