IBL News | New York
The Chinese company Tencent introduced an open-source AI model, called HunyuanWorld-Voyager, that turns photos and images into explorable 3D sequences.
This video diffusion framework achieved the highest overall score of 77.62 on Stanford University’s WorldScore benchmark, surpassing competitors including WonderWorld (72.69) and CogVideoX-I2V (62.15).
HunyuanWorld-Voyager builds upon Tencent’s earlier HunyuanWorld 1.0 model, released in July. The new system generates both RGB video and depth information simultaneously, allowing users to navigate virtual environments through keyboard or joystick controls.
According to Tencent’s announcement on September 2, HunyuanWorld-Voyager is the industry’s first ultra-long-range world model with native 3D reconstruction capabilities.