Tencent Hunyuan — Agentic Video World Modeling
Streaming audio-visual generation with semantic-temporal alignment via Hierarchical World State Memory.
As a Research Intern at Tencent Hunyuan, I work on agentic video world modeling that couples reasoning with generation:
- Streaming audio-visual generation with semantic and temporal alignment.
- Hierarchical World State Memory for long-horizon consistency.
- Closed-loop feedback between perception, reasoning, and generation.