Tencent Hunyuan — Agentic Video World Modeling

Streaming audio-visual generation with semantic-temporal alignment via Hierarchical World State Memory.

As a Research Intern at Tencent Hunyuan, I work on agentic video world modeling that couples reasoning with generation:

  • Streaming audio-visual generation with semantic and temporal alignment.
  • Hierarchical World State Memory for long-horizon consistency.
  • Closed-loop feedback between perception, reasoning, and generation.