Livoa LogoLivoa
Text Token
Codec Embedding
Text Embedding
Vision Hidden
Audio Hidden
Codec Hidden
Pad Hidden
Qwen3-Omni
Streaming Codec Decoder
MTP Module
Qwen3-Omni MoE Talker
Hidden extraction from middle layers
Qwen3-Omni MoE Thinker
Vision Encoder
AuT
Please describe this video with audio.
According to the ... are available.

gpt

by luffy

0
0 uses