老牛影视免费观看电视剧-老牛影视文化传媒有限公司官方-老牛影院在线观看电视剧免费-老牛影视在线观看免费观看电视剧

position: EnglishChannel  > News> Upload a Photo, Get a Video

Upload a Photo, Get a Video

Source: Science and Technology Daily | 2025-06-11 11:29:11 | Author: LI LInxu

The rapid developments in AI have unlocked new possibilities for digital representation. With the help of AI models, you can now achieve a remarkable feat: bringing characters to life with just an image and an audio clip.

Jointly developed by Tencent Hunyuan and Tencent Music, the newly released HunyuanVideo-Avatar, a multimodal diffusion transformer-based model, is capable of simultaneously generating dynamic, emotion-controllable, and multi-character dialogue videos. This capability supports head-and-shoulder, half-body, and full-body views, encompassing multiple styles, species, and even dual-character scenes.

To put it simply, you just upload a photo and a voice clip, and the model figures out the context, emotion and lip movements to create a realistic animated video.

For instance, if you upload an image of a woman sitting on a beach with a guitar, along with a piece of lyrical music,  the model understands the scene as "a woman playing the guitar and singing a lyrical song by the sea," and subsequently generates a video of the woman performing the song.

The model provides video creators with highly consistent and dynamic video generation capabilities. Its versatility can unlock a myriad of applications in fields like entertainment, media, e-commerce, advertising and education.

It has already been applied in multiple scenarios within Tencent Music, such as AI companions for music listening, long-form audio podcasts, and music videos (MVs).

For example, on the app QQ Music, when users listen to songs by "AI Leehom" (a fully AI-driven singer created by Tencent Music and Team Leehom), a lively and adorable AI Leehom image synchronizes its singing in real-time on the player.

On WeSing, a popular karaoke singing app, users can upload their images to generate personalized MVs of themselves singing.

In subject consistency and audio-video synchronization, the HunyuanVideo-Avatar shows top-tier industry performance. For video dynamics and natural body movements, it exceeds open-source solutions and rivals closed-source ones.

Currently, the model supports audio uploads of up to 14 seconds for video generation, with more capabilities to be released and open-sourced in the future.

Editor:李林旭

Top News

Xi Congratulates Science and Technology Daily on Its 40th Anniversary

Chinese President Xi Jinping has sent a congratulatory letter to the Science and Technology Daily on the occasion of the 40th anniversary of its founding.

Mystery of Qingming's Shifting Dates

Many people have noticed that the date on the Gregorian calendar of the Qingming Festival, a time when Chinese honor their ancestors, is not fixed. In 2025, Qingming fell on April 4, while in 2026 it falls on April 5. So, why does the date of Qingming vary from year to year, when the precise moment of its onset is calculated down to the minute? Science and Technology Daily spoke to Yan Weiguo, president of the Tianjin Astronomical Society, to find out the reason.

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網頁

您可以進行以下操作:

1.將瀏覽器切換回極速模式

2.點擊下面圖標升級或更換您的瀏覽器

3.暫不升級,繼續瀏覽

繼續瀏覽