SocAIty/py_audio2face

Does this project support streaming audio input to audio2face to drive lip shapes?

Closed this issue · 9 comments

Thank you for your great work! May I ask whether this project supports streaming audio into audio2face to drive lip shapes, rather than using voice files?

Hey,

I haven't implemented that yet; however, I don't think it's that complicated, since Nvidia's headless server supports it.
Feel free to contribute; I would appreciate it!

Best regards

Thank you for the selfless contribution! Could you go a step further and implement real-time streaming to drive the animation?

Confirmed. Will work on it. The project will be improved and continued.

Started working on it. In a second step I will also move away from plain requests and build on fastSDK and media-toolkit; that step will take longer, though. I'll release an upgraded module first. I can't tell you yet how long it will take, but I expect to finish in a few weeks.

@singelhero @CasonTsai can you give me a bit more information about where your input stream comes from?
Do you expect live microphone input? What are the aims of the projects you're working on?
What would this feature look like, in your opinion?

To begin with, I would implement chunking an audio file and feeding the chunks to the stream; a rough sketch of the idea follows.
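
A minimal sketch of that chunking idea in Python, assuming the soundfile package; `send_chunk` is a hypothetical callback standing in for whatever the actual stream endpoint turns out to be:

```python
# Rough sketch: read an audio file and hand it to a stream in fixed-size chunks.
# send_chunk is a hypothetical callback standing in for the real stream endpoint.
import soundfile as sf

def stream_audio_file(path: str, chunk_seconds: float, send_chunk):
    with sf.SoundFile(path) as f:
        frames_per_chunk = int(f.samplerate * chunk_seconds)
        while True:
            # Read up to frames_per_chunk frames; returns an empty array at EOF.
            chunk = f.read(frames_per_chunk, dtype="float32")
            if len(chunk) == 0:
                break
            send_chunk(chunk, f.samplerate)

# Usage: stream_audio_file("voice.wav", 0.1, my_send_fn)
```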

Finished the streaming implementation.
@singelhero, @CasonTsai please check the feature; it is especially interesting with LiveLink. Please share your results, and raise a new issue if something is missing or misbehaving.
Next steps: fastSDK
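
For anyone trying the feature, a hedged usage sketch is below; the `Audio2Face` class is from py_audio2face, but the streaming method name and its arguments shown here are placeholders for illustration, so treat the readme as the authoritative reference:

```python
# Hypothetical usage sketch -- the streaming method name and its arguments
# are placeholders; check the project readme for the actual API.
import py_audio2face

a2f = py_audio2face.Audio2Face()
# Feed an audio file chunk by chunk to the running Audio2Face instance.
a2f.audio2face_stream(audio_file_path="voice.wav", output_path="out/")
```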

@w4hns1nn Yes.

I want to build a MetaHuman in Unreal. The input stream comes from live microphone input or from audio data in memory. I found a solution in the official Audio2Face documentation; it uses gRPC.
Excuse me, I'd also like to know: is there a way to use audio2face without opening Omniverse?

@CasonTsai
I've added the streaming feature, which uses gRPC. Check the updated readme for it; a minimal sketch of the underlying gRPC call follows below.
About the second question:

  • Please check whether the headless server also uses Omniverse; py_audio2face runs with the headless server.
  • I plan something similar, but it will be the end of the year before I can publish anything about it.
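
For reference, a minimal sketch of the kind of gRPC call involved, modeled on NVIDIA's streaming audio player sample. It assumes the `audio2face_pb2` / `audio2face_pb2_grpc` modules generated from NVIDIA's `audio2face.proto`, the sample's default port 50051, and a streaming player at `/World/audio2face/PlayerStreaming`; adjust these to your scene:

```python
# Sketch of streaming mono float32 audio to Audio2Face over gRPC, modeled on
# NVIDIA's streaming audio player sample. The generated module names and the
# instance path are assumptions that depend on your setup.
import grpc
import numpy as np
import soundfile as sf

import audio2face_pb2
import audio2face_pb2_grpc


def push_audio_stream(audio, samplerate, url="localhost:50051",
                      instance_name="/World/audio2face/PlayerStreaming"):
    chunk_size = samplerate // 10  # ~100 ms of audio per message

    def request_generator():
        # The first message carries only metadata (the start marker).
        start = audio2face_pb2.PushAudioRequestStart(
            samplerate=samplerate,
            instance_name=instance_name,
            block_until_playback_is_finished=True,
        )
        yield audio2face_pb2.PushAudioStreamRequest(start_marker=start)
        # Subsequent messages carry raw float32 PCM bytes.
        for i in range(0, len(audio), chunk_size):
            chunk = audio[i : i + chunk_size]
            yield audio2face_pb2.PushAudioStreamRequest(
                audio_data=chunk.astype(np.float32).tobytes()
            )

    with grpc.insecure_channel(url) as channel:
        stub = audio2face_pb2_grpc.Audio2FaceStub(channel)
        response = stub.PushAudioStream(request_generator())
        print("success" if response.success else response.message)


if __name__ == "__main__":
    data, sr = sf.read("voice.wav", dtype="float32")  # expects a mono file
    push_audio_stream(data, sr)
```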

thanks