
[WIP] Add an Avatar Chatbot (Audio) example #523

Closed · wants to merge 39 commits

Conversation

@ctao456 (Collaborator) commented Aug 4, 2024

Description

Initiate Avatar Chatbot (Audio) example

Issues

opea-project/docs#59

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

Wav2Lip-GFPGAN

Appends an "animation" microservice to the AudioQnA example to create a new avatar chatbot example:
opea-project/GenAIComps#400

Tests

curl http://${host_ip}:3009/v1/avatarchatbot \
  -X POST \
  -d @sample_whoareyou.json \
  -H 'Content-Type: application/json'

If the megaservice is running properly, you should see the following output:

"/outputs/result.mp4"

ctao456 and others added 30 commits June 24, 2024 18:47
Signed-off-by: Chun Tao <[email protected]>
@ctao456 ctao456 requested a review from Spycsh as a code owner August 5, 2024 20:44
ctao456 and others added 2 commits August 5, 2024 20:00
@Spycsh (Member) commented Aug 6, 2024

Hi @ctao456, thanks for the contribution. The talking avatar with Wav2Lip-GFPGAN looks good. Before reviewing, I have a few questions. I notice that you use HPU to run Wav2Lip and report the latency as "10-50 seconds for AvatarAnimation on Gaudi". How long is the driven audio? Is that the latency of the first run or after a warm-up? Have you tried to optimize the related models (the Wav2Lip model, GFPGAN) on HPU? With optimization they could be faster by fully utilizing the static shape feature on Gaudi.

@ctao456 (Collaborator, Author) commented Aug 6, 2024


Hi @Spycsh, thank you for your comments.

  1. In the demo video, the driven audio was 22 seconds long, and the inference time was around 50 seconds using both the Wav2Lip-GAN and GFPGAN models (--inference_mode set to wav2lip+gfpgan). Switching the --inference_mode flag to wav2lip_only gives a significant speedup, with some tradeoff in face restoration quality.
  2. "10-50 seconds for AvatarAnimation on Gaudi" is the latency of the first run, without warm-up. But we can try including a warm-up to speed it up (a sketch of the idea follows this list).
  3. Thank you for your suggestion. The current efforts focus on building the micro- and megaservice architecture. We will gradually add more features for
    a. HPU optimization (eager mode with torch.compile vs. lazy mode, torch.jit, HPU graphs, BF16 & INT8 precision, etc.) to accelerate graph inference
    b. Distributed inference on multiple Gaudi cards, using DeepSpeed
    c. Support for more SoTA face animation models (SadTalker, LivePortrait, etc.)
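
A rough illustration of the warm-up idea from point 2 above. This is not the PR's actual code; animate() and the file names are hypothetical stand-ins for the animation microservice's inference call:

import time

def animate(audio_path):
    # Hypothetical placeholder for the Wav2Lip (+ GFPGAN) inference call.
    ...

# Warm-up run on a short dummy clip: the first call on Gaudi pays one-time
# graph compilation / caching costs that should not count toward latency.
animate("dummy_1s.wav")

# Timed run: this measures the steady-state AvatarAnimation latency.
start = time.time()
animate("driven_audio_22s.wav")
print(f"AvatarAnimation latency: {time.time() - start:.1f} s")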

@louie-tsai (Collaborator) left a comment:

looks good. minor feedback

Resolved review threads: AvatarChatbot/docker/Dockerfile (2), AvatarChatbot/docker/gaudi/README.md (4)
@ctao456 ctao456 marked this pull request as draft August 21, 2024 18:21
@louie-tsai louie-tsai closed this Aug 21, 2024
@louie-tsai (Collaborator) left a comment:

looks good. leave some comments

Resolved review thread: AvatarChatbot/docker/avatarchatbot.py
outputs=[video_output, video_time_text],
)

demo.queue().launch(server_name="0.0.0.0", server_port=65535) # demo port is 65535
Collaborator comment:

Good to make the server name and port variables at the beginning; it doesn't hurt to make them env variables.
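
A minimal sketch of what that could look like in the Gradio UI script; the env-var names UI_SERVER_NAME and UI_SERVER_PORT are illustrative, not part of the PR:

import os
import gradio as gr

# Defaults mirror the hard-coded values in the excerpt above.
server_name = os.getenv("UI_SERVER_NAME", "0.0.0.0")
server_port = int(os.getenv("UI_SERVER_PORT", "65535"))

with gr.Blocks() as demo:
    gr.Markdown("Avatar Chatbot demo placeholder")

demo.queue().launch(server_name=server_name, server_port=server_port)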

# Prepare 3 image paths
# HOME = os.getenv("HOME")
# HOME="/mnt/localdisk4"
HOME = "/home/demo/"
Collaborator comment:

We might need to make "HOME" configurable via an env variable or parameter.
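
One possible way to do that, as a sketch; AVATAR_HOME is an illustrative variable name, not something the PR defines:

import os

# Read an override from the environment, fall back to the user's $HOME,
# then to the current hard-coded default.
HOME = os.getenv("AVATAR_HOME", os.getenv("HOME", "/home/demo/"))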

asyncio.set_event_loop(loop)
audio_file = loop.run_until_complete(aiavatar_demo(audio))
count += 1
end_time = time.time()
Collaborator comment:

For the latency, we might need to support a statistics RESTful API in the end.
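
A minimal sketch of what such a statistics endpoint might look like, using FastAPI purely as an illustration; the route path and response fields are assumptions, not the project's actual API:

from fastapi import FastAPI

app = FastAPI()

# Per-request latencies in seconds; in the real service this would be filled
# in around the animation call (e.g. from the end_time - start_time above).
latencies: list[float] = []

@app.get("/v1/statistics")
def get_statistics():
    if not latencies:
        return {"count": 0, "avg_s": None, "max_s": None}
    return {
        "count": len(latencies),
        "avg_s": sum(latencies) / len(latencies),
        "max_s": max(latencies),
    }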

outputs=[video_output, video_time_text],
)

demo.queue().launch(server_name="0.0.0.0", server_port=7861)
Collaborator comment:

Good to make the server name and port env variables.

# 2. Run inference.sh bash script to perform Wav2Lip+GFPGAN inference
# Output video is saved at the path 'OUTFILE'
command_wav2lip_gfpgan = "bash inference_vars.sh"
subprocess.run(command_wav2lip_gfpgan, shell=True)
Collaborator comment:

This might potentially raise an issue from the Intel IP scan.
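
If part of the concern is the shell=True invocation (something code scanners often flag), one possible alternative is to pass the command as an argument list; a minimal sketch, not the PR's code:

import subprocess

# Run the wrapper script directly, without going through a shell.
subprocess.run(["bash", "inference_vars.sh"], check=True)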

wangkl2 pushed a commit to wangkl2/GenAIExamples that referenced this pull request Dec 11, 2024