Update AudioQnA README.md for its workflow (#903)

opea-project · Oct 8, 2024 · 63bad29 · 63bad29
1 parent 36d3ef2
commit 63bad29
Showing 1 changed file with 57 additions and 0 deletions.
diff --git a/AudioQnA/README.md b/AudioQnA/README.md
@@ -2,6 +2,63 @@
 
 AudioQnA is an example that demonstrates the integration of Generative AI (GenAI) models for performing question-answering (QnA) on audio files, with the added functionality of Text-to-Speech (TTS) for generating spoken responses. The example showcases how to convert audio input to text using Automatic Speech Recognition (ASR), generate answers to user queries using a language model, and then convert those answers back to speech using Text-to-Speech (TTS).
 
+The AudioQnA example is implemented using the component-level microservices defined in [GenAIComps](https://github.com/opea-project/GenAIComps). The flow chart below shows the information flow between different microservices for this example.
+
+```mermaid
+---
+config:
+  flowchart:
+    nodeSpacing: 400
+    rankSpacing: 100
+    curve: linear
+  themeVariables:
+    fontSize: 50px
+---
+flowchart LR
+    %% Colors %%
+    classDef blue fill:#ADD8E6,stroke:#ADD8E6,stroke-width:2px,fill-opacity:0.5
+    classDef orange fill:#FBAA60,stroke:#ADD8E6,stroke-width:2px,fill-opacity:0.5
+    classDef orchid fill:#C26DBC,stroke:#ADD8E6,stroke-width:2px,fill-opacity:0.5
+    classDef invisible fill:transparent,stroke:transparent;
+    style AudioQnA-MegaService stroke:#000000
+
+    %% Subgraphs %%
+    subgraph AudioQnA-MegaService["AudioQnA MegaService "]
+        direction LR
+        ASR([ASR MicroService]):::blue
+        LLM([LLM MicroService]):::blue
+        TTS([TTS MicroService]):::blue
+    end
+    subgraph UserInterface[" User Interface "]
+        direction LR
+        a([User Input Query]):::orchid
+        UI([UI server<br>]):::orchid
+    end
+
+
+
+    WSP_SRV{{whisper service<br>}}
+    SPC_SRV{{speecht5 service <br>}}
+    LLM_gen{{LLM Service <br>}}
+    GW([AudioQnA GateWay<br>]):::orange
+
+
+    %% Questions interaction
+    direction LR
+    a[User Audio Query] --> UI
+    UI --> GW
+    GW <==> AudioQnA-MegaService
+    ASR ==> LLM
+    LLM ==> TTS
+
+    %% Embedding service flow
+    direction LR
+    ASR <-.-> WSP_SRV
+    LLM <-.-> LLM_gen
+    TTS <-.-> SPC_SRV
+
+```
+
 ## Deploy AudioQnA Service
 
 The AudioQnA service can be deployed on either Intel Gaudi2 or Intel Xeon Scalable Processor.