Within the organization, new intelligent applications (such as internal information search and proposal apps using generative AI) are being developed daily. The toolsets, skillsets, and approaches required for developing generative AI application workloads demand dynamism and speed, making a "develop once and done" or "build-and-shelve" mindset obsolete.
This architecture aims to enable the organization to efficiently and swiftly deploy intelligent application workloads by addressing the following personas and challenges:
- Personas
  - Project leaders and business leaders who are about to develop new intelligent applications.
  - Project leaders and business leaders aiming to generalize already developed intelligent application workloads and turn them into templates.
  - Project leaders and business leaders who have multiple intelligent application development requests within the organization but are struggling to find the right approach and have not yet taken the first step.
- Target Challenges
  - Front Layer (User Interface)
    - Although multiple samples are available on the internet, customizing them for the organization is difficult.
    - Even when a sample from the internet is deployed within the organization, keeping up with the versions of the libraries and frameworks it uses is structurally challenging, raising security concerns.
  - Middle Layer (Data Orchestration)
    - Although multiple samples are available on the internet, customizing them for the organization is difficult.
    - Even when a sample from the internet is deployed within the organization, keeping up with the versions of the libraries and frameworks it uses is structurally challenging, raising security concerns.
    - Scaling as the number of users grows is not easy, and a lack of cloud computing knowledge makes sustained operation difficult.
  - Back-End Layer (Generative AI Model/Data Store)
    - Centralized policy settings that ensure security and enable cross-organizational use of generative AI models are desired, but the method is unknown.
    - Client applications frequently encounter errors caused by the token limits of the generative AI model.
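The token-limit errors above typically arise when retrieved context plus the user's question exceed the model's input budget. A minimal sketch of a client-side guard is shown below; the token budget and the 4-characters-per-token estimate are rough assumptions for illustration (production code would use a real tokenizer such as tiktoken for the target model):

```python
# Sketch: guard client requests against model token limits by packing
# retrieved chunks into the prompt until an assumed budget is reached,
# instead of letting the model API reject the whole request.

MAX_INPUT_TOKENS = 3000  # hypothetical budget, not a real model limit


def estimate_tokens(text: str) -> int:
    """Very rough estimate: ~4 characters per token (assumption)."""
    return max(1, len(text) // 4)


def fit_context(chunks: list[str], question: str,
                budget: int = MAX_INPUT_TOKENS) -> str:
    """Keep document chunks in order, dropping those that would
    overflow the token budget, then append the question."""
    used = estimate_tokens(question)
    kept = []
    for chunk in chunks:
        cost = estimate_tokens(chunk)
        if used + cost > budget:
            break  # stop before exceeding the budget
        kept.append(chunk)
        used += cost
    return "\n\n".join(kept + [question])
```

In a real deployment this kind of guard can also live in the gateway layer, so that every client application benefits from it without reimplementing the logic.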
To solve these challenges, it is crucial to design approaches suited to each layer. This architecture aims to improve operational and maintenance efficiency through low-code/no-code approaches and to reduce errors and delays in client applications by providing a common platform specialized for generative AI workloads.
The architecture above is built on the Copilot Stack.
The samples available for each layer, and their purposes, are as follows:
| Layer | Sample | Purpose |
| --- | --- | --- |
| App (User Interface) Layer | POC-CopilotStudio-Tips | Use Copilot Studio to provide an out-of-the-box chat interface, reducing frontend development labor. |
| AI Orchestration Layer | POC-PromptFlow-AdvancedRAG | Define access to internal documents and generative AI models with low code using Prompt Flow, configuring advanced RAG approaches such as query expansion and HyDE to improve maintainability. |
| AI Landing Zone (Data) Layer | POC-DocumentOrchestration | Execute document indexing with Document Intelligence using a pro-code/low-code approach to limit the maintenance scope. |
| AI Landing Zone (Data) Layer | POC-AI-Gateway | Establish a common generative AI platform that handles requests from all intelligent applications in the organization using API Management, enabling load balancing between models and token/cost visualization, and improving client application reliability. |
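To illustrate the query-expansion idea behind the advanced RAG sample: a single user query is rewritten into several variants, each variant retrieves documents, and the results are merged. The sketch below is a plain-Python stand-in for what Prompt Flow would express as a DAG; the hard-coded `expand_query` stub replaces the LLM rewrite step (or HyDE's hypothetical-answer generation), and scoring is simple keyword overlap rather than a real vector search. The corpus and names are hypothetical:

```python
# Sketch: query expansion for RAG over a toy corpus.
# An LLM would normally produce the query variants; here a stub does.

CORPUS = {
    "doc1": "expense report submission procedure for employees",
    "doc2": "travel reimbursement policy and approval workflow",
    "doc3": "office cafeteria weekly menu",
}


def expand_query(query: str) -> list[str]:
    # Stub standing in for an LLM rewrite step (assumption).
    return [query, query + " procedure", query + " policy"]


def retrieve(query: str, top_k: int = 2) -> list[str]:
    """Score documents by keyword overlap with the query."""
    terms = set(query.lower().split())
    scored = [(len(terms & set(text.lower().split())), doc_id)
              for doc_id, text in CORPUS.items()]
    scored.sort(reverse=True)
    return [doc_id for score, doc_id in scored[:top_k] if score > 0]


def retrieve_with_expansion(query: str) -> list[str]:
    """Merge retrieval results across all query variants, deduplicated."""
    seen, merged = set(), []
    for variant in expand_query(query):
        for doc_id in retrieve(variant):
            if doc_id not in seen:
                seen.add(doc_id)
                merged.append(doc_id)
    return merged
```

The merge step is why expansion improves recall: a variant containing "policy" surfaces documents the original wording of the query would have missed.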