Add new example of searchQnA on both xeon and gaudi #202

Merged 3 commits on Jul 22, 2024

zhlsunshine
Collaborator

Description

Add a new SearchQnA example for both Xeon and Gaudi.

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)

Dependencies

n/a.

Tests

root@satg-opea-4node-1:/home/huailong# kubectl get svc -n searchqa     
NAME                TYPE        CLUSTER-IP       EXTERNAL-IP   PORT(S)    AGE
embedding-svc       ClusterIP   172.21.58.179    <none>        6000/TCP   100m
llm-svc             ClusterIP   172.21.127.255   <none>        9000/TCP   99m
reranking-svc       ClusterIP   172.21.93.122    <none>        8000/TCP   99m
router-service      ClusterIP   172.21.129.137   <none>        8080/TCP   99m
tei-embedding-svc   ClusterIP   172.21.73.33     <none>        80/TCP     99m
tei-reranking-svc   ClusterIP   172.21.159.247   <none>        80/TCP     99m
tgi-service-m       ClusterIP   172.21.191.84    <none>        80/TCP     99m
web-retriever-svc   ClusterIP   172.21.31.70     <none>        7077/TCP   99m
root@satg-opea-4node-1:/home/huailong# 
root@satg-opea-4node-1:/home/huailong# kubectl get deployment -n searchqa
NAME                           READY   UP-TO-DATE   AVAILABLE   AGE
embedding-svc-deployment       1/1     1            1           100m
llm-svc-deployment             1/1     1            1           99m
reranking-svc-deployment       1/1     1            1           99m
router-server                  1/1     1            1           99m
tei-embedding-svc-deployment   1/1     1            1           100m
tei-reranking-svc-deployment   1/1     1            1           99m
tgi-service-m-deployment       1/1     1            1           99m
web-retriever-svc-deployment   1/1     1            1           100m
root@satg-opea-4node-1:/home/huailong#
root@satg-opea-4node-1:/home/huailong# kubectl get pods -n searchqa
NAME                                            READY   STATUS    RESTARTS   AGE
embedding-svc-deployment-85fbdbbdbf-vstxf       1/1     Running   0          41m
llm-svc-deployment-9d4bd55b7-vlltd              1/1     Running   0          40m
reranking-svc-deployment-6c988dbdc5-mxvr9       1/1     Running   0          41m
router-server-67fdbd7699-jg24l                  1/1     Running   0          40m
tei-embedding-svc-deployment-c8f597f77-7kq72    1/1     Running   0          103m
tei-reranking-svc-deployment-8674bdd656-7hffr   1/1     Running   0          103m
tgi-service-m-deployment-647967c8fb-rnqps       1/1     Running   0          103m
web-retriever-svc-deployment-598586d9b4-fkdsm   1/1     Running   0          41m
root@satg-opea-4node-1:/home/huailong# 
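
The readiness listings above are checked by eye. As a rough sketch of how the same check could be scripted, the snippet below parses pasted `kubectl get deployment` or `kubectl get pods` output and reports whether every row's READY column reads `n/n`. The function name `all_ready` and the sample text are illustrative assumptions, not part of this PR.

```python
def all_ready(kubectl_table: str) -> bool:
    """Return True if every row's READY column reads n/n with n > 0.

    Hypothetical helper: expects the plain-text table printed by
    `kubectl get deployment` or `kubectl get pods`.
    """
    lines = [ln for ln in kubectl_table.strip().splitlines() if ln.strip()]
    # Locate the READY column by name so the same code handles both tables.
    ready_idx = lines[0].split().index("READY")
    for row in lines[1:]:
        ready, _, desired = row.split()[ready_idx].partition("/")
        if ready != desired or int(desired) == 0:
            return False
    return True


# Two rows copied from the deployment listing above.
sample = """\
NAME                           READY   UP-TO-DATE   AVAILABLE   AGE
embedding-svc-deployment       1/1     1            1           100m
llm-svc-deployment             1/1     1            1           99m
"""

print(all_ready(sample))
```

The `searchqa` GMC resource reports READY in a three-part form (`7/0/7`), which this sketch does not attempt to interpret.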
root@satg-opea-4node-1:/home/huailong# kubectl get gmc -n searchqa
NAME       URL                                                     READY   AGE
searchqa   http://router-service.searchqa.svc.cluster.local:8080   7/0/7   100m
root@satg-opea-4node-1:/home/huailong# 
root@satg-opea-4node-1:/home/huailong# 
root@satg-opea-4node-1:/home/huailong# kubectl exec "$CLIENT_POD" -n chatqa -- curl $accessUrl  -X POST  -d '{"text":"What is the latest news? Give me also the source link."}' -H 'Content-Type: application/json'
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100   877  100   812  100   data: b'\n'     3  0:00:21  0:00:20  0:00:01   176

data: b'\n'

data: b'An'

data: b'swer'

data: b':'

data: b' The'

data: b' latest'

data: b' news'

data: b' can'

data: b' be'

data: b' found'

data: b' at'

data: b' the'

data: b' source'

data: b' link'

data: b':'

data: b' https'

data: b'://'

data: b'www'

data: b'.'

data: b'c'

data: b'nn'

data: b'.'

data: b'com'

data: b'/,'

data: b' where'

data: b' you'

data: b' can'

data: b' view'

data: b' breaking'

data: b' news'

data: b' today'

data: b' for'

data: b' various'

data: b' categories'

data: b' such'

data: b' as'

data: b' U'

data: b'.'

data: b'S'

data: b'.,'

data: b' world'

data: b','

data: b' weather'

data: b','

data: b' entertainment'

data: b','

data: b' politics'

data: b','

data: b' and'

data: b' health'

data: b'.'

data: b'</s>'

data: [DONE]

 65     38      3  0:00:21  0:00:20  0:00:01   224
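
The response arrives as a stream of `data: b'...'` chunks, which is hard to read as a single answer. A minimal sketch for reassembling a captured stream is shown below. It assumes each payload is a Python bytes literal, as in the output above, and that `</s>` is an end-of-sequence marker that can be dropped. The function name `join_stream` is hypothetical.

```python
import ast


def join_stream(raw: str, eos: str = "</s>") -> str:
    """Concatenate the text carried by `data: b'...'` lines into one string."""
    parts = []
    for line in raw.splitlines():
        line = line.strip()
        if not line.startswith("data: "):
            continue  # skip blank lines and curl progress-meter lines
        payload = line[len("data: "):]
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        try:
            chunk = ast.literal_eval(payload)  # "b'An'" -> b"An"
        except (ValueError, SyntaxError):
            continue  # ignore anything that is not a valid literal
        if isinstance(chunk, bytes):
            parts.append(chunk.decode("utf-8", errors="replace"))
    # Assumption: the end-of-sequence token is not part of the answer.
    return "".join(parts).replace(eos, "").strip()


# The first few chunks from the output above, plus the closing lines.
sample = "\n".join([
    r"data: b'\n'",
    "data: b'An'",
    "data: b'swer'",
    "data: b':'",
    "data: b' The'",
    "data: b' latest'",
    "data: b' news'",
    "data: b'</s>'",
    "data: [DONE]",
])

print(join_stream(sample))
```

Progress-meter lines that curl interleaves with the body do not start with `data: `, so they are skipped. Running curl with `-s` suppresses the meter at the source.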

@lianhao lianhao added this to the v0.8 milestone Jul 19, 2024
@KfreeZ KfreeZ merged commit 883c8da into opea-project:main Jul 22, 2024
7 checks passed