Utilize processing server proxy to mets servers #1220

MehmedGIT · 2024-05-03T18:06:45Z

Allow the Processing Server to accept general mets server-related TCP requests, and translate them to UDS requests. This feature is useful for cases when the worker is located on a remote host and wants to communicate with a UDS Mets Server. One main benefit of this approach is to avoid allocating separate ports for different mets servers.

~~This PR is still a draft and the implementation is still on a conceptual level~~. Please feel free to suggest any ideas.

bertsky

I still don't fully understand: if the METS Server user (here: tcp_mets caller) is remote (relative to the workspace), then how is this useful? All the files that the METS updates will relate to will also be remote, and we concluded it's not a good design to squeeze file contents through the METS Server earlier.

src/ocrd_network/runtime_data/deployer.py

MehmedGIT · 2024-05-17T09:28:56Z

I still don't fully understand: if the METS Server user (here: tcp_mets caller) is remote (relative to the workspace), then how is this useful? All the files that the METS updates will relate to will also be remote, and we #966 (review) earlier.

The idea is that the processing workers could send requests to any Mets server through the Processing Server without allocating separate ports per Mets server on the host where the Processing Server is running.

You are right that it will not work when the Processing Server (Mets Servers) and the Processing Workers are on different hosts with the current setup. You are also right that it has been decided to not transfer files over to the Mets Server as discussed in #966.

The forwarding through the Processing Server as a proxy is supposed to be used when:

there is a Network File System that is accessible by both the Mets Servers and Processing Workers
when running docker containers on the same host - to avoid extra allocation of ports for the Mets Servers

@joschrew, is there anything more to add that I have missed?

joschrew · 2024-05-17T09:38:48Z

I imagine a setup where workers (and processing server) are on different vm's. The workspace is shared through NFS so that every processor has access to the same files. Currently this would not work with the Mets-Server as the unix-domain-sockets cannot be shared through NFS. With this PR it should be possible for workers on different vms to make requests to the Mets-Server.

bertsky · 2024-05-17T17:04:53Z

Thanks @MehmedGIT @joschrew for the explanation. Absolutely makes sense now – fantastic idea!

joschrew · 2024-05-31T13:46:47Z

I did some commits. It is now working for me with processors in docker-containers. I still want to change little things and do additional tests. After that I would change the pr's status for reviews.

MehmedGIT · 2024-05-31T13:55:19Z

I did some commits. It is now working for me with processors in docker-containers. I still want to change little things and do additional tests. After that I would change the pr's status for reviews.

Thanks.

MehmedGIT · 2024-06-03T18:38:55Z

It is still unclear to me why one of the processors sent the request to /tcp_mets/workspace_path instead of /tcp_mets, leading to 404 errors. However, the error I was getting was related to an outdated ocrd_all local installation. All redirections work as expected with the latest version of ocrd_all and core.

…tilize-ps-proxy-to-ms

kba

Thanks, very thorough, ready for release

MehmedGIT added 15 commits April 25, 2024 14:23

implement start/stop tcp skeleton

c250dcc

tcp server addition

ef9b2df

remove unnecessary tcp methods

deeba67

rename: unix -> uds

47ad818

reorganize mets_server.py

3960065

reorganize mets_server.py client

26c9742

add template forwarding in PS

28b38b4

add example request multiplexing to OcrdClient

ab927f8

add mets_target to client side mets

b09f4b9

fix ws_dir_path, add all multiplexing requests

d031dd8

use ws dir instead of mets file path for socket

bed2fad

use ws dir instead of mets file path for socket

8f6450c

fix: pass the ws dir, not mets path

30e1128

tcp to uds mets proxy

9041bd6

fix: test case

8acb044

bertsky reviewed May 17, 2024

View reviewed changes

src/ocrd_network/runtime_data/deployer.py Outdated Show resolved Hide resolved

MehmedGIT and others added 11 commits May 24, 2024 15:57

fix: multiplexing mode breaking the test

b40c5c1

fix: ws dir instead of mets path

a7d23ae

fix: multiplex properly

1b72738

Add 2 module tests for the proxy server

a976100

add more proxy server module tests

973a871

Add note for improved error handling

e70e93b

Set multiplexing url optional to internal-cb url

f4ea16a

Add param type to tcp forward method

1072d71

Set response type for tcp_mets to dict

ac17aa3

Set response type in tests too

045bb6c

Make tcp_mets get requests parameter work

3534fa7

joschrew added 4 commits May 30, 2024 12:02

Fix is_mets_server_running

3da9464

Fix find_files proxy call

8151766

Set suitable param type for uds mets forwarding

ea3d641

Change tcp mets proxy and its way to test

521fab3

joschrew and others added 6 commits June 3, 2024 09:56

Handle add_file errors with tcp_mets forward

227758f

Log exception when mets server startup fails

e118416

Use MpxReq-method for mets server running check

e087eca

Make tcp mets configurable

b158522

set: use_tcp_mets flag in config

7882653

Merge branch 'master' into utilize-ps-proxy-to-ms

abd06e1

MehmedGIT added 3 commits June 4, 2024 12:50

replace nohup in deployer

4d95329

refine MpxReq

7eba992

Merge branch 'master' into utilize-ps-proxy-to-ms

022a37b

MehmedGIT marked this pull request as ready for review June 4, 2024 12:05

MehmedGIT requested review from kba, joschrew and bertsky June 4, 2024 12:05

MehmedGIT added 2 commits June 4, 2024 14:33

abstract away mets server cli

2b9c184

Merge branch 'utilize-ps-proxy-to-ms' of github.com:OCR-D/core into u…

b95c214

…tilize-ps-proxy-to-ms

kba approved these changes Jun 6, 2024

View reviewed changes

kba merged commit 69c1ce6 into master Jun 7, 2024
21 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Utilize processing server proxy to mets servers #1220

Utilize processing server proxy to mets servers #1220

MehmedGIT commented May 3, 2024 •

edited

Loading

bertsky left a comment

MehmedGIT commented May 17, 2024 •

edited

Loading

joschrew commented May 17, 2024

bertsky commented May 17, 2024

joschrew commented May 31, 2024

MehmedGIT commented May 31, 2024

MehmedGIT commented Jun 3, 2024

kba left a comment

Utilize processing server proxy to mets servers #1220

Utilize processing server proxy to mets servers #1220

Conversation

MehmedGIT commented May 3, 2024 • edited Loading

bertsky left a comment

Choose a reason for hiding this comment

MehmedGIT commented May 17, 2024 • edited Loading

joschrew commented May 17, 2024

bertsky commented May 17, 2024

joschrew commented May 31, 2024

MehmedGIT commented May 31, 2024

MehmedGIT commented Jun 3, 2024

kba left a comment

Choose a reason for hiding this comment

MehmedGIT commented May 3, 2024 •

edited

Loading

MehmedGIT commented May 17, 2024 •

edited

Loading