Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add socket sharing #94

Merged
merged 23 commits into from
Nov 10, 2021
Merged

Add socket sharing #94

merged 23 commits into from
Nov 10, 2021

Conversation

sansyrox
Copy link
Member

@sansyrox sansyrox commented Oct 13, 2021

Todo:

  • Add more integration tests [x]
  • Check and fix directories not working [x]

UPDATE

  • Argparser not working. Need to fix this
  • Fix the installation on Raspberry Pi and Windows
  • Write the documentation for the flags

@netlify
Copy link

netlify bot commented Oct 13, 2021

✔️ Deploy Preview for robyn canceled.

🔨 Explore the source changes: bae23d5

🔍 Inspect the deploy log: https://app.netlify.com/sites/robyn/deploys/618ba755e63d070008da016b

@sansyrox
Copy link
Member Author

https://gist.github.com/josiahcarlson/3723597

Socket is not being pickled, need to find some other way to share them across the processes.

@sansyrox
Copy link
Member Author

@sansyrox
Copy link
Member Author

@messense @JackThomson2 @awestlake87 , I know it has been long time since you folks worked here. But I am facing some major issues in passing the socket about around the multi processes. Do you folks seem to know a solution for this? https://stackoverflow.com/questions/69788270/a-copied-socket-is-not-being-pickled

@JackThomson2
Copy link
Contributor

Hey, sorry I've been very busy starting a new job. Have you had a look in my fork I had a working example of this under one of the branches I can't remember which

@sansyrox
Copy link
Member Author

@JackThomson2 , first of all, congratulations on your new job! 🥳

I did take a look here: https://github.com/JackThomson2/robyn/tree/multiprocess .

I couldn't find the process sharing here.

@JackThomson2
Copy link
Contributor

Thank you! If you look at line 85 on the init.py file inside the Robyn folder the cloning of the socket takes place here and is sent to each new process

@sansyrox
Copy link
Member Author

@JackThomson2 , I tried that.

I am getting the following error:

  File "integration_tests/base_routes.py", line 75, in <module>
    app.start(port=5000, url='0.0.0.0')
  File "/Users/bruhh/.pyenv/versions/maturin/lib/python3.8/site-packages/robyn/__init__.py", line 100, in start
    ns.x = socket.try_clone()
  File "/Users/bruhh/.pyenv/versions/maturin/lib/python3.8/site-packages/multiprocess/managers.py", line 1143, in __setattr__
    return callmethod('__setattr__', (key, value))
  File "/Users/bruhh/.pyenv/versions/maturin/lib/python3.8/site-packages/multiprocess/managers.py", line 834, in _callmethod
    conn.send((self._id, methodname, args, kwds))
  File "/Users/bruhh/.pyenv/versions/maturin/lib/python3.8/site-packages/multiprocess/connection.py", line 209, in send
    self._send_bytes(_ForkingPickler.dumps(obj))
  File "/Users/bruhh/.pyenv/versions/maturin/lib/python3.8/site-packages/multiprocess/reduction.py", line 54, in dumps
    cls(buf, protocol, *args, **kwds).dump(obj)
  File "/Users/bruhh/.pyenv/versions/maturin/lib/python3.8/site-packages/dill/_dill.py", line 498, in dump
    StockPickler.dump(self, obj)
  File "/Users/bruhh/.pyenv/versions/3.8.5/lib/python3.8/pickle.py", line 485, in dump
    self.save(obj)
  File "/Users/bruhh/.pyenv/versions/3.8.5/lib/python3.8/pickle.py", line 558, in save
    f(self, obj)  # Call unbound method with explicit self
  File "/Users/bruhh/.pyenv/versions/3.8.5/lib/python3.8/pickle.py", line 899, in save_tuple
    save(element)
  File "/Users/bruhh/.pyenv/versions/3.8.5/lib/python3.8/pickle.py", line 558, in save
    f(self, obj)  # Call unbound method with explicit self
  File "/Users/bruhh/.pyenv/versions/3.8.5/lib/python3.8/pickle.py", line 884, in save_tuple
    save(element)
  File "/Users/bruhh/.pyenv/versions/3.8.5/lib/python3.8/pickle.py", line 576, in save
    rv = reduce(self.proto)
TypeError: cannot pickle 'builtins.SocketHeld' object

@messense
Copy link
Contributor

messense commented Nov 1, 2021

See PyO3/pyo3#100

@awestlake87
Copy link
Contributor

awestlake87 commented Nov 1, 2021

I'm definitely not an expert on this, but I think that it's impossible to pass sockets to another process via pickling. Sockets are process-local, so you can't just pass the file descriptor of an open socket to another process with normal methods.

You can inherit open file descriptors from the parent process when the process is forked and then do the processing for a connection in the child process. But a forked process will only inherit the file descriptors from the parent at the time of the fork. The child process won't have access to any of the file descriptors that the parent opens after the fork.

In order to do what you want, which I assume is delegate socket connections to a pool of worker processes that were forked before the connection was made, you'll likely need some sort of connection with SCM_RIGHTS. This answer seems promising, but it gets very hairy and I imagine very OS-specific. Maybe there's some existing multiprocessing library that can do this for you?

@awestlake87
Copy link
Contributor

awestlake87 commented Nov 1, 2021

The SCM_RIGHTS option seems to only work with Unix domain sockets, not your standard internet sockets, so forking the process when accepting the connection might be your only option to share the socket file descriptor.

Just a heads up though, tokio does not play well with forked processes, and the worker threads will die when a process is forked (for good reason too) so forking after tokio is running is a problem. (forking before tokio is running would not be a problem though)

Maybe your best option is a some sort of reverse proxy like traefik or a multiprocessing task queue like celery?

@sansyrox
Copy link
Member Author

sansyrox commented Nov 1, 2021

@JackThomson2 @messense @awestlake87 , thank you for your help. 😄

I was able to fix this. All the effort spent in learning low level code and just one line(https://github.com/sansyrox/robyn/pull/94/files#diff-8bbb55cf9793eb46fce842bf2abe71425ddb24af349e3114de42ef25f07b02fcR15) fixed it.

Thanks again for the help! 🥳

@sansyrox
Copy link
Member Author

sansyrox commented Nov 1, 2021

Just a heads up though, tokio does not play well with forked processes, and the worker threads will die when a process is forked (for good reason too) so forking after tokio is running is a problem. (forking before tokio is running would not be a problem though)

@awestlake87 , can't tokio create new threads after the old ones die?

@awestlake87
Copy link
Contributor

awestlake87 commented Nov 2, 2021

It could create new worker threads, but the issue is that when the original worker threads die, they most likely left the tasks that were running prior to the fork in a bad state in the child process. Restarting the worker threads would allow new tasks to run, but the old tasks would never complete and the child process would most likely be left in a problematic state.

Tokio chooses to do nothing in this case since there's no real way it can guarantee that the program should function properly after the fork. This is true of a lot of multi-threaded libraries, not just tokio, so in general the advice for forking your application is to fork it before any threads are created.

This is possible to do in your case as long as pyo3-asyncio is not involved in the parent process at all (pyo3-asyncio tokio initialization is lazy, so it won't start until you try to use it). So basically do not use pyo3-asyncio until the process is forked, then tokio will be started in each child process as it processes the connection whenever pyo3-asyncio is used for the first time.

@sansyrox
Copy link
Member Author

sansyrox commented Nov 2, 2021

This is possible to do in your case as long as pyo3-asyncio is not involved in the parent process at all (pyo3-asyncio tokio initialization is lazy, so it won't start until you try to use it). So basically do not use pyo3-asyncio until the process is forked, then tokio will be started in each child process as it processes the connection whenever pyo3-asyncio is used for the first time.

Thank you for this information! 😄

I think that after pyo3-asyncio is not being used much in the parent process. It is just vanilla PyO3. Now, if it is getting started, I can make it start a little later.

@sansyrox
Copy link
Member Author

sansyrox commented Nov 2, 2021

@awestlake87 , would it make sense if I would start a process for pyo3-asyncio, kill it and then spawn the two separate processes for the runtime?

@awestlake87
Copy link
Contributor

@awestlake87 , would it make sense if I would start a process for pyo3-asyncio, kill it and then spawn the two separate processes for the runtime?

I'm not sure I follow on this.

I was thinking that the parent process would handle server setup, then listening and accepting connections. Each connection forks the process and the child process runs the pyo3-asyncio handler.

As long as pyo3-asyncio and tokio are not involved in the server setup / listening, I think you should be ok.

@sansyrox
Copy link
Member Author

sansyrox commented Nov 2, 2021

@awestlake87 , ah got it. I believe the same is happening right now. The setup is handled before the processes, i.e. app.start() is called.

And even benchmarking is showing improved throughput and reduced time till 50k requests.

@sansyrox
Copy link
Member Author

sansyrox commented Nov 2, 2021

For some reason, headers are not being shown atm. But, I guess, that is not a tokio problem.

@sansyrox
Copy link
Member Author

sansyrox commented Nov 2, 2021

Latest Update: directory serving is not working, everything else works.

Need to fix it and refactor code.

@JackThomson2
Copy link
Contributor

Hey Sans, one thing I think it's worth checking is that all your processes are handling the share of requests. The perf improvements you're seeing may just be from setting workers to 1 which I found to be more efficient

@sansyrox
Copy link
Member Author

sansyrox commented Nov 5, 2021

Hey @JackThomson2 ,

I did the test right now. The maximum performance for me was at 5 processes and 5 workers. I feel it is somehow dependent on the number of CPU cores(virtual or real).

I was thinking of pushing with defaults as 1 and adding config flags for now. If we figure out the optimized algo for this, we can make that the default way but still the reconfigurability will be there in the future as well.

@sansyrox sansyrox force-pushed the socket-sharing branch 4 times, most recently from aaec1af to 86ee90a Compare November 7, 2021 19:40
@sansyrox sansyrox force-pushed the socket-sharing branch 2 times, most recently from c4acb39 to 63ae419 Compare November 10, 2021 10:41
@sansyrox sansyrox merged commit 5e47dbf into main Nov 10, 2021
@sansyrox
Copy link
Member Author

@JackThomson2 @messense @awestlake87 , this feature has now landed. 🥳

Thank you for the help ! ✨

@sansyrox sansyrox deleted the socket-sharing branch November 10, 2021 11:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants