Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement a more accurate hardware model. #4

Open
3 tasks
Tracked by #1
miladHakimi opened this issue Feb 1, 2023 · 0 comments
Open
3 tasks
Tracked by #1

Implement a more accurate hardware model. #4

miladHakimi opened this issue Feb 1, 2023 · 0 comments

Comments

@miladHakimi
Copy link
Collaborator

miladHakimi commented Feb 1, 2023

The current model changes Enzyme's tape loads and stores with Tloads and Tstores. It makes the all the instructions after the barrier be dependent on it and models the memory latency by using a delay. It does not support double buffering right now and the delay might not be fixed in different benchmarks. Moreover, using the noop native operations in salam we want to add the comparison between the recompute and store-all models.

  • Implement actual writes and read request to memory for the barriers.
  • Implement double buffering.
  • Implement the store-all execution model.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant