Devel #1782

dillontsmith · 2020-10-12T13:55:31Z

No description provided.

Updates the requirements on [pytest](https://github.com/pytest-dev/pytest) to permit the latest version. - [Release notes](https://github.com/pytest-dev/pytest/releases) - [Changelog](https://github.com/pytest-dev/pytest/blob/master/CHANGELOG.rst) - [Commits](pytest-dev/pytest@1.0.0b3...6.1.1) Signed-off-by: dependabot[bot] <[email protected]>

Required by llvmlite-0.34.0, except for aarch64 which needs llvm-9. We're not hitting the bug that restricts aarch64 to llvm-9 so bump the version for all archs. Signed-off-by: Jan Vesely <[email protected]>

Updates the requirements on [graphviz](https://github.com/xflr6/graphviz) to permit the latest version. - [Release notes](https://github.com/xflr6/graphviz/releases) - [Changelog](https://github.com/xflr6/graphviz/blob/master/CHANGES.txt) - [Commits](xflr6/graphviz@0.1...0.14.2) Signed-off-by: dependabot[bot] <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

Parameters are configured such that LCA == DDM followed by Logistic function

We're not using group synchronization and this reduces pressure on per-block resources. Most GPUs can handle 2-3 times more warps than blocks per SM [0]. Block size of 128 creates 4 times fewer blocks than warps, maximizing utilization of GPU resources. [0] https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#features-and-technical-specifications__technical-specifications-per-compute-capability Signed-off-by: Jan Vesely <[email protected]> fixup blocksize

We're not using any shared memery, but generate plenty of private data. Signed-off-by: Jan Vesely <[email protected]>

Use explicitly sized types instead of platform specific ones Signed-off-by: Jan Vesely <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

Add compiled variants. Signed-off-by: Jan Vesely <[email protected]>

Signed-off-by: Jan Vesely <[email protected]>

Make IR generation for execution counts bitwidth agnostic. Add minor CUDA tune ups.

Bumps [actions/cache](https://github.com/actions/cache) from v2.1.1 to v2.1.2. - [Release notes](https://github.com/actions/cache/releases) - [Commits](actions/cache@v2.1.1...d1255ad) Signed-off-by: dependabot[bot] <[email protected]>

coveralls · 2020-10-12T14:39:20Z

Coverage increased (+0.004%) to 82.713% when pulling 78a0712 on devel into e415857 on master.

Signed-off-by: Jan Vesely <[email protected]>

dependabot bot and others added 14 commits October 6, 2020 08:04

travis: Bump llvm version to 10

66a4fa6

Required by llvmlite-0.34.0, except for aarch64 which needs llvm-9. We're not hitting the bug that restricts aarch64 to llvm-9 so bump the version for all archs. Signed-off-by: Jan Vesely <[email protected]>

tests/DDM: Add an LCA equivalent DDM test

8bd9fd2

Signed-off-by: Jan Vesely <[email protected]>

tests/LCA: Add a DDM equivalent LCA test

ed4824d

Signed-off-by: Jan Vesely <[email protected]>

tests: Add DDM/LCA equivalence tests (#1778)

de190fa

Parameters are configured such that LCA == DDM followed by Logistic function

llvm/cuda: Always prefer L1$ over shared mem

b453ed4

We're not using any shared memery, but generate plenty of private data. Signed-off-by: Jan Vesely <[email protected]>

llvm: Support conversion of 8 and 16 bit integer types

4f4f53b

Use explicitly sized types instead of platform specific ones Signed-off-by: Jan Vesely <[email protected]>

llvm: Do not assume execution counter uses 32bit ints

21f8c30

Signed-off-by: Jan Vesely <[email protected]>

tests/ProcessingMechanism: Consolidate output port function tests

9b9269d

Add compiled variants. Signed-off-by: Jan Vesely <[email protected]>

tests/ProcessingMechanism: Consolidate function tests

0e269f3

Signed-off-by: Jan Vesely <[email protected]>

llvm: Support more integer bitwidths (#1779)

a895bc4

Make IR generation for execution counts bitwidth agnostic. Add minor CUDA tune ups.

dillontsmith requested a review from SamKG October 12, 2020 13:55

SamKG approved these changes Oct 12, 2020

View reviewed changes

tests/models/predator-prey: Sort by number of attention levels

78a0712

Signed-off-by: Jan Vesely <[email protected]>

dillontsmith merged commit 472a2d0 into master Oct 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Devel #1782

Devel #1782

dillontsmith commented Oct 12, 2020

coveralls commented Oct 12, 2020 •

edited

Loading

Devel #1782

Devel #1782

Conversation

dillontsmith commented Oct 12, 2020

coveralls commented Oct 12, 2020 • edited Loading

coveralls commented Oct 12, 2020 •

edited

Loading