SynapseAI Core

SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi.

SynapseAI Core

Library components

SynapseAI Core contains the following elements:

Synapse Core: an implementation of the SynapseAI APIs. Some APIs are not implemented for brevity, see documentation below.
Synapse Backend: a library for executing code on Habana Gaudi, using the habanalabs driver and the hl-thunk user-space library.
TPC Kernels DB: a library containing some sample TPC (Tensor Processing Core) kernels and accompanying host-side code.
ELFTools: a utility library for parsing ELF sections and extracting metadata generated by the TPC LLVM compiler.
Tests: Some simple tests to demonstrate how to execute workloads on the device that utilize the TPC engines.

Pre-requisites

Linux kernel with latest habanalabs driver. Gaudi device support was added in kernel 5.7 but to work with a secured device, kernel 5.15 and above is required.
hl-thunk library (https://github.com/HabanaAI/hl-thunk)
TPC LLVM compiler (https://github.com/HabanaAI/tpc_llvm)
cmake version 3.5.1 or higher
GCC 7.5 or higher

Building SynapseAI Core

Build the hl-thunk library and TPC LLVM compiler according to their respective instructions. From here on, we assume the root of TPC LLVM is $HOME/tpc-llvm and the root of hl-thunk is $HOME/hl-thunk
Clone the repository. From here on, we assume it was cloned to $HOME/SynapseAI_core
Run the build.sh script:

cd $HOME/SynapseAI_Core
EXTRA_CMAKE_FLAGS="-DTPC_LLVM_BIN_PATH=$HOME/tpc-llvm/build/bin -DHLTHUNK_INCLUDE_PATH=$HOME/hl-thunk/include/uapi -DHLTHUNK_LIB_PATH=$HOME/hl-thunk/build/lib" ./build.sh

Tests

There are a couple of tests in the tests folder but currently only two will compile. In the near future we will add the missing kernels for the rest of the tests.

The tests that can currently be run are:

div_test - This test computes division of two tensors on the device. It copies the output to the host and compares the result to a reference implementation on the host. The test serves as a demonstration of how to create a graph containing a single node. The node represents a divide operator. The division is implemented using a TPC kernel that performs the computation on the TPC engines.
memcpy_test - This test demonstrates how to use the DMA engine to copy a tensor in and out of the device. It doesn't use the TPC engine and doesn't require any TPC kernel.

Running the Tensor Division test

cd $HOME/SynapseAI_Core
./build/bin/div_test

The expected result should be:

Comparison passed successfully

Limitations

Limitations of this implementation compared to the closed-source SynapseAI release:

Operations are synchronous and synchronization occurs through the host.
- So many APIs, like synStreamWaitEvent, synEventCreate etc, are no-ops.
This version of the library doesn't implement any operations itself. This means for an operation like reshape or split, the user has to resolve it themselves, or write a TPC kernel to perform it.
The implementation is limited to single-node graphs. Calling synNodeCreate on a graph that already contains a node will fail.
- As a corollary, the user must perform all memory management of the Gaudi Memory (HBM)
- And control edges are not supported since they're not needed
The "section" mechanism is not supported. All tensors must be created with a null for section handle.
The user is limited to a single stream per type. Only compute, copy device to host and copy host to device are supported.
Tensors must be dense in device memory. Strided tensors are not supported.
Only floating point tensors are supported.
No profiler support.
No support for printf from kernels.
No support for quantization-related APIs, such as providing tensors with static data.
No support for advanced SynapseAI features, like dynamic shape support or tensors of any rank.
Unsupported SynapseAI APIs:
- synRecipeSerialize / Deserialize
- synDeviceAcquireByModuleID
- synConfigurationGet/Set
- synProfilerStart/Stop/GetTrace
- synConstTensorCreate
- synEventElapsedTime always returns 0

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
elftools		elftools
external_includes		external_includes
synapse_backend		synapse_backend
synapse_core		synapse_core
tests		tests
tpc_kernels_db		tpc_kernels_db
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
COPYING.MIT		COPYING.MIT
COPYING.md		COPYING.md
README.md		README.md
build.sh		build.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SynapseAI Core

Library components

Pre-requisites

Building SynapseAI Core

Tests

Running the Tensor Division test

Limitations

About

Releases

Packages

Languages

License

luoyu-intel/SynapseAI_Core

Folders and files

Latest commit

History

Repository files navigation

SynapseAI Core

Library components

Pre-requisites

Building SynapseAI Core

Tests

Running the Tensor Division test

Limitations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages