SynapseAI 1.11.0 Release

The Habana team is happy to announce the release of Habana® SynapseAI® Software version 1.11.0.

In this release, we have upgraded several libraries to newer versions, including DeepSpeed 0.9.4, PyTorch Lightning 2.0.4, and TensorFlow 2.12.1.

We have introduced support for DeepSpeed-Chat, with an example published in the Habana reference models repository. We have also introduced DeepSpeed support in Lightning.

For debugging and profiling, users can now retrieve metrics for cpu_fallback, memory_defragmentation, and recipe_cache using the Metric APIs.

Some of the Habana reference models have been updated to use the PT_HPU_ENABLE_REFINE_DYNAMIC_SHAPES runtime environment variable, which allows SynapseAI to handle dynamic shapes automatically.
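As a rough illustration of how a training script might opt in, the sketch below sets the variable from Python. It assumes that the value "1" enables the feature and that the variable must be set before the Habana PyTorch bridge is loaded; the helper name enable_refine_dynamic_shapes is hypothetical and not part of any Habana API.

```python
import os


def enable_refine_dynamic_shapes(env=os.environ):
    """Set the runtime flag named in the release notes.

    Assumption: "1" turns the feature on, and the flag is read when the
    Habana PyTorch bridge is loaded, so this should run before that import.
    """
    env["PT_HPU_ENABLE_REFINE_DYNAMIC_SHAPES"] = "1"
    return env["PT_HPU_ENABLE_REFINE_DYNAMIC_SHAPES"]


# Call this at the very top of the script, before any Habana imports.
enable_refine_dynamic_shapes()
```

The same effect can usually be achieved by exporting the variable in the shell that launches the script, which avoids any import-order concerns.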

We have added a new kernel, FusedSDPA, which is a fused implementation of the nn.functional.scaled_dot_product_attention() API on the HPU.
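For readers unfamiliar with the operation being fused, here is a minimal pure-Python reference of the math behind scaled dot-product attention, softmax(QKᵀ / sqrt(d))V. It is only a sketch of the computation for a single head with no mask and no dropout; it is not the FusedSDPA kernel and says nothing about how Habana implements it.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(q, k, v):
    """Reference attention for one head.

    q: list of Lq query vectors of size d
    k: list of Lk key vectors of size d
    v: list of Lk value vectors of size dv
    Returns a list of Lq output vectors of size dv.
    """
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)  # default scaling: 1 / sqrt(query dim)
    out = []
    for q_row in q:
        # Scaled similarity between this query and every key.
        scores = [scale * sum(a * b for a, b in zip(q_row, k_row)) for k_row in k]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        out.append([
            sum(w * v_row[j] for w, v_row in zip(weights, v))
            for j in range(len(v[0]))
        ])
    return out
```

A fused kernel computes the same result in a single pass rather than as separate matmul, softmax, and matmul steps.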

This release also includes LLM inference performance improvements. See Habana’s model performance page for details.

Lastly, a reminder that support for Habana Mixed Precision (HMP) is deprecated and will be dropped in the next release. Users should plan to switch to autocast for mixed precision support.
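For those planning the switch, the sketch below shows the general shape of PyTorch's autocast context manager. It defaults to the "cpu" device type so that it runs anywhere PyTorch is installed; using "hpu" instead is an assumption that requires the Habana PyTorch bridge to be installed and imported first. The function name matmul_under_autocast is hypothetical.

```python
def matmul_under_autocast(device_type="cpu"):
    """Run a small matmul under autocast and return the output dtype.

    Assumption: on Gaudi, device_type would be "hpu" and the Habana
    PyTorch bridge would need to be imported before this is called.
    """
    import torch

    a = torch.randn(4, 8, device=device_type)
    b = torch.randn(8, 2, device=device_type)
    # Ops that autocast supports run in bfloat16 inside this block,
    # even though the inputs were created as float32.
    with torch.autocast(device_type=device_type, dtype=torch.bfloat16):
        out = torch.mm(a, b)
    return out.dtype
```

Unlike HMP, autocast needs no separate op lists to get started: the model code inside the context manager stays unchanged.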

You can find more information on SynapseAI 1.11.0 on Habana’s release notes page.