Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[Unreleased version] - YYYY-MM-DD

Added

Changed

Updated to common hpu_backend interface for compile support. (#183)

Fixed

Removed

Deprecated

[1.5.0] - 2024-05-03

Added

Added support for Intel Gaudi Profiler. Deprecate HABANA_PROFILE environment variable from HPUProfiler. (#158)
Added support for FP8 inference. (#162)
Added support for LightningCLI. (#173)
Added experimental support for FSDP on HPU. (#174)
Added support for FP8 inference with DeepSpeed. (#176)
Updated the lightning version check for using FSDP. (#182)

Changed

Changed HPUParallelStrategy to HPUDDPStrategy (#160)
Changed HPU docker image based on Synapse AI release 1.15.0 (#166)
Updated to Intel Gaudi software Release 1.15.1 (#171)

Fixed

Fixed "No profiler activity found" error with HPUProfiler. (#172)

Removed

Deprecated

[1.4.0] - 2024-02-16

Added

Added DeepSpeed precision plugin for HPU (#147)
Added support for fp8 training. (#149)

Changed

Decoupled return strings of firmware, synapse version helper (#137)
Changed HPU docker image based on Synapse AI release 1.14.0 (#140)

Fixed

Fixed fabric imports for HPU strategies (#126)
Enabling tests and examples of fabric with HPU (#139)
Fixes an API break due to non-strict loading in Trainer (#150)

Removed

Deprecated

aot_hpu_training_backend will be deprecated. Use hpu_backend instead for torch compile with hpu (#148)

[1.3.0] - 2023-12-06

Added

Added support for Deepspeed inference on HPU with tests and documentation (#110)
Added tests, examples, and documentation for dynamic shapes with recipe caching (#107)
Added preview of torch compile with tests and documentation (#119)

Changed

Changed HPU docker image based on Synapse AI release 1.13.0 (#114)

Fixed

Fixed fabric imports for HPU strategies (#126)

Removed

Deprecated

[1.2.0] - 2023-10-26

Added

Added tests, examples and documentation for HPUPrecisionPlugin with autocast (#94)
Added test to validate checkpoint resuming with HPUDeepSpeedStrategy (#95)
Added support for lightning 2.1 (#100, #105)

Changed

Changed HPU docker image based on synapse AI release 1.12.0 (#90)
Use standard API's and Remove env variable to get HPU distributed backend (#91)
Changed HPU docker image based on synapse AI release 1.12.1, updated hooks (#106)

Fixed

Removed

Deprecated

[1.1.0] - 2023-09-26

Added

Documentation with examples for using DeepSpeed with HPU (#64)
Add autocast using HPUPrecision plugin (#66, #75)
Demonstrate HPU Graphs support (#67)
Enhance test coverage of DeepSpeed strategy on HPU (#68)
Added version check helper to use right release (#75, #76)
Implement reduce with parallel plugin (#77)

Changed

Changed HPU docker image based on synapse AI release 1.11.0 & upgraded deepspeed plugin to version 0.9.4 (#61)

Fixed

Fixed optimizer priority based on deepspeed specification (#36)
Fixed missing extras in package (#70)

Deprecated

Warn on HMP deprecation from HPUPrecision plugin (#65)

[1.0.1] - 2023-07-26

Added

Added tests for mixed precision training (#36)
Example to include mixed precision training (#54)

Changed

Enabled skipped tests based on registered strategy, accelerator (#46)

Fixed

Fixed Attribute Error (#43)
Fixed wrong imports (#44)
Fixed graph breaks in test/val phases in lazy mode (#45)

[1.0.0] - 2023-06-14

Added

Added HPU support for fabric (#11)
Added Pytorch HPU profiler support (#15)
Added basic HPU infra support for deep speed (#21)
Added Pytorch HPU datamodule support (#16)

Changed

Changed code hierarchy in compliance with base lightning code for pytorch (#12)
Changed default HPU docker image based on HPU release 1.10.0 (#30)

Fixed

Fixed mnist example test (#20)
Habana's dataloader hang with Lightning 2.0.x (#29)
Make #29 applicable only for gaudi devices (#39)
Fixed environment initialization for hpus and fixed docs (#40)
Fixed docs and added work around to make use hpu media packages without signature issues (#41)

Removed

Cleaning up env's ID for HPU parallel plugins based on synapse AI release 1.9 (#28)
Remove unnecessary import checks which degrade performance (#38)