Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

[Unreleased version] - YYYY-MM-DD

Added

Changed

  • Updated to common hpu_backend interface for compile support. (#183)

Fixed

Removed

Deprecated

[1.5.0] - 2024-05-03

Added

  • Added support for the Intel Gaudi Profiler. Deprecated the HABANA_PROFILE environment variable in HPUProfiler. (#158)
  • Added support for FP8 inference. (#162)
  • Added support for LightningCLI. (#173)
  • Added experimental support for FSDP on HPU. (#174)
  • Added support for FP8 inference with DeepSpeed. (#176)
  • Updated the Lightning version check for using FSDP. (#182)

Changed

  • Renamed HPUParallelStrategy to HPUDDPStrategy (#160)
  • Changed HPU docker image based on Synapse AI release 1.15.0 (#166)
  • Updated to Intel Gaudi software Release 1.15.1 (#171)

Fixed

  • Fixed "No profiler activity found" error with HPUProfiler. (#172)

Removed

Deprecated

[1.4.0] - 2024-02-16

Added

  • Added DeepSpeed precision plugin for HPU (#147)
  • Added support for FP8 training. (#149)

Changed

  • Decoupled the return strings of the firmware and Synapse version helpers (#137)
  • Changed HPU docker image based on Synapse AI release 1.14.0 (#140)

Fixed

  • Fixed fabric imports for HPU strategies (#126)
  • Enabled tests and examples of Fabric with HPU (#139)
  • Fixed an API break due to non-strict loading in Trainer (#150)

Removed

Deprecated

  • aot_hpu_training_backend will be deprecated. Use hpu_backend instead for torch.compile on HPU. (#148)

[1.3.0] - 2023-12-06

Added

  • Added support for DeepSpeed inference on HPU with tests and documentation (#110)
  • Added tests, examples, and documentation for dynamic shapes with recipe caching (#107)
  • Added preview of torch compile with tests and documentation (#119)

Changed

  • Changed HPU docker image based on Synapse AI release 1.13.0 (#114)

Fixed

  • Fixed fabric imports for HPU strategies (#126)

Removed

Deprecated

[1.2.0] - 2023-10-26

Added

  • Added tests, examples and documentation for HPUPrecisionPlugin with autocast (#94)
  • Added test to validate checkpoint resuming with HPUDeepSpeedStrategy (#95)
  • Added support for Lightning 2.1 (#100, #105)

Changed

  • Changed HPU docker image based on Synapse AI release 1.12.0 (#90)
  • Used standard APIs and removed the environment variable to get the HPU distributed backend (#91)
  • Changed HPU docker image based on Synapse AI release 1.12.1, updated hooks (#106)

Fixed

Removed

Deprecated

[1.1.0] - 2023-09-26

Added

  • Added documentation with examples for using DeepSpeed with HPU (#64)
  • Added autocast using the HPUPrecision plugin (#66, #75)
  • Demonstrated HPU Graphs support (#67)
  • Enhanced test coverage of the DeepSpeed strategy on HPU (#68)
  • Added a version check helper to use the right release (#75, #76)
  • Implemented reduce with the parallel plugin (#77)

Changed

  • Changed HPU docker image based on Synapse AI release 1.11.0 and upgraded the DeepSpeed plugin to version 0.9.4 (#61)

Fixed

  • Fixed optimizer priority based on the DeepSpeed specification (#36)
  • Fixed missing extras in package (#70)

Deprecated

  • Added a warning about HMP deprecation in the HPUPrecision plugin (#65)

[1.0.1] - 2023-07-26

Added

  • Added tests for mixed precision training (#36)
  • Example to include mixed precision training (#54)

Changed

  • Enabled skipped tests based on the registered strategy and accelerator (#46)

Fixed

  • Fixed AttributeError (#43)
  • Fixed wrong imports (#44)
  • Fixed graph breaks in test/val phases in lazy mode (#45)

[1.0.0] - 2023-06-14

Added

  • Added HPU support for Fabric (#11)
  • Added PyTorch HPU profiler support (#15)
  • Added basic HPU infra support for DeepSpeed (#21)
  • Added PyTorch HPU datamodule support (#16)

Changed

  • Changed the code hierarchy to comply with the base Lightning code for PyTorch (#12)
  • Changed default HPU docker image based on HPU release 1.10.0 (#30)

Fixed

  • Fixed MNIST example test (#20)
  • Fixed Habana's dataloader hang with Lightning 2.0.x (#29)
  • Made the fix from #29 apply only to Gaudi devices (#39)
  • Fixed environment initialization for HPUs and fixed docs (#40)
  • Fixed docs and added a workaround to use HPU media packages without signature issues (#41)

Removed

  • Cleaned up env's ID for HPU parallel plugins based on Synapse AI release 1.9 (#28)
  • Removed unnecessary import checks that degraded performance (#38)