Skip to content

Releases: deepspeedai/DeepSpeed

v0.18.4 Patch Release

07 Jan 22:58
b35d9eb

Choose a tag to compare

What's Changed

  • Update version by @sfc-gh-truwase in #7719
  • Disable deterministic option in compile tests by @tohtana in #7720
  • Fix SuperOffloadOptimizer_Stage3 crash due to missing param_names parameter by @ImaGoodFella in #7715
  • [AMD][ROCm] Improve support of AMD by @k-artem in #7448
  • fix typo by @stas00 in #7722
  • Skip none in backward hook by @tohtana in #7725
  • [Engine] Only scale gradients if scale_wrt_gas is True by @kashif in #7724
  • Fix testcases that depends on triton by @k-artem in #7731
  • Fix rare hang in DeepSpeed Async I/O wait by releasing the Python GIL by @xylian86 in #7727
  • Fix #7733: Replace torch.sqrt with math.sqrt in scale_lr for sqrt method by @Rakshit-gen in #7735
  • replace moe checkpoint dp_world_size with seq_dp_world_size by @wukong1992 in #7732
  • [BUG] Fix UlyssesSPAttentionHF.register_with_transformers() crash with PEFT models by @Rakshit-gen in #7737
  • Add core api update blog by @tohtana in #7738
  • Fix Nebula checkpoint engine commit() API mismatch by @Rakshit-gen in #7740
  • Fix DecoupledCheckpointEngine deadlock and improve reliability by @Rakshit-gen in #7742
  • Fix OnebitLamb NaN propagation with empty parameters by @Rakshit-gen in #7736
  • fix: remove premature MPI environment variable check in OpenMPIRunner by @leejianwoo-collab in #7751
  • Enable python 3.11 and 3.12 tests by @loadams in #7007
  • Add CI workflow to run tests on AWS by @tohtana in #7753
  • Add fallback to BF16 support check by @tohtana in #7754
  • Fix DeepCompile for PyTorch 2.8/2.9 compatibility by @tohtana in #7755
  • Removed amp testcases by @k-artem in #7745
  • fix: avoid IndexError in BF16_Optimizer.destroy() when using DummyOptim by @leejianwoo-collab in #7763

New Contributors

Full Changelog: v0.18.3...v0.18.4

v0.18.3 Patch Release

09 Dec 14:48
6eb98aa

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.18.2...v0.18.3

v0.18.2 Patch Release

05 Nov 19:21
e993fea

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.18.1...v0.18.2

v0.18.1 Patch Release

23 Oct 16:39

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.18.0...v0.18.1

v0.18.0

07 Oct 23:27
79caae1

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.17.6...v0.18.0

v0.17.6 Patch Release

19 Sep 19:56
e4f6da9

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.17.5...v0.17.6

v0.17.5 Patch Release

20 Aug 20:28
047a759

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.17.4...v0.17.5

v0.17.4 Patch Release

31 Jul 20:47
c4b1a8c

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.17.3...v0.17.4

v0.17.3 Patch Release

28 Jul 18:20
092625c

Choose a tag to compare

What's Changed

  • [TiledMLP]: fix for bs>1 by @stas00 in #7412
  • Update version.txt after v0.17.2 release. by @loadams in #7417
  • Enable torch version dependent compilation of record_module and iter_params by @deepcharm in #7362
  • [BUGFIX] Reset bucket.elements after reduction in ZeRO Stage 3 by @rahul713rk in #7418
  • Align missing argument in AllReduceCoalescedHandle by @deepcharm in #7414
  • Improvements to Communication Logger by @alexk101 in #7404
  • trying to fix nv-accelerate-v100.yml CI job by @stas00 in #7424
  • fix: Propagate strip_tensor_paddings by @saforem2 in #7426
  • Use past_key_value when provided by @deepcharm in #7428
  • set device_id in torch's init_process_group by @stas00 in #7266
  • [Ulysses-ALST] add FA3 support by @stas00 in #7430
  • TiledMLP + SequenceTiledCompute: improve the bs>1 use-case by @stas00 in #7422
  • Remove unused yaml test configurations and update README by @loadams in #7441
  • [ALST] fix typo in the url by @stas00 in #7444
  • [ALST] fix typo in the url part2 by @stas00 in #7446
  • Remove additional unused tests (human-eval) by @loadams in #7445
  • Fix: Adapt Llama injection policy for newer transformers versions by @huanyuqu in #7443

New Contributors

Full Changelog: v0.17.2...v0.17.3

v0.17.2 Patch Release

07 Jul 18:13
15f054d

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.17.1...v0.17.2