Releases · lixilinx/psgd_torch

25 Dec 19:06

lixilinx

b4b4c11

PSGD 2.0 release Latest

Latest

PSGD 2.0 supports both the old tri-solver based update formulae for Q and a few inverse-free matmul only methods for updating Q, including online Newton-Schulz iterations. Main files:

psgd.py: functional APIs providing all the flexibilities.
wrapped_as_torch_optimizer_for_ddp.py: a basic momentum whitening torch.optim.Optimizer wrapping example for DDP training.
wrapped_as_torch_optimizer_for_dtensor.py: one more basic momentum whitening torch.optim.Optimizer wrapping example for DTensor-based distributed training.

Assets 2

17 Dec 06:39

lixilinx

1.0

e1a82fc

archived code

Update preconditioned_stochastic_gradient_descent.py

1, replace trtrs with triangular_solve due to torch's API update

2, use torch.chain_matmul for things like A @ B @ C ...

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Releases: lixilinx/psgd_torch

PSGD 2.0 release

Uh oh!

archived code

Uh oh!