Skip to content

Releases: lixilinx/psgd_torch

PSGD 2.0 release

25 Dec 19:06
b4b4c11

Choose a tag to compare

PSGD 2.0 supports both the old tri-solver based update formulae for Q and a few inverse-free matmul only methods for updating Q, including online Newton-Schulz iterations. Main files:

archived code

17 Dec 06:39
e1a82fc

Choose a tag to compare

Update preconditioned_stochastic_gradient_descent.py

1, replace trtrs with triangular_solve due to torch's API update

2, use torch.chain_matmul for things like A @ B @ C ...