Releases: kubernetes-sigs/lws
Releases · kubernetes-sigs/lws
v0.7.0
What's Changed
🚀 Major Features:
- add kep-511 partition update by @lixin010 in #541
- KEP #552: allow updates to
spec.leaderWorkerTemplate.sizefield by @synthe102 in #557 - Add KEP#407: Gang Scheduling by @JesseStutler in #496
✨ Features:
- Add LWS configuration in Helm Charts by @ardaguclu in #517
- feat: add meaningful printer columns to LeaderWorkerSet CRD by @xigang in #580
🐛 Bugs:
- Fix helm update by @kerthcet in #481
- Fix UpdateInProgress conditions for newly created LWS by @yankay in #490
- Fix helm install panic by invalid values
.Values.certManager.enableby @nicole-lihui in #520 - Fix issue with rolling update when some existing replicas are unhealthy by @pierewoj in #488
- Fix lws chart: the webhook service selects unexpected pods when it is introduced as a dependency by @carlory in #536
- fix: support foreground deletion by @congcongke in #523
- fix: llamacpp compile error with adding curl dependency by @yankay in #559
- fix rolling update stuck that when old pods were not ready by @gujingit in #568
♻️ Cleanups:
- use a bigger machine for cloud build by @kannon92 in #483
- [CI] Add more verify include gomod, fmt, and generate-apiref by @yankay in #485
- docs: remove the use of Downward API for LWS_WORKER_INDEX by @chewong in #486
- [Docs] Fixed location of vLLM script on TPU image by @Edwinhr716 in #494
- Introduce env variable to modify ns in metrics tests by @ardaguclu in #489
- Add an example of using OCI to the Helm installation guide by @yankay in #503
- Add
prepare-release-branchin Makefile to update versions automatically by @yankay in #504 - Inject certs for metrics endpoint when cert-manager is enabled by @ardaguclu in #478
- Remove duplicate
tagkey in chart values by @nicolasochem in #509 - Run e2e tests also when cert-manager is enabled by @ardaguclu in #518
- Removing SaxML docs by @Edwinhr716 in #522
- Use correct lws apigroup name by @ardaguclu in #527
- fix: check optional field not nil before reference by @panpan0000 in #530
- Example of Configuration CRD by @panpan0000 in #526
- Add dynamo and new presentation by @kerthcet in #549
- test: fix race condition in worker StatefulSet deletion test case by @xigang in #574
- refactor: remove unused lws parameter from CreateRevision function by @xigang in #573
- Update vllm.md by @jonathan-innis in #581
- [adoption] add OME as new adopter by @panpan0000 in #579
- [talk and slides] adding KubeCon HK and JP 2025 talks for LWS by @panpan0000 in #582
- docs: Add HUGGING_FACE_HUB_TOKEN as environment variable by @jonathan-innis in #583
- auto-sync the CRD via make generate & make helm-chart-push by @kerthcet in #592
- Default Partition in RollingUpdateConfiguration to 0 to align with spec by @ardaguclu in #594
- fix: disable CURL to resolve libcurl dependency issues in llamacpp example by @yankay in #599
New Contributors
- @nicolasochem made their first contribution in #509
- @panpan0000 made their first contribution in #525
- @pierewoj made their first contribution in #488
- @lixin010 made their first contribution in #541
- @synthe102 made their first contribution in #557
- @xigang made their first contribution in #574
- @jonathan-innis made their first contribution in #581
- @JesseStutler made their first contribution in #496
Full Changelog: v0.6.0...v0.7.0
v0.6.3
v0.6.2
v0.6.1
v0.6.0
What's Changed:
🚀 Major Features:
- Introduce new component config flag by @ardaguclu in #325
- Creating LWS site by @Edwinhr716 in #426
- KEP 257: LeaderExcluded SubGroup support by @Edwinhr716 in #428
✨ Features:
- Make lws image multi-platform by @yankay in #321
- Inject environment variable LWS_WORKER_INDEX by @singeleaf in #416
🐛 Bugs:
- Add controllerrevisions rbac to helm chart. by @wqlparallel in #319
- Fix for odd numbers of arguments passed as key-value pairs for logging issue by @KeerthanaAP in #360
- Fix infinite StatefulSet creation loops by validate leader annotations by @yankay in #394
- Change annotation
leaderworkerset.gke.io/subgroup-sizetoleaderworkerset.sigs.k8s.io/subgroup-sizeby @yankay in #434 - ignore 0 replica check by @congcongke in #439
♻️ Cleanups:
- Remove pods create permission by @wqlparallel in #323
- add junit and artifacts directory for e2e tests so testgrid will disp… by @kannon92 in #335
- Pass ARTIFACTS env variable when building e2e tests by @Edwinhr716 in #345
- Add adopters, integrations, talks by @kerthcet in #367
- Deprecate flags as the recommended path is to pass configuration file by @ardaguclu in #361
- doc: Add terraform based EKS Blueprints pattern for multi-node inference by @askulkarni2 in #373
- Add missing external types to apply configurations by @astefanutti in #374
- Ignore LeaderElectionNamespace in cmd unit tests as it is dynamic by @ardaguclu in #383
- Allow pass through of additional params to "ray start" in ray_init.sh by @dims in #378
- use staging images instead of gcr by @kannon92 in #398
- Add files to publish helm chart by @Edwinhr716 in #411
- fix image build toolchain errors by @kannon92 in #424
- Run controller-gen only for api and pkg directories by @ardaguclu in #417
- Add Label
leaderworkerset.sigs.k8s.io/nameto the headlessService by @yankay in #433 - Use create jobs command instead of run pods in e2e testing by @ardaguclu in #445
- Export kind and cluster related logs under artifacts directory for e2e by @ardaguclu in #451
- Set readOnlyRootFilesystem explicitly to true by @ardaguclu in #457
- Check controller_runtime_webhook_requests_total that always exists in both pods by @ardaguclu in #456
- Explicitly bind metrics 8443 as containerPort in kustomize and helm by @ardaguclu in #458
📃 Docs:
- Adding User Guide for TensorRT-LLM by @Edwinhr716 in #356
- Adding User Guide for SGLang by @yankay in #377
- [Docs] Use the vllm image instead of building by the user self in the sample by @yankay in #453
- [Doc] Add troubleshooting page in site by @yankay in #468
New Contributors
- @yankay made their first contribution in #321
- @ardaguclu made their first contribution in #325
- @DiegoLelis made their first contribution in #354
- @KeerthanaAP made their first contribution in #360
- @askulkarni2 made their first contribution in #373
- @astefanutti made their first contribution in #374
- @dims made their first contribution in #378
- @singeleaf made their first contribution in #416
- @congcongke made their first contribution in #439
Full Changelog: v0.5.0...v0.6.0
v0.5.1
What's Changed
- Automated cherry pick of #347: Removed update operation from pod webhook by @Edwinhr716 in #348
- Automated cherry pick of #342: Added handling of nil revision in pod controller by @Edwinhr716 in #349
- Automated cherry pick of #341: Removed restarting group log in pod controller by @Edwinhr716 in #350
Full Changelog: v0.5.0...v0.5.1
Release v0.5.0
What's Changed
- Update installation guide by @liurupeng in #217
- Bump the kubernetes group with 5 updates by @dependabot in #224
- Add support to install lws at an arbitrary namespace by @kerthcet in #226
- feat: add build arg to docker buildx by @vie-serendipity in #227
- Injecting TPU_NAME environment variable by @Edwinhr716 in #209
- Creating new workerStatefulSet instead of patching when updating LWS template by @Edwinhr716 in #229
- Update KEP #173 to reflect implementation by @Edwinhr716 in #233
- Add cherry-pick script by @liurupeng in #235
- Add vLLM TPU Multihost example by @Edwinhr716 in #234
- Bump the kubernetes group with 5 updates by @dependabot in #241
- Bump sigs.k8s.io/controller-runtime from 0.19.0 to 0.19.1 by @dependabot in #242
- Changed log level to 2 by @Edwinhr716 in #244
- Bump github.com/onsi/ginkgo/v2 from 2.20.2 to 2.21.0 by @dependabot in #245
- Bump github.com/onsi/gomega from 1.34.2 to 1.35.1 by @dependabot in #246
- change to not use NetworkConfig in hash calculation if it has default value by @Edwinhr716 in #250
- Added validation for subdomainPolicy values by @Edwinhr716 in #251
- changed cherry pick script to be executable by @Edwinhr716 in #253
- add pods finalizer update rbac to allow lws to run in openshift by @kannon92 in #249
- Update installation instructions for release 0.4.2 by @ahg-g in #255
- Bump github.com/open-policy-agent/cert-controller from 0.11.0 to 0.12.0 by @dependabot in #256
- Bump sigs.k8s.io/structured-merge-diff/v4 from 4.4.1 to 4.4.3 by @dependabot in #258
- Unify Leader Election Namespace with Deployment Namespace by @wqlparallel in #263
- Bump github.com/onsi/gomega from 1.35.1 to 1.36.0 by @dependabot in #271
- Added validation for LWS Name by @Edwinhr716 in #272
- Updating OWNERS file by @Edwinhr716 in #273
- Bump github.com/onsi/ginkgo/v2 from 2.21.0 to 2.22.0 by @dependabot in #269
- Bump sigs.k8s.io/controller-runtime from 0.19.1 to 0.19.2 by @dependabot in #270
- Bump the kubernetes group with 5 updates by @dependabot in #268
- Bump sigs.k8s.io/controller-runtime from 0.19.2 to 0.19.3 by @dependabot in #278
- update golang to 1.23 and fix docker warnings by @kannon92 in #279
- Use the namespace of the service account by @kannon92 in #282
- Bump the kubernetes group with 5 updates by @dependabot in #286
- Bump sigs.k8s.io/structured-merge-diff/v4 from 4.4.3 to 4.5.0 by @dependabot in #287
- Bump github.com/onsi/gomega from 1.36.0 to 1.36.1 by @dependabot in #288
- add doc to say how to install in a different namespace by @kannon92 in #285
- Add helm charts for lws. by @wqlparallel in #266
- add KEP 238, to add controller revision by @Edwinhr716 in #261
- address non linear parsing cve by upgrading golang net by @kannon92 in #290
- Bump github.com/onsi/ginkgo/v2 from 2.22.0 to 2.22.1 by @dependabot in #292
- Remove
namespacearg from manager command by @chiragjn in #293 - Add Controller Revision (Implementation of KEP #238) by @Edwinhr716 in #277
- Bump github.com/onsi/gomega from 1.36.1 to 1.36.2 by @dependabot in #295
- Tests for Controller Revision by @Edwinhr716 in #297
- Minimize number of unnecessary logs by @Edwinhr716 in #298
- Update Reference Doc by @Edwinhr716 in #300
- Updating owners by @Edwinhr716 in #301
- Add Events by @Edwinhr716 in #299
- Bump github.com/onsi/ginkgo/v2 from 2.22.1 to 2.22.2 by @dependabot in #303
- Replaced Kueue with LWS on chart/lws ReadMe by @Edwinhr716 in #305
- Ignore not found error when handling ReacreateGroupOnPodRestart by @Edwinhr716 in #304
- Replace kube-rbac-proxy by @anshuman-agarwala in #284
- Fixed flaky ControllerRevision test and add CR events by @Edwinhr716 in #306
- Fixed metrics service port, added log level flag back by @Edwinhr716 in #307
- Removed kube-rbac-proxy container from helm charts by @Edwinhr716 in #308
New Contributors
- @vie-serendipity made their first contribution in #227
- @wqlparallel made their first contribution in #263
- @chiragjn made their first contribution in #293
- @anshuman-agarwala made their first contribution in #284
Full Changelog: v0.4.0...v0.5.0
0.4.2
What's Changed
- Automated cherry pick of #250: change to not use NetworkConfig in hash calculation if it has default value by @Edwinhr716 in #252
Full Changelog: v0.4.1...v0.4.2
LeaderWorkerSet Patch release 0.4.1
This is a patch release for v0.4.0
Leaderworkerset v0.4.0
Features:
- Support specify NetworkConfig
- Support leader elect for lws controller
- Add group size as an environment variable
- Add an example for using llama.cpp to deploy a distributed inference service
- Update the GPU multi-node inference with vLLM example to serve Llama3.1-405b model
What's Changed
- Update the lws version of the installation guide by @liurupeng in #160
- Bump the kubernetes group with 6 updates by @dependabot in #165
- add leader elect args by @gujingit in #163
- Add executable permission to the example script ray_init.sh by @xgchena in #166
- docs: add LeaderWorkerSet API reference by @zmvictor in #171
- Bump the kubernetes group with 6 updates by @dependabot in #175
- Add example for llama.cpp by @justinsb in #174
- statefulset finalizer rbac update by @kannon92 in #176
- add verify job and fix code where it fails by @kannon92 in #180
- Bump github.com/onsi/gomega from 1.33.1 to 1.34.0 by @dependabot in #181
- set golangci-lint for 15 minutes to be same as jobset by @kannon92 in #183
- Bump github.com/onsi/ginkgo/v2 from 2.19.0 to 2.19.1 by @dependabot in #182
- Fix missing startupPolicy in applyConfiguration by @kerthcet in #186
- Bump github.com/onsi/gomega from 1.34.0 to 1.34.1 by @dependabot in #187
- updating vllm docs to use llama3 405B as example by @Edwinhr716 in #185
- No worker sts when size=1 by @kerthcet in #190
- Bump cert-controller to v0.11.0 by @kerthcet in #191
- Bump github.com/onsi/ginkgo/v2 from 2.19.1 to 2.20.0 by @dependabot in #192
- add golangci config to project by @kannon92 in #194
- Headless Service Per Replica KEP by @Edwinhr716 in #188
- fix make generate by @kerthcet in #196
- Bump the kubernetes group with 6 updates by @dependabot in #198
- We should not import from k8s/kubernetes. by @kannon92 in #200
- Updated vLLM yaml example to use LWS_LEADER_ADDRESS by @Edwinhr716 in #202
- Bump github.com/onsi/ginkgo/v2 from 2.20.0 to 2.20.1 by @dependabot in #201
- Bump github.com/onsi/gomega from 1.34.1 to 1.34.2 by @dependabot in #205
- Bump github.com/onsi/ginkgo/v2 from 2.20.1 to 2.20.2 by @dependabot in #206
- Bump kubernetes dependencies to v0.31.x. by @mbobrovskyi in #207
- feature: Add group size environment variable injection by @googs1025 in #208
- simply llama.cpp example by @carlory in #212
- Make llama.cpp example become avaliable on macOS by @carlory in #211
- Adding SubdomainPolicy to support a service per replica by @Edwinhr716 in #197
- Deprecated DefaultRestartPolicy with NoneRestartPolicy by @kerthcet in #214
New Contributors
- @xgchena made their first contribution in #166
- @zmvictor made their first contribution in #171
- @justinsb made their first contribution in #174
- @mbobrovskyi made their first contribution in #207
- @carlory made their first contribution in #212
Full Changelog: v0.3.0...v0.4.0