Resource As You Wish: Collaborative Reservation and Allocation by Scheduler Plugin and Device Plugin. Takuya Mishina, Tatsuhiro Chiba. 2024. KubeDay Japan 2024.
Using automation to mitigate risk and enforce policy compliance. Matthew Jones, Yuji Watanabe, et al. 2024. Red Hat Summit 2024.
Trimaran: Load-Aware Scheduling for Power Efficiency and Performance Stability. Asser Tantawi, Chen Wang. 2024. KubeCon EU 2024.
CASPIAN: A Carbon-Optimized Multi-Cluster Job Scheduler. Tayebeh Bahreini, Asser Tantawi. 2024. KubeCon EU 2024.
Training Foundation Model Workloads on Kubernetes at Scale With MCAD. Olivier Tardieu, Abhishek Malvankar. 2023. K8SAIHPCDAY 2023.
A carbon-aware workload dispatcher in multi-cluster Kubernetes environments. Tayebeh Bahreini, Asser Tantawi. 2023. Cloud Native Sustainability Week 2023.
GPU Optimizations for Efficient and Cost-Effective Access to Diverse Large Language Models in Research Cluster. Chen Wang, Yue Zhu, et al. 2024. MLSys 2024.
DEFT: SLO-Driven Preemptive Scheduling for Containerized DNN Serving. Yitian Hao, Wenqing Wu, et al. 2023. NSDI 2023.