Research
|
Tally: Non-Intrusive Performance Isolation for Concurrent Deep Learning Workloads
Wei Zhao, Anand Jayarajan, Gennady Pekhimenko
ASPLOS 2025 (Distinguished Artifact Award)
paper / code
|
Seesaw: High-throughput LLM Inference via Model Re-sharding
Qidong Su, Wei Zhao, Xin Li, Muralidhar Andoorveedu, Chenhao Jiang, Zhanda Zhu, Kevin Song, Christina Giannoula, Gennady Pekhimenko
MLSys 2025 (Outstanding Paper Honorable Mention)
paper / code
|
TiLT: A Time-Centric Approach for Stream Query Optimization and Parallelization
Anand Jayarajan, Wei Zhao, Yudi Sun, Gennady Pekhimenko
ASPLOS 2023 (Distinguished Artifact Award)
paper / code
|
Website template from Jon Barron.
Last updated: June 2, 2025
|
|