Skip to content

Commit 8f9ceb0

Browse files
authored
Merge pull request #50 from AmberLJC/claude/issue-49-20260113-0636
Add new papers: AIConfigurator and MoEBlaze
2 parents 9a17c20 + ed4c98b commit 8f9ceb0

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -81,6 +81,7 @@ A curated list of Large Language Model systems related academic papers, articles
8181
- [TrainVerify: Equivalence-Based Verification for Distributed LLM Training](https://sigops.org/s/conferences/sosp/2025/accepted.html) | SOSP' 25
8282
- [Collective Communication for 100k+ GPUs](https://arxiv.org/abs/2510.20171): Large-scale collective communication optimization for massive GPU clusters
8383
- [RDMA Point-to-Point Communication for LLM Systems](https://arxiv.org/abs/2510.27656): RDMA-based point-to-point communication optimization for distributed LLM systems
84+
- [MoEBlaze](https://arxiv.org/abs/2601.05296): Breaking the Memory Wall for Efficient MoE Training on Modern GPUs
8485

8586

8687
#### Systems for Post-training / RLHF
@@ -237,6 +238,7 @@ A curated list of Large Language Model systems related academic papers, articles
237238
- [The ML.ENERGY Benchmark](https://arxiv.org/abs/2505.06371): Toward Automated Inference Energy Measurement and Optimization | NeurIPS' 25
238239
- [Serve Programs, Not Prompts](https://arxiv.org/abs/2510.25412): Efficient LLM serving system for structured program execution
239240
- [Continuum](https://arxiv.org/abs/2511.02230): Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live
241+
- [AIConfigurator](https://arxiv.org/abs/2601.06288): Lightning-Fast Configuration Optimization for Multi-Framework LLM Serving
240242

241243

242244
#### Agent Systems

0 commit comments

Comments
 (0)