The first authors of both of this year's best papers are of Chinese descent.
Paper: FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
Paper link: https://arxiv.org/abs/2501.01005
Project homepage: https://flashinfer.ai/
GitHub repository: https://github.com/flashinfer-ai/flashinfer
Paper: The Hidden Bloat in Machine Learning Systems
Paper link: https://arxiv.org/abs/2503.14226