NVIDIA technical blog: https://developer.nvidia.com/zh-cn/blog/fp8-precision-performance/
Reference paper: Recipes for Pre-training LLMs with MXFP8, https://arxiv.org/pdf/2506.08027