• EgoIncarnate 14 hours ago

    not "an H200", "In the table above, tensor parallelism is compared to pipeline parallelism with each across eight GPUs"

    • FanaHOVA 14 hours ago

      Title on HN is wrong. The article says GPUs and it's referring to one of their 8xH200 boxes.

    • moondistance 16 hours ago

      Significant further optimizations. FP8!

      • 7e 15 hours ago

        And this is why nobody submits MLPerf against NVIDIA.