• EgoIncarnate 9 months ago

    not "an H200", "In the table above, tensor parallelism is compared to pipeline parallelism with each across eight GPUs"

    • FanaHOVA 9 months ago

      Title on HN is wrong. The article says GPUs and it's referring to one of their 8xH200 boxes.

    • 7e 9 months ago

      And this is why nobody submits MLPerf against NVIDIA.

    • moondistance 9 months ago

      Significant further optimizations. FP8!