Simulation

Simulating MI300X with gem5 and Saving 100k?

A while ago, someone in the community published a Zhihu post on simulating MI300X with gem5 [1]. Since I was recently verifying AMDGPU floating-point precision, I wanted to compare

A Preliminary Look at Floating-Point Precision for AI FPU Virtual Prototyping Platforms for LLMs

This article first appeared on the WeChat official account GTOC. Quantization is widely used in industry to improve the training and inference efficiency of large models and reduce c