Simulating MI300X with gem5 and Saving 100k?
A while ago, someone in the community published a Zhihu post on simulating MI300X with gem5 1 . Since I was recently verifying AMDGPU floating point precision, I wanted to compare
A while ago, someone in the community published a Zhihu post on simulating MI300X with gem5 1 . Since I was recently verifying AMDGPU floating point precision, I wanted to compare
This article first appeared on the WeChat public account GTOC. Quantization is widely used in industry to improve the training and inference efficiency of large models and reduce c