Thank you so much for the helpful information!
Question 1: Could you please provide me with the MME clock speed?
Question 2: Does "configurable MME" mean that the graph compiler can choose which operations run on the MME and which run on the TPC?
Question 3: How is the sparse matrix multiplication (SpMM) optimized on the MME?
Question 4: I observed that fp16 is not supported for the GEMM operation. Could you please explain the architectural differences between NVIDIA GPU Tensor Cores and the MME EU cores?
Thank you!