NVIDIA Interview Question

What are efficient ways to parallelize matrix multiplication?
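
One common starting point for this question is row-block decomposition: every row of the product C = A x B depends only on the matching row of A and all of B, so disjoint row ranges can be computed independently with no synchronization on the output. The sketch below is only an illustration of that decomposition, not a tuned implementation. The names `matmul_rows` and `parallel_matmul` and the `workers` parameter are hypothetical, and it assumes non-empty, rectangular list-of-lists matrices. Note that in CPython the global interpreter lock prevents threads from speeding up pure-Python arithmetic, so this shows how the work is split rather than a real speedup. On a GPU the same idea is usually refined into 2D tiles, with each thread block staging tiles of A and B in shared memory.

```python
from concurrent.futures import ThreadPoolExecutor


def matmul_rows(a, b, row_start, row_end):
    """Compute rows [row_start, row_end) of the product a @ b."""
    inner = len(b)
    # Transpose b so each dot product reads one row of a and one row of b_t.
    b_t = list(zip(*b))
    return [
        [sum(a[i][k] * col[k] for k in range(inner)) for col in b_t]
        for i in range(row_start, row_end)
    ]


def parallel_matmul(a, b, workers=4):
    """Multiply a (n x m) by b (m x p) by splitting the rows of a across workers.

    Hypothetical helper: assumes non-empty, rectangular list-of-lists inputs.
    """
    if not a or not b or len(a[0]) != len(b):
        raise ValueError("inner dimensions must match and inputs must be non-empty")

    n = len(a)
    # Ceiling division so every row lands in exactly one block.
    block = (n + workers - 1) // workers
    ranges = [(start, min(start + block, n)) for start in range(0, n, block)]

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Each task writes a disjoint set of output rows, so no locking is needed.
        parts = list(pool.map(lambda r: matmul_rows(a, b, r[0], r[1]), ranges))

    # map() preserves task order, so the row blocks concatenate back in order.
    return [row for part in parts for row in part]
```

For example, `parallel_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]])` returns `[[19, 22], [43, 50]]`. Splitting by output rows (or output tiles) rather than along the inner dimension avoids having several workers accumulate into the same output element, which would need a reduction step.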