Transformer Visualizer

Mixed Precision Training (AMP)

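A methodology for training deep neural networks (DNNs) with half-precision floating-point numbers without losing model accuracy or having to modify hyperparameters. Because half precision has a narrower range than single precision, the authors propose three techniques for preventing the loss of critical information: keeping a single-precision master copy of the weights, scaling the loss so that small gradient values are preserved, and accumulating half-precision arithmetic into single-precision outputs.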

image.png
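
To make the "narrower range" problem concrete, here is a minimal sketch of loss scaling, one of the techniques from the mixed precision training paper. It uses only the Python standard library, so it illustrates the arithmetic rather than showing how any framework implements AMP. The helper `to_fp16`, the gradient value `1e-8`, and the scale factor `1024` are assumptions chosen for illustration.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    # The struct format "e" is the 16-bit binary16 floating-point format.
    return struct.unpack("<e", struct.pack("<e", x))[0]


# Illustrative values (assumptions, not taken from the notes above).
grad = 1e-8          # a small gradient, well below FP16's smallest subnormal (~6e-8)
loss_scale = 1024.0  # hypothetical scale factor; a power of two

# Without scaling, the gradient underflows to zero when stored in FP16.
unscaled = to_fp16(grad)

# With loss scaling, the value is multiplied up before the FP16 cast,
# then divided back down in higher precision.
scaled = to_fp16(grad * loss_scale)
recovered = scaled / loss_scale

print("FP16 without scaling:", unscaled)
print("FP16 with scaling, after unscaling:", recovered)
```

In a real training loop the scale is applied to the loss before backpropagation, so every gradient is scaled by the chain rule, and the gradients are unscaled before the weight update. Frameworks usually also adjust the scale factor dynamically when they detect overflow.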

Nsight Compute and Systems

Nsight Systems is a high-level profiler for system-wide profiling, while Nsight Compute is a low-level profiler for individual CUDA kernels.