Monitoring GPU cluster performance with NVIDIA DCGM-Exporter and Weights & Biases
[DRAFT] A guide to consuming GPU cluster system metrics exposed by NVIDIA DCGM-Exporter with Weights & Biases.
NVIDIA Data Center GPU Manager (DCGM) is a suite of tools for managing and monitoring NVIDIA datacenter GPUs in cluster environments. DCGM lets users gather GPU metrics, understand workload behavior, and monitor GPU performance across a cluster. The DCGM-Exporter tool exposes GPU metrics at an HTTP endpoint (/metrics) in the OpenMetrics format, the de facto standard for transmitting cloud-native metrics at scale. DCGM-Exporter can be deployed in a variety of environments, for example in Kubernetes.
The Weights & Biases (W&B) SDK ships with an OpenMetrics feature that lets users capture and log metrics from external endpoints exposing OpenMetrics / Prometheus-compatible data, with custom regex-based filters applied to the consumed metrics.
Configuring W&B SDK to consume DCGM-Exporter-exposed metrics
- The user can configure the integration either with environment variables or with settings passed to the wandb.init function (a minimal environment-variable sketch follows this list).
- The user can define OM/P endpoints to scrape (x_stats_open_metrics_endpoints) in the following format: {"open-metrics-endpoint-name": "<url>"}
- Optionally, regex-based filters (x_stats_open_metrics_filters) can be applied to the consumed metrics, in either of the following formats:
- {"metric-regex-pattern-including-endpoint-name-as-prefix": {"label": "label value regex pattern", ...}, ...}
- ("metric-regex-pattern-including-endpoint-name-as-prefix", ...)
- The sampling interval (x_stats_sampling_interval) controls how often wandb's system monitor scrapes the user-defined OM/P endpoints. Older SDK versions instead exposed _stats_sample_rate_seconds (defaulting to 2 seconds) together with _stats_samples_to_average (defaulting to 15), which by default yields a data point every 30 seconds.
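As an illustration of the environment-variable route mentioned above, here is a minimal, hypothetical Python sketch that sets the variables programmatically before calling wandb.init (the run.sh example later in this guide does the same from a shell script). The endpoint address is an assumption and should be replaced with your DCGM-Exporter instance:

import json
import os

import wandb

# Hypothetical single-node endpoint; replace with the address of your DCGM-Exporter.
os.environ["WANDB_X_STATS_OPEN_METRICS_ENDPOINTS"] = json.dumps(
    {"node1": "http://localhost:9400/metrics"}
)
# Keep only GPU temperature and power draw, for any GPU index.
os.environ["WANDB_X_STATS_OPEN_METRICS_FILTERS"] = json.dumps(
    {"node1.DCGM_FI_DEV_(GPU_TEMP|POWER_USAGE)": {"gpu": ".*"}}
)

# The variables must be set before wandb.init is called.
run = wandb.init(project="dcgm")
# ... training code ...
run.finish()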
Simple Example
Let's look at an example Python script:
import time

import tqdm
import wandb

run = wandb.init(
    project="dcgm",
    settings=wandb.Settings(
        x_stats_open_metrics_endpoints={
            "node1": "http://192.168.0.1:9400/metrics",  # ensure this is the metrics endpoint
            "node2": "http://192.168.0.2:9400/metrics",
        },
        x_stats_open_metrics_filters={
            "node1.DCGM_FI_DEV_(POWER_USAGE|MEM_COPY_UTIL|TOTAL_ENERGY_CONSUMPTION|GPU_TEMP|MEMORY_TEMP)": {
                "gpu": "[0,1]",
            },
            "node2.DCGM_FI_DEV_(POWER_USAGE|MEM_COPY_UTIL|TOTAL_ENERGY_CONSUMPTION|GPU_TEMP|MEMORY_TEMP)": {
                "gpu": ".*",
            },
        },
        # optional headers in case, for example, the endpoints sit behind a proxy requiring authentication
        # x_stats_open_metrics_http_headers={"Authorization": "Bearer MEDVED"},
    ),
)

for i in tqdm.tqdm(range(300)):
    time.sleep(1)
    run.log({"loss": 1.0 / (i + 1)})

run.finish()
- The wandb SDK will consume the two OpenMetrics endpoints defined by the x_stats_open_metrics_endpoints setting.
- The keys of the dictionary provided by the user will be used to namespace the scraped metrics in the app (see below for example screenshots).
- In the System section on the Run page in the app, the user will see 5 plots (the filtered metrics, see below) per endpoint (10 in total).
- The x_stats_open_metrics_filters setting defines the filters applied to the consumed data: only five metrics per endpoint will be saved to wandb (DCGM_FI_DEV_POWER_USAGE, DCGM_FI_DEV_MEM_COPY_UTIL, DCGM_FI_DEV_TOTAL_ENERGY_CONSUMPTION, DCGM_FI_DEV_GPU_TEMP, and DCGM_FI_DEV_MEMORY_TEMP). For the first endpoint, only the metrics for GPUs 0 and 1 will be streamed, while for the second endpoint, data for all GPUs will be streamed.
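As noted in the configuration section, the filters can also be passed as a plain sequence of regex patterns when no per-label filtering is needed. A minimal sketch of that form, reusing the hypothetical node1 endpoint from the example above:

import wandb

run = wandb.init(
    project="dcgm",
    settings=wandb.Settings(
        x_stats_open_metrics_endpoints={
            "node1": "http://192.168.0.1:9400/metrics",
        },
        # Sequence form: keep every metric matching any of these patterns,
        # with no additional filtering on labels such as "gpu".
        x_stats_open_metrics_filters=(
            "node1.DCGM_FI_DEV_GPU_TEMP",
            "node1.DCGM_FI_DEV_POWER_USAGE",
        ),
    ),
)
run.finish()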
Notes
- On certain clusters, if the user wants to scrape data from the endpoint running on the same node, they can't use the node name or its IP address (due to the networking setup); they must use http://localhost:9400/metrics as the endpoint URL instead.
- The DCGM-Exporter can output a lot of data. The following five metrics are some of the most useful for tracking NVIDIA GPU performance in a cluster (a quick way to inspect the raw endpoint output is sketched after this list):
- DCGM_FI_DEV_POWER_USAGE: Power usage for the device in Watts
- DCGM_FI_DEV_MEM_COPY_UTIL: Memory utilization (in %)
- DCGM_FI_DEV_TOTAL_ENERGY_CONSUMPTION: Total energy consumption for the GPU in mJ since the driver was last reloaded
- DCGM_FI_DEV_GPU_TEMP: Current temperature readings for the device, in degrees C
- DCGM_FI_DEV_MEMORY_TEMP: Memory temperature for the device, in degrees C
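To get a feel for the raw data before wiring it into wandb, it can help to fetch the endpoint directly and look at these metrics. A minimal sketch using only the Python standard library, assuming the exporter is reachable at localhost:9400 (adjust the URL to your setup):

import re
import urllib.request

URL = "http://localhost:9400/metrics"  # hypothetical endpoint address

with urllib.request.urlopen(URL, timeout=5) as response:
    text = response.read().decode()

# Keep only the five metrics discussed above; each exposed line looks roughly like
# DCGM_FI_DEV_GPU_TEMP{gpu="0",UUID="...",...} 43
pattern = re.compile(
    r"^DCGM_FI_DEV_(?:POWER_USAGE|MEM_COPY_UTIL|TOTAL_ENERGY_CONSUMPTION|GPU_TEMP|MEMORY_TEMP)\{.*",
    re.MULTILINE,
)
for line in pattern.findall(text):
    print(line)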
SLURM + NVIDIA DCGM-Exporter Multi-Node Example
In this example, we will show how to configure wandb to consume DCGM-Exporter-exposed metrics in a GPU cluster managed with SLURM.
We will parse several SLURM-related environment variables inside a Python script and use them to construct the endpoints to be scraped. This example emulates a situation where the user wants to initialize a wandb run only on the RANK=0 node, but still capture GPU-related metrics for all the relevant nodes.
srun --nodes=2 --gpus=4 --cpus-per-gpu=12 --job-name=wandb python3 /<path>/<to>/srun-dcgm-multinode.py
srun-dcgm-multinode.py
import os
import re
import time
from typing import List

import tqdm
import wandb


def unpack_node_list(regex_string: str) -> List[str]:
    """Unpack a SLURM node list string (e.g. "node[1-3,5]") into a list of node names."""
    pattern = re.compile(r"\[([\d,-]+)\]")
    match = pattern.search(regex_string)
    if match:
        segments = match.group(1).split(",")
        unpacked_strings = []
        for segment in segments:
            if "-" in segment:
                start_number, end_number = [int(x) for x in segment.split("-")]
                for number in range(start_number, end_number + 1):
                    unpacked_strings.append(
                        regex_string[: match.start()] + str(number) + regex_string[match.end():]
                    )
            else:
                unpacked_strings.append(
                    regex_string[: match.start()] + segment + regex_string[match.end():]
                )
        return unpacked_strings
    return [regex_string]


def main() -> None:
    # simulate creating a wandb run only on the rank 0 node
    if os.environ.get("SLURM_NODEID") != "0":
        return

    node_list = unpack_node_list(os.environ["SLURM_NODELIST"])
    # drop this node: it is scraped via localhost (see note above)
    node_list = [node for node in node_list if node != os.environ["SLURMD_NODENAME"]]

    run = wandb.init(
        project="dcgm",
        settings=wandb.Settings(
            x_stats_sampling_interval=1,
            x_stats_open_metrics_endpoints={
                **{
                    "node1": "http://localhost:9400/metrics",  # see note above
                },
                **{
                    f"node{n + 2}": f"http://{node}:9400/metrics"
                    for n, node in enumerate(node_list)
                },
            },
            x_stats_open_metrics_filters={
                ".*DCGM_FI_DEV_(POWER_USAGE|MEM_COPY_UTIL|TOTAL_ENERGY_CONSUMPTION|GPU_TEMP|MEMORY_TEMP)": {
                    "gpu": ".*",
                },
            },
        ),
    )

    for i in tqdm.tqdm(range(300)):
        time.sleep(1)
        run.log({"loss": 1 / (i + 1)})

    run.finish()


if __name__ == "__main__":
    main()
Here, we've configured the wandb SDK's system monitor to sample system metrics every second (x_stats_sampling_interval=1; the default is 10 seconds).
Collected metrics
The collected metrics will look like the panel shown below. The user can aggregate them to track, for example, resource utilization at a glance, and dig into the details when necessary.
[W&B panel: system metrics from run dry-meadow-24]
SLURM + NVIDIA DCGM-Exporter Single Node Example
In this example, we will set up environment variables in a shell script (propagating the CUDA_VISIBLE_DEVICES variable into the filters so that only the metrics for the visible GPUs are captured) and execute a simple Python script. The wandb SDK will consume the metrics reported by the NVIDIA DCGM-Exporter running on the same node.
srun --nodes=1 --gpus=2 --cpus-per-gpu=12 --job-name=wandb /admin/home-dimaduev/run.sh
run.sh
#!/usr/bin/bash

export WANDB_X_STATS_OPEN_METRICS_ENDPOINTS='{"node1": "http://localhost:9400/metrics"}'

WANDB_X_STATS_OPEN_METRICS_FILTERS='{"node1.DCGM_FI_DEV_(POWER_USAGE|MEM_COPY_UTIL|TOTAL_ENERGY_CONSUMPTION|GPU_TEMP|MEMORY_TEMP)": {"gpu": "[__CUDA_VISIBLE_DEVICES__]"}}'
# substitute the visible GPU indices into the label filter
export WANDB_X_STATS_OPEN_METRICS_FILTERS=$(echo "$WANDB_X_STATS_OPEN_METRICS_FILTERS" | sed "s/__CUDA_VISIBLE_DEVICES__/$CUDA_VISIBLE_DEVICES/")

export WANDB_X_STATS_SAMPLING_INTERVAL=15

/usr/bin/python3 srun-dcgm.py
Again, the collected metrics will look like the panel shown below, and can be aggregated or inspected in detail as needed.
[W&B panel: system metrics from run dry-meadow-24]
srun-dcgm.py
import time

import tqdm
import wandb


def main() -> None:
    run = wandb.init(project="dcgm")

    for i in tqdm.tqdm(range(300)):
        time.sleep(1)
        run.log({"x": i})

    run.finish()


if __name__ == "__main__":
    main()
Deploying DCGM-Exporter in Google Kubernetes Engine (GKE) GPU cluster
💡 TL;DR:
- Create a GKE cluster with two nodes, each with two NVIDIA Tesla T4 GPUs.
- Deploy a GPU monitoring system that uses NVIDIA DCGM; the collected metrics are exposed by the NVIDIA DCGM-Exporter.
- Deploy a pod running a GPU load test.
- Create a ClusterIP service exposing the DCGM-Exporter's metrics endpoint.
- Create a pod to scrape the metrics.
Ensure gcloud is configured and that you have the permissions required to spin up a GKE cluster with GPU resources.
export ZONE="us-central1-f"
export CLUSTER_NAME="gke-dcgm"
Create the cluster and get its credentials:
gcloud beta container clusters create $CLUSTER_NAME \
    --zone $ZONE \
    --machine-type=n1-standard-8 \
    --accelerator=type=nvidia-tesla-t4,count=2 \
    --num-nodes=2 \
    --enable-managed-prometheus

gcloud container clusters get-credentials $CLUSTER_NAME --zone $ZONE
Install GPU drivers and wait for the cluster to be up and running:
kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/cos/daemonset-preloaded.yaml
kubectl get pods -n kube-system | grep nvidia-gpu-device-plugin
Configure the cluster and deploy resources to it, including a GPU monitoring system that uses NVIDIA DCGM and the DCGM-Exporter deployed as a DaemonSet:
kubectl create namespace gpu-monitoring-system
kubectl apply -f dcgm.yml  # see below for the file content
Run a GPU load test:
kubectl apply -f dcgm_loadtest.yml # see below for the file content
Create a ClusterIP service that exposes the DCGM exporter pod's endpoint:
kubectl apply -f clusterip_service.yml # see below for the file content
From another pod in the cluster (in the same namespace), use the service name and port to reach the exposed endpoint:
kubectl apply -f ubuntu.yml  # see below for the file content
kubectl get pods --all-namespaces
kubectl exec -it --namespace=gpu-monitoring-system ubuntu -- /bin/bash

# inside the ubuntu pod:
apt update && apt install curl
curl http://dcgm-exporter-svc:9400/metrics
Config files
dcgm.yml (modified from https://github.com/suffiank/dcgm-on-gke)
dcgm_loadtest.yml (from https://github.com/suffiank/dcgm-on-gke)
clusterip_service.yml
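The original clusterip_service.yml is not reproduced above; below is a minimal sketch of what such a service could look like. The selector label (app: dcgm-exporter) is an assumption and must match whatever labels the DCGM-Exporter pods from dcgm.yml actually carry; the service name and port match the curl command used earlier:

apiVersion: v1
kind: Service
metadata:
  name: dcgm-exporter-svc           # hostname used in the curl command above
  namespace: gpu-monitoring-system
spec:
  type: ClusterIP
  selector:
    app: dcgm-exporter              # assumption: must match the DCGM-Exporter pod labels
  ports:
    - name: metrics
      port: 9400
      targetPort: 9400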
ubuntu.yml
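Again, only a minimal sketch: a throwaway Ubuntu pod in the same namespace that sleeps indefinitely so you can exec into it and curl the service:

apiVersion: v1
kind: Pod
metadata:
  name: ubuntu
  namespace: gpu-monitoring-system
spec:
  containers:
    - name: ubuntu
      image: ubuntu:22.04
      # keep the container alive so that `kubectl exec` into it is possible
      command: ["sleep", "infinity"]
  restartPolicy: Never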