More Training Information Required

by jayan12k - opened 4 days ago

4 days ago

Hello!
Could the authors provide some more information into the training process of this model. I know it already lists how many H100 GPUs were used but some metrics on GPU hours and total cost specifically and other compute measures would be helpful. Thank you!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment