More Training Information Required

#7
by jayan12k - opened

Hello!
Could the authors provide some more information about the training process of this model? I know the number of H100 GPUs used is already listed, but some additional compute metrics, specifically total GPU hours and total cost, would be helpful. Thank you!
