How would this compare training time wise with gradientai/Llama-3-8B-Instruct-Gradient-1048k ?
#1
by
ucalyptus
- opened
Great work.
Just wanted to see wall time comparison with the below model:
https://huggingface.co/gradientai/Llama-3-8B-Instruct-Gradient-1048k