1394210931018715145

May 17, 2021
2/ I'm even more excited by the FAQ of the project:

"Q: Are you serious when you say you are going to train a model comparable to the biggest GPT⁠-⁠3 (175b parameters)?
A: Yes...We expect our final model to be somewhere between 150 and 200b param"

https://t.co/P2LWvL8dpY