Cool model!

#1
by Datdanboi25 - opened

Very cool!

Thanks !!
But tbh I dont think this model is cool , LIke i blundered the model architeture , in making of as deep model as possible.
Though I learnt a lot in the process.

Oh if anything I think the depth was good, maybe ideal would have been more around 9 layers or something, but I think the main issue is the tokenizer being too big, other than that I reckon it looks promising!

Yeah , You are right
I ll do 4096 next time.
I think that will be perfect .

Yeah reinvest the parameters into width, would be great!

Datdanboi25 changed discussion status to closed
Datdanboi25 changed discussion status to open

Also mind if I add it to the leaderboard?

Well .....Yeah add it
Thanks !!

Sign up or log in to comment