This is a merge of several different models I have been playing with personally. The results have been too good not to share the checkpoint with everyone.
Description
float16 version