1. Gpt NeoAn implementation of model parallel GPT2& GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.
3. Gpt NeoxAn implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.
4. Dalle MtfOpen-AI's DALL-E for large scale training in mesh-tensorflow.
6. project-menuSee the issue board for the current status of active and prospective projects!