

Muse
Muse is an API that accesses VLM-4, a set of natively trained large Language Models in French, Italian, Spanish, German, and English.
Megatron NLG
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World’s Largest and Most Powerful Generative Language Model | NVIDIA Technical Blog
Turing-NLG
Microsoft Project Turing
Med-PaLM
AI Powered Medical Imaging
Macaw By AI2
GitHub - allenai/macaw: Multi-angle c(q)uestion answering
Google LaMDA
LaMDA: our breakthrough conversation technology
Jurassic-1 Language Models
A huge language model to rival OpenAI's GPT-3
InstructGPT
Aligning language models to follow instructions
HyperCLOVA
The Future of Virtual Assistants
GPT-Neo
GitHub - EleutherAI/gpt-neo: An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
GPT-2
Better language models and their implications
Google GShard
[2006.16668] GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Google GLaM
More Efficient In-Context Learning with GLaM – Google AI Blog
Google BERT
Open Sourcing BERT: State-of-the-Art Pre-training for Natural Language Processing – Google AI Blog
GLM-130B
GitHub - THUDM/GLM-130B: GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
ERNIE Titan LLM
ERNIE fuses a big GPT-3-esque language model with a large external knowledge base
DistilBERT
A a distilled version of BERT: smaller, faster, cheaper and lighter
DialoGPT
GitHub - microsoft/DialoGPT: Large-scale pretraining for dialogue
DeepMind RETRO
Improving language models by retrieving from trillions of tokens
Gopher By DeepMind
Language modelling at scale: Gopher, ethical considerations, and retrieval
TOP