Best Alternatives to BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (BERT)
1 alternatives ranked by real usage data. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (BERT) scores 23/100 — 1 tool score higher.
* 🏆 2020: [Language Models are Few-Shot Learners (GPT-3)](https://proceedings.neurips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html)
23
1 alternatives
1 free options
1 score higher