BioBERT: a pre-trained biomedical language representation model for biomedical text mining Paper • 1901.08746 • Published Jan 25, 2019 • 3
Pretraining-Based Natural Language Generation for Text Summarization Paper • 1902.09243 • Published Feb 25, 2019 • 2
RoBERTa: A Robustly Optimized BERT Pretraining Approach Paper • 1907.11692 • Published Jul 26, 2019 • 7
DeBERTa: Decoding-enhanced BERT with Disentangled Attention Paper • 2006.03654 • Published Jun 5, 2020 • 3
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing Paper • 2111.09543 • Published Nov 18, 2021 • 2
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 12 days ago • 113
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation Paper • 2102.04664 • Published Feb 9, 2021 • 2