Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published 19 days ago • 62
Bio Series Collection Embeddings and NLG related to biology / amino acid sequences • 10 items • Updated 23 days ago • 1
Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • 25 days ago • 38
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning Paper • 2402.03046 • Published Feb 5 • 4
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Paper • 2402.09844 • Published Feb 15 • 18
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations Paper • 2310.07276 • Published Oct 11, 2023 • 4
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent 27 days ago • 71
Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare about 1 month ago • 64
Biomedical NLP papers Collection Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 108 items • Updated 3 days ago • 23
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Paper • 2402.06619 • Published Feb 9 • 47
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model Paper • 2402.07827 • Published Feb 12 • 43
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 76
Journal Club Collection Candidate papers to read in the H4 journal club • 54 items • Updated 27 days ago • 23
Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models Paper • 2312.02969 • Published Dec 5, 2023 • 11
Table Transformer Collection The Table Transformer (TATR) is a series of object detection models useful for table extraction from PDF images. • 5 items • Updated 10 days ago • 12
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 61 items • Updated 5 days ago • 59