view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x β’ 1 day ago β’ 34
Sailor: Open Language Models for South-East Asia Paper β’ 2404.03608 β’ Published 23 days ago β’ 17
TransformerFAM: Feedback attention is working memory Paper β’ 2404.09173 β’ Published 13 days ago β’ 38
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper β’ 2404.03715 β’ Published 23 days ago β’ 51
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper β’ 2404.00399 β’ Published 28 days ago β’ 39
DIBT Prompt collective SPIN Collection This collection contains resources related to the replication of SPIN with the dibt prompt collective dataset β’ 8 items β’ Updated Mar 12 β’ 7
Awesome Document AI Collection A collection of open-source document AI π π π β’ 27 items β’ Updated Mar 11 β’ 33
Pre-trained LMs ES Collection Monolingual language models pre-trained on Spanish and related languages. β’ 20 items β’ Updated 3 days ago β’ 6
Instruction-Tuned Models ES Collection Instruction-tuned models in Spanish and other related languages β’ 7 items β’ Updated 3 days ago β’ 4
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper β’ 2402.13753 β’ Published Feb 21 β’ 104
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models Paper β’ 2402.13064 β’ Published Feb 20 β’ 45
User-LLM: Efficient LLM Contextualization with User Embeddings Paper β’ 2402.13598 β’ Published Feb 21 β’ 17
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains Paper β’ 2402.05140 β’ Published Feb 6 β’ 18
Instruction-tuned Language Models are Better Knowledge Learners Paper β’ 2402.12847 β’ Published Feb 20 β’ 25
OLMo Suite Collection Artifacts for the first set of OLMo models. β’ 12 items β’ Updated 4 days ago β’ 34
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss Paper β’ 2402.10790 β’ Published Feb 16 β’ 39
AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts Paper β’ 2402.07625 β’ Published Feb 12 β’ 10
datasets-SPIN Collection Generated synthetic data used to finetune SPIN. β’ 8 items β’ Updated Feb 9 β’ 10
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback Paper β’ 2402.01391 β’ Published Feb 2 β’ 41
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception Paper β’ 2401.16158 β’ Published Jan 29 β’ 15
The Rise and Potential of Large Language Model Based Agents: A Survey Paper β’ 2309.07864 β’ Published Sep 14, 2023 β’ 5
Canonical models Collection This collection lists all the historical (pre-"Hub") canonical model checkpoints, i.e. repos that were not under an org or user namespace β’ 68 items β’ Updated Feb 13 β’ 13
Improving Text Embeddings with Large Language Models Paper β’ 2401.00368 β’ Published Dec 31, 2023 β’ 72
haiku Collection πΈ This is a collection of synthetic datasets built to help improve the ability of open language models to better write haikus through the use of DPO β’ 3 items β’ Updated Jan 16 β’ 4
Comparing DPO with IPO and KTO Collection A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. β’ 56 items β’ Updated Jan 9 β’ 31
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization Paper β’ 2308.14469 β’ Published Aug 28, 2023 β’ 6
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones Paper β’ 2312.16862 β’ Published Dec 28, 2023 β’ 28
π Mamba fine-tuned models Collection A collection with ClibrAIn's Mamba fine-tuned models β’ 3 items β’ Updated Dec 18, 2023 β’ 11
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. β’ 43 items β’ Updated 15 days ago β’ 87
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers Paper β’ 2311.17136 β’ Published Nov 28, 2023 β’ 7
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Paper β’ 2311.13231 β’ Published Nov 22, 2023 β’ 25
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook β’ 9 items β’ Updated 15 days ago β’ 134
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models Paper β’ 2310.08659 β’ Published Oct 12, 2023 β’ 19
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing Paper β’ 2305.11738 β’ Published May 19, 2023 β’ 3
Offline Prompt Evaluation and Optimization with Inverse Reinforcement Learning Paper β’ 2309.06553 β’ Published Sep 13, 2023 β’ 4
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages Paper β’ 2309.09400 β’ Published Sep 17, 2023 β’ 77
Orca: Progressive Learning from Complex Explanation Traces of GPT-4 Paper β’ 2306.02707 β’ Published Jun 5, 2023 β’ 45