Ai-models - a Yoai Collection

Yoai 's Collections

Agents

Agent-Cognition

Ai-models

updated 12 days ago

Ultra-Long Sequence Distributed Transformer

Paper • 2311.02382 • Published Nov 4, 2023 • 2
Ziya2: Data-centric Learning is All LLMs Need

Paper • 2311.03301 • Published Nov 6, 2023 • 16
Relax: Composable Abstractions for End-to-End Dynamic Machine Learning

Paper • 2311.02103 • Published Nov 1, 2023 • 15
Extending Context Window of Large Language Models via Semantic Compression

Paper • 2312.09571 • Published Dec 15, 2023 • 12
Secrets of RLHF in Large Language Models Part II: Reward Modeling

Paper • 2401.06080 • Published Jan 11 • 23
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11 • 35
xDAN-AI/xDAN-L1-Chat-RL-v1

Text Generation • Updated Dec 29, 2023 • 3.28k • 63
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows

Paper • 2402.10379 • Published Feb 16 • 27
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting

Paper • 2402.13720 • Published Feb 21 • 4
PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15 • 55
Larimar: Large Language Models with Episodic Memory Control

Paper • 2403.11901 • Published Mar 18 • 30
Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 45
ZigMa: Zigzag Mamba Diffusion Model

Paper • 2403.13802 • Published Mar 20 • 16
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20 • 57
Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22 • 30
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22 • 24
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Paper • 2403.15246 • Published Mar 22 • 8
Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11 • 80
Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 55
Capabilities of Gemini Models in Medicine

Paper • 2404.18416 • Published about 1 month ago • 21
Many-Shot In-Context Learning in Multimodal Foundation Models

Paper • 2405.09798 • Published 13 days ago • 25