Taylor658 (atayloraerospace)

upvoted 8 papers about 21 hours ago

Nearest Neighbor Speculative Decoding for LLM Generation and Attribution

Paper • 2405.19325 • Published 3 days ago • 10

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Paper • 2405.19332 • Published 3 days ago • 9

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

Paper • 2405.19320 • Published 3 days ago • 6

upvoted a collection about 21 hours ago

em🍞ing series

Collection

crispy sentence embedding family • 3 items • Updated Mar 28 • 17

upvoted a paper 9 days ago

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Paper • 2404.16375 • Published Apr 25 • 16

upvoted an article 10 days ago

Article

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

By

•

11 days ago

• 23

upvoted a collection 12 days ago

Critique Models (CM) on the 🤗 Hub

Collection

This collection contains some Critique Models (CM) for LLM evaluation available in the HuggingFace Hub • 5 items • Updated 25 days ago • 3

upvoted an article 12 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

19 days ago

• 131

upvoted 2 collections 12 days ago

CommonCanvas

Collection

Collection of models trained on the CommonCatalogue datasets • 8 items • Updated 16 days ago • 6

MAmmoTH2

Collection

Scaling up instruction data from the web for to build better LLMs • 11 items • Updated 6 days ago • 6

upvoted a collection 15 days ago

Blackhole

Collection

A black hole with lots of high-quality dialogue datasets in many fields, and multilingual helps to train LLMs with SFT and DPO methods easier. • 32 items • Updated 8 days ago • 6

upvoted 7 papers 16 days ago

SpeechVerse: A Large-scale Generalizable Audio Language Model

Paper • 2405.08295 • Published 19 days ago • 10

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

Paper • 2405.08317 • Published 19 days ago • 8

What matters when building vision-language models?

Paper • 2405.02246 • Published 29 days ago • 87

No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding

Paper • 2405.08344 • Published 19 days ago • 10

Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

Paper • 2405.08054 • Published 19 days ago • 19

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

Paper • 2405.09546 • Published 17 days ago • 9

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Paper • 2401.15914 • Published Jan 29 • 7

upvoted 2 collections 16 days ago

Diffusion All

Collection

5 items • Updated Mar 8 • 4

Berkeley Function-Calling Leaderboard

Collection

2 items • Updated Apr 5 • 3

upvoted an article 16 days ago

Article

Vision Language Models Explained

Apr 11

• 92

upvoted 3 collections 17 days ago

Chronos Models

Collection

Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. • 6 items • Updated Mar 18 • 25

📦 3D creation workflow

Collection

Going from a text prompt to a nice 3D model • 3 items • Updated Feb 6 • 23

🐒 Stable Diffusion LoRAs

Collection

Awesome LoRAs found on the hub - using only 🐵 • 7 items • Updated Feb 6 • 14

upvoted an article 19 days ago

Article

Train custom AI models with the trainer API and adapt them to 🤗

By

•

7 days ago

• 21

upvoted 4 papers 19 days ago

Transferable and Principled Efficiency for Open-Vocabulary Segmentation

Paper • 2404.07448 • Published Apr 11 • 10

SAGS: Structure-Aware 3D Gaussian Splatting

Paper • 2404.19149 • Published Apr 29 • 12

DOCCI: Descriptions of Connected and Contrasting Images

Paper • 2404.19753 • Published Apr 30 • 9

Trajectory Consistency Distillation

Paper • 2402.19159 • Published Feb 29 • 13

upvoted a collection 19 days ago

Transcription

Collection

Transcribe interviews for free with Whisper in Spaces. • 5 items • Updated Apr 23 • 3

upvoted 2 collections 20 days ago

Yi-1.5 (2024/05)

Collection

10 items • Updated 13 days ago • 76

AnyLLM-Pro

Collection

6 items • Updated Feb 27 • 4

upvoted an article 20 days ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

Jan 18

• 20

upvoted a paper 20 days ago

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 55

upvoted a collection 20 days ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 29 items • Updated 2 days ago • 181

upvoted 2 papers 20 days ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 33

Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap

Paper • 2402.19450 • Published Feb 29 • 3

upvoted an article 20 days ago

Article

Constitutional AI with Open LLMs

Feb 1

• 5

upvoted a collection 20 days ago

LLM Spaces

Collection

132 items • Updated 1 day ago • 11

upvoted a collection 21 days ago

cool datasets

Collection

81 items • Updated 22 days ago • 8

upvoted 4 papers 21 days ago

You Only Cache Once: Decoder-Decoder Architectures for Language Models

Paper • 2405.05254 • Published 24 days ago • 8

When Do We Not Need Larger Vision Models?

Paper • 2403.13043 • Published Mar 19 • 24

Gemma: Open Models Based on Gemini Research and Technology

Paper • 2403.08295 • Published Mar 13 • 43

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21 • 26

upvoted 4 articles 21 days ago

Article

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

30 days ago

• 13

Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

By

•

Apr 29

• 27

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

24 days ago

• 7

Article

Improving Prompt Consistency with Structured Generations

Apr 30

• 46

upvoted a paper 21 days ago

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25 • 52

upvoted 3 collections 21 days ago

Aya Datasets

Collection

The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 4 items • Updated 9 days ago • 9

C4AI Command R

Collection

C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 3 items • Updated 9 days ago • 12

C4AI Command R Plus

Collection

C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities. • 3 items • Updated 9 days ago • 18

upvoted an article 21 days ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

By

•

Apr 24

• 48

upvoted 2 papers 21 days ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published about 1 month ago • 102

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published about 1 month ago • 53

atayloraerospace PRO

AI & ML interests

Organizations

Taylor658's activity

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Vision Language Models Explained

Train custom AI models with the trainer API and adapt them to 🤗

Preference Tuning LLMs with Direct Preference Optimization Methods

Constitutional AI with Open LLMs

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Improving Prompt Consistency with Structured Generations

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)