Taylor658 (atayloraerospace)

upvoted a paper 5 days ago

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Paper • 2404.16375 • Published Apr 25 • 15

upvoted an article 7 days ago

Article

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

By

•

8 days ago

• 22

upvoted a collection 8 days ago

Critique Models (CM) on the 🤗 Hub

Collection

This collection contains some Critique Models (CM) for LLM evaluation available in the HuggingFace Hub • 5 items • Updated 22 days ago • 3

upvoted an article 8 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

15 days ago

• 126

upvoted 2 collections 8 days ago

CommonCanvas

Collection

Collection of models trained on the CommonCatalogue datasets • 8 items • Updated 13 days ago • 5

MAmmoTH2

Collection

Scaling up instruction data from the web for to build better LLMs • 11 items • Updated 3 days ago • 6

upvoted a collection 12 days ago

Blackhole

Collection

A black hole with lots of high-quality dialogue datasets in many fields, and multilingual helps to train LLMs with SFT and DPO methods easier. • 32 items • Updated 5 days ago • 6

upvoted 7 papers 13 days ago

SpeechVerse: A Large-scale Generalizable Audio Language Model

Paper • 2405.08295 • Published 15 days ago • 10

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

Paper • 2405.08317 • Published 15 days ago • 8

What matters when building vision-language models?

Paper • 2405.02246 • Published 26 days ago • 87

No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding

Paper • 2405.08344 • Published 15 days ago • 10

Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

Paper • 2405.08054 • Published 16 days ago • 19

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

Paper • 2405.09546 • Published 14 days ago • 9

Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization

Paper • 2401.15914 • Published Jan 29 • 7

upvoted 2 collections 13 days ago

Diffusion All

Collection

5 items • Updated Mar 8 • 4

Berkeley Function-Calling Leaderboard

Collection

2 items • Updated Apr 5 • 3

upvoted an article 13 days ago

Article

Vision Language Models Explained

Apr 11

• 89

upvoted 3 collections 14 days ago

upvoted an article 15 days ago

Article

Train custom AI models with the trainer API and adapt them to 🤗

By

•

4 days ago

• 19

upvoted 4 papers 16 days ago

Transferable and Principled Efficiency for Open-Vocabulary Segmentation

Paper • 2404.07448 • Published Apr 11 • 10

SAGS: Structure-Aware 3D Gaussian Splatting

Paper • 2404.19149 • Published 29 days ago • 12

DOCCI: Descriptions of Connected and Contrasting Images

Paper • 2404.19753 • Published 29 days ago • 9

Trajectory Consistency Distillation

Paper • 2402.19159 • Published Feb 29 • 13

upvoted 3 collections 16 days ago

Transcription

Collection

Transcribe interviews for free with Whisper in Spaces. • 5 items • Updated Apr 23 • 3

Yi-1.5 (2024/05)

Collection

10 items • Updated 9 days ago • 75

AnyLLM-Pro

Collection

6 items • Updated Feb 27 • 4

upvoted an article 16 days ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

Jan 18

• 19

upvoted a paper 16 days ago

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 55

upvoted a collection 16 days ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 28 items • Updated Mar 23 • 181

upvoted 2 papers 16 days ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 33

Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap

Paper • 2402.19450 • Published Feb 29 • 3

upvoted an article 16 days ago

Article

Constitutional AI with Open LLMs

Feb 1

• 5

upvoted a collection 16 days ago

LLM Spaces

Collection

130 items • Updated 1 day ago • 11

upvoted a collection 17 days ago

cool datasets

Collection

81 items • Updated 19 days ago • 8

upvoted 4 papers 17 days ago

You Only Cache Once: Decoder-Decoder Architectures for Language Models

Paper • 2405.05254 • Published 21 days ago • 8

When Do We Not Need Larger Vision Models?

Paper • 2403.13043 • Published Mar 19 • 24

Gemma: Open Models Based on Gemini Research and Technology

Paper • 2403.08295 • Published Mar 13 • 43

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21 • 25

upvoted 4 articles 17 days ago

Article

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

26 days ago

• 13

Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

By

•

30 days ago

• 27

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

20 days ago

• 7

Article

Improving Prompt Consistency with Structured Generations

29 days ago

• 46

upvoted a paper 17 days ago

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25 • 51

upvoted 3 collections 17 days ago

Aya Datasets

Collection

The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 4 items • Updated 6 days ago • 8

C4AI Command R

Collection

C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 3 items • Updated 6 days ago • 11

C4AI Command R Plus

Collection

C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities. • 3 items • Updated 6 days ago • 17

upvoted an article 17 days ago

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

By

•

Apr 24

• 48

upvoted 3 papers 17 days ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published 27 days ago • 101

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published 27 days ago • 53

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published about 1 month ago • 114

upvoted a collection 26 days ago

Biomedical NLP papers

Collection

Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) • 110 items • Updated 7 days ago • 24

upvoted a paper 26 days ago

Capabilities of Gemini Models in Medicine

Paper • 2404.18416 • Published about 1 month ago • 21

upvoted a collection 4 months ago

OLMo Suite

Collection

Artifacts for the first set of OLMo models. • 12 items • Updated 14 days ago • 36

atayloraerospace PRO

AI & ML interests

Organizations

Taylor658's activity

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Vision Language Models Explained

Train custom AI models with the trainer API and adapt them to 🤗

Preference Tuning LLMs with Direct Preference Optimization Methods

Constitutional AI with Open LLMs

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Improving Prompt Consistency with Structured Generations

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)