Hui Sun's picture

Hui Sun

CocoSun

·

AI & ML interests

None yet

Organizations

CocoSun's activity

upvoted an article 5 days ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

6 days ago

• 63

upvoted an article 6 days ago

Article

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages

10 days ago

• 14

upvoted an article 10 days ago

Article

Let's talk about LLM evaluation

By

•

10 days ago

• 82

upvoted a collection 11 days ago

PaliGemma FT Models

108 items • Updated 19 days ago • 17

upvoted 2 articles 19 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

20 days ago

• 132

Article

Hugging Face x LangChain : A new partner package in LangChain

20 days ago

• 71

upvoted an article 29 days ago

Article

seemore: Implement a Vision Language Model from Scratch

By

•

21 days ago

• 44

upvoted 2 collections about 1 month ago

OpenELM Pretrained Models

4 items • Updated Apr 23 • 38

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 22 items • Updated 3 days ago • 301

upvoted an article about 1 month ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 70

upvoted a paper about 2 months ago

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Paper • 2404.08197 • Published Apr 12 • 26

upvoted 4 papers 2 months ago

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Paper • 2403.18421 • Published Mar 27 • 21

ViTAR: Vision Transformer with Any Resolution

Paper • 2403.18361 • Published Mar 27 • 48

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26 • 25

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

Paper • 2403.10516 • Published Mar 15 • 15

upvoted 3 papers 3 months ago

SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?

Paper • 2402.01832 • Published Feb 2 • 4

Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings

Paper • 2403.07750 • Published Mar 12 • 19

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27 • 87

upvoted a collection 4 months ago

Sora Reference Papers

A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Feb 20 • 50

upvoted 2 papers 4 months ago

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset

Paper • 2402.05937 • Published Feb 8 • 8

CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

Paper • 2401.12208 • Published Jan 22 • 20

upvoted 2 papers 5 months ago

Scalable Pre-training of Large Autoregressive Image Models

Paper • 2401.08541 • Published Jan 16 • 35

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 131

upvoted 4 papers 6 months ago

Visual In-Context Prompting

Paper • 2311.13601 • Published Nov 22, 2023 • 14

ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs

Paper • 2311.13600 • Published Nov 22, 2023 • 41

Towards Accurate Differential Diagnosis with Large Language Models

Paper • 2312.00164 • Published Nov 30, 2023 • 8

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

Paper • 2311.16079 • Published Nov 27, 2023 • 18

upvoted a paper 7 months ago

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Paper • 2306.16527 • Published Jun 21, 2023 • 42

upvoted a paper 9 months ago

Nougat: Neural Optical Understanding for Academic Documents

Paper • 2308.13418 • Published Aug 25, 2023 • 33

upvoted a paper 10 months ago

TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Paper • 2305.07759 • Published May 12, 2023 • 29

upvoted a paper 11 months ago

Dynamic-Resolution Model Learning for Object Pile Manipulation

Paper • 2306.16700 • Published Jun 29, 2023 • 5