raincandy_U's picture

raincandy_U

raincandy-u

·

AI & ML interests

幻覚。

Recent Activity

liked a dataset 1 day ago

open-thoughts/OpenThoughts2-1M

liked a dataset 1 day ago

HuggingFaceFW/fineweb-2

liked a model 1 day ago

google/gemma-3-4b-it

View all activity

Organizations

raincandy-u's activity

upvoted 3 papers 10 months ago

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27, 2024 • 11

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 55

LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models

Paper • 2405.18377 • Published May 28, 2024 • 21

upvoted a collection 11 months ago

Mini Pretrain Datasets

9 items • Updated Jul 9, 2024 • 9

upvoted 2 papers 11 months ago

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Paper • 2405.08707 • Published May 14, 2024 • 33

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

upvoted 2 collections 12 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 1 day ago • 566

Llamafied Models

This is a collection of llamafied models - such as Qwen. • 5 items • Updated Apr 19, 2024 • 1