Aryanne

AI & ML interests

LLMs, AI, GPU/CPU poor, any help is welcome 0x190ac445974a989a87dd223f212a76ca0090c804

Recent Activity

liked a Space 23 days ago

ByteDance/InfiniteYou-FLUX

updated a Space 30 days ago

Aryanne/Another_Fractal_Generator

liked a model 2 months ago

PygmalionAI/Pygmalion-3-12B

Aryanne's activity

upvoted a collection 5 months ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 2 days ago • 90

upvoted a paper 6 months ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 52

upvoted a paper 7 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 52

upvoted 2 collections 7 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 596

Llama3-8B-1.58

A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14, 2024 • 11

upvoted an article 7 months ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • Sep 18, 2024 • 229

upvoted an article 8 months ago

Introduction to ggml

Article • Aug 13, 2024 • 184

upvoted a collection 10 months ago

MatMulfree LM

Pre-trained models for Matmulfree LM. • 4 items • Updated Jun 10, 2024 • 25

upvoted 7 papers about 1 year ago

Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute

Paper • 2401.00711 • Published Jan 1, 2024 • 2

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20, 2024 • 20

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 78

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 65

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 614

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26, 2024 • 74

Universal Neurons in GPT2 Language Models

Paper • 2401.12181 • Published Jan 22, 2024 • 5

upvoted a collection about 1 year ago

Testing Might be broken

Testing-only models • 10 items • Updated Feb 3, 2024 • 2

upvoted a paper over 1 year ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

upvoted 2 collections over 1 year ago

Merged Models

Using mergekit • 10 items • Updated Mar 1, 2024 • 3

StableLM (.gguf)

Models based on StableLM Models by Stability AI • 19 items • Updated Nov 27, 2023 • 3