SeeMoE: Implementing a MoE Vision Language Model from Scratch Article • By AviSoori1x • 12 days ago • 24
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6 • 61
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding Paper • 2401.12954 • Published Jan 23 • 28
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 253
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 23