Emil Zakirov PRO

Emil-Zakirov

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

marcelbinz/Llama-3.1-Centaur-70B-adapter:How can this be accessed for research without GPUs on hand?

liked a model 2 months ago

marcelbinz/Llama-3.1-Centaur-70B-adapter

upvoted a paper 4 months ago

Kolmogorov-Arnold Transformer

Organizations

None yet

Emil-Zakirov's activity

New activity in marcelbinz/Llama-3.1-Centaur-70B-adapter about 2 months ago

How can this be accessed for research without GPUs on hand?

#7 opened 3 months ago by

BiasedByBytes

liked a model 2 months ago

marcelbinz/Llama-3.1-Centaur-70B-adapter

Updated Nov 18, 2024 • 109

upvoted a paper 4 months ago

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16, 2024 • 40

upvoted a collection 4 months ago

MatMulfree LM

Collection

Pre-trained models for Matmulfree LM. • 4 items • Updated Jun 10, 2024 • 25

upvoted a paper 6 months ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 57

upvoted 2 papers 7 months ago

Transformers meet Neural Algorithmic Reasoners

Paper • 2406.09308 • Published Jun 13, 2024 • 43

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published Jun 4, 2024 • 37

upvoted 2 papers 10 months ago

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

Paper • 2403.06504 • Published Mar 11, 2024 • 53

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Paper • 2403.07508 • Published Mar 12, 2024 • 74

upvoted a paper 12 months ago

LongAlign: A Recipe for Long Context Alignment of Large Language Models

Paper • 2401.18058 • Published Jan 31, 2024 • 20

liked a model about 1 year ago

cognitivecomputations/dolphin-2.5-mixtral-8x7b

Text Generation • Updated May 21, 2024 • 14.5k • 1.22k

upvoted a paper about 1 year ago

AppAgent: Multimodal Agents as Smartphone Users

Paper • 2312.13771 • Published Dec 21, 2023 • 52

liked a model over 1 year ago

mistralai/Mistral-7B-v0.1

Text Generation • Updated Jul 24, 2024 • 3.54M • 3.53k

New activity in AIWaves/Debate over 1 year ago

I don't understand how to run your code on a computer.

#2 opened over 1 year ago by

Emil-Zakirov

upvoted 2 papers over 1 year ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 80

LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models

Paper • 2308.16137 • Published Aug 30, 2023 • 39

commented a paper over 1 year ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 80