- Attention Is All You Need
  Paper • 1706.03762 • Published • 49
- LLaMA: Open and Efficient Foundation Language Models
  Paper • 2302.13971 • Published • 13
- Efficient Tool Use with Chain-of-Abstraction Reasoning
  Paper • 2401.17464 • Published • 16
- MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts
  Paper • 2407.21770 • Published • 22
Justin PRO
jxtngx
AI & ML interests
None yet
Recent Activity
updated a collection 7 days ago: Meta papers
updated a collection 7 days ago: LM papers
upvoted a collection 12 days ago: Llama 3.3
Organizations
Collections 15
models 16
jxtngx/Nemotron-Mini-4B-Instruct-Q4_K_M-GGUF • Updated • 3
jxtngx/Meta-Llama-3.2-3B-Instruct-Q4_K_M-GGUF • Text Generation • Updated • 29
jxtngx/Llama-3.2-3B-Q4_K_M-GGUF • Text Generation • Updated • 5
jxtngx/Meta-Llama-3.2-1B-Instruct-Q4_K_M-GGUF • Text Generation • Updated • 8
jxtngx/Meta-Llama-3.2-1B-Q4_K_M-GGUF • Text Generation • Updated • 82 • 2
jxtngx/Llama-3.1-Minitron-4B-Width-Base-Q4_K_M-GGUF • Updated • 11
jxtngx/Meta-Llama-3.1-8B-Q4_K_M-GGUF • Text Generation • Updated • 3
jxtngx/Meta-Llama-3.1-8B-Instruct-Q4_K_M-GGUF • Text Generation • Updated • 5
jxtngx/Meta-Llama-3.1-8B-Q8_0-GGUF • Text Generation • Updated • 5
jxtngx/Meta-Llama-3.1-8B-Instruct-Q8_0-GGUF • Text Generation • Updated • 5
datasets
None public yet