-
Attention Is All You Need
Paper • 1706.03762 • Published • 50 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 16 -
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Paper • 1907.11692 • Published • 7 -
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper • 1910.01108 • Published • 14
Taufiq Dwi Purnomo
taufiqdp
AI & ML interests
SLM, VLM
Recent Activity
updated
a model
38 minutes ago
taufiqdp/convnext-arutala-v2
upvoted
a
paper
about 4 hours ago
TransMLA: Multi-head Latent Attention Is All You Need
liked
a model
about 22 hours ago
agentica-org/DeepScaleR-1.5B-Preview
Organizations
Collections
1
spaces
7
Running
on
Zero
2
PaliGemma
🚀
Lightweight open vision-language model
Running
on
Zero
4
FLUX
🖼
Generate images from text prompts
Running
on
Zero
4
Phi 3 Mini 128k Instruct
📊
Phi-3, a family of open AI models developed by Microsoft.
Paused
Convert to Safetensors
🐶
Paused
gemma-1.1-7b-it
👑
Paused
Sentence Fix
📖
models
9

taufiqdp/convnext-arutala-v2
Image Classification
•
Updated
•
73

taufiqdp/convnext_tiny-arutala
Image Classification
•
Updated
•
134

taufiqdp/mobilenetv4_conv_medium.e500_r256_in1k-emotion
Image Classification
•
Updated
•
25

taufiqdp/mobilenetv4_conv_small.e2400_r224_in1k_nsfw_classifier
Image Classification
•
Updated
•
37
•
1

taufiqdp/convnext-eurosat
Image Classification
•
Updated
•
13

taufiqdp/train-checkpoint
Updated

taufiqdp/gemma-2b-q8_0-gguf
Updated
•
4

taufiqdp/stablelm-2-1_6b-indo-lora
Updated
•
2

taufiqdp/indonesian-sentiment
Text Classification
•
Updated
•
183
•
1