Dr. Joao Paulo Schwarz Schuler PRO
schuler
AI & ML interests
artificial intelligence
Recent Activity
replied to
their
post
1 day ago
š¢ New Research Alert: Making Language Models Smaller & Smarter!
Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance.
The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena.
š Key Findings:
ā¢ 77% parameter reduction.
ā¢ Maintained model capabilities.
ā¢ Improved generalization.
Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm
updated
a model
2 days ago
schuler/experimental-JP47D54
updated
a model
2 days ago
schuler/experimental-JP47D54B
Organizations
None yet
schuler's activity
submission issues
7
#1038 opened 2 months ago
by
pszemraj
use_remote_code=True
#27 opened 2 months ago
by
schuler
Gradio chatbot with AutoModelForCausalLM, AutoTokenizer and @spaces.GPU() (ZeroGPU space)
1
#3 opened 3 months ago
by
schuler
Source of the data?
#2 opened 3 months ago
by
schuler
Source of the data?
#1 opened 3 months ago
by
schuler
How to load the model ?
4
#7 opened 12 months ago
by
schuler
How to load the model ?
4
#7 opened 12 months ago
by
schuler
How to load the model ?
4
#7 opened 12 months ago
by
schuler