teleshop

prenes
ยท

AI & ML interests

None yet

Recent Activity

View all activity

Organizations

None yet

prenes's activity

reacted to schuler's post with ๐Ÿš€๐Ÿ”ฅ๐Ÿ‘ about 1 month ago
view post
Post
7226
๐Ÿ“ข New Research Alert: Making Language Models Smaller & Smarter!

Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance.

The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena.

๐Ÿ”‘ Key Findings:
โ€ข 77% parameter reduction.
โ€ข Maintained model capabilities.
โ€ข Improved generalization.

Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm
  • 2 replies
ยท
upvoted an article about 1 month ago
view article
Article

Open-source DeepResearch โ€“ Freeing our search agents

โ€ข 1.16k