Derrick Mwiti
mwitiderrick
AI & ML interests
None yet
Articles
Organizations
mwitiderrick's activity
Article
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
•
21
Article
Putting RL back in RLHF
•
53
Article
Welcome Gemma 2 - Google's new open LLM
•
88
Article
Proximal Policy Optimization (PPO)
•
6
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1671174218273-61fa23acaff317f6566c4d96.png)
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60466e4b4f40b01b66151416/OSA7VIz8CTnlKb72IdfMM.png)
upvoted
a
collection
24 days ago
Article
Introducing Idefics2: A Powerful 8B Vision-Language Model for the community
•
146
Article
A Dive into Pretraining Strategies for Vision-Language Models
•
30
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1671174218273-61fa23acaff317f6566c4d96.png)
upvoted
an
article
27 days ago
Article
Vision Language Models Explained
•
121
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1671174218273-61fa23acaff317f6566c4d96.png)
upvoted
a
paper
3 months ago
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1671174218273-61fa23acaff317f6566c4d96.png)
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60466e4b4f40b01b66151416/OSA7VIz8CTnlKb72IdfMM.png)
upvoted
a
collection
6 months ago
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1671174218273-61fa23acaff317f6566c4d96.png)
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5df7e9e5da6d0311fd3d53f9/j21QZzv9_PGPUH5FbUaeM.png)
upvoted
a
collection
9 months ago
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1671174218273-61fa23acaff317f6566c4d96.png)
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60466e4b4f40b01b66151416/OSA7VIz8CTnlKb72IdfMM.png)
upvoted
a
collection
9 months ago