Shashi Kumar Nagulakonda's picture

79 52

Shashi Kumar Nagulakonda PRO

iamsingularity

·

https://www.linkedin.com/in/sashikn

AI & ML interests

Generative AI, GPT, LLMs, SLMs, RAG, Fine-tuning, Chatbots, Agents

Recent Activity

upvoted an article 12 days ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

liked a dataset 12 days ago

Anthropic/hh-rlhf

liked a Space 12 days ago

lmarena-ai/chatbot-arena-leaderboard

View all activity

Organizations

iamsingularity's activity

upvoted an article 12 days ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 215

liked a dataset 12 days ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 12.9k • 1.31k

liked a Space 12 days ago

Chatbot Arena Leaderboard

Display chatbot performance leaderboard

liked a dataset 12 days ago

lmsys/chatbot_arena_conversations

Viewer • Updated Sep 30, 2023 • 33k • 1.65k • 377

upvoted a paper 12 days ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 55

liked a model 12 days ago

shawhin/Qwen2.5-0.5B-DPO

Text Generation • Updated 29 days ago • 50 • 1

liked a dataset 12 days ago

shawhin/youtube-titles-dpo

Viewer • Updated 29 days ago • 1.14k • 241 • 1

liked 2 Spaces 13 days ago

Synthetic Data Generator

Build datasets using natural language

Argilla Space

upvoted a paper about 2 months ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published Feb 3 • 24

upvoted a collection 2 months ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

upvoted an article 2 months ago

Article

Transformers.js v3: WebGPU support, new models & tasks, and more…

Oct 22, 2024

• 72

upvoted a paper 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 367

reacted to andito's post with ❤️🔥 4 months ago

Post

3380

Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and tokens throughputs.

- SmolVLM generates tokens 7.5 to 16 times faster than Qwen2-VL! 🤯
- Other models at this size crash a laptop, but SmolVLM comfortably generates 17 tokens/sec on a macbook! 🚀
- SmolVLM can be fine-tuned on a Google collab! Or process millions of documents with a consumer GPU!
- SmolVLM even outperforms larger models in video benchmarks, despite not even being trained on videos!

Check out more!
Demo: HuggingFaceTB/SmolVLM
Blog: https://huggingface.co/blog/smolvlm
Model: HuggingFaceTB/SmolVLM-Instruct
Fine-tuning script: https://github.com/huggingface/smollm/blob/main/finetuning/Smol_VLM_FT.ipynb

upvoted an article 6 months ago

Article

Introducing the Open FinLLM Leaderboard

Oct 4, 2024

• 76

upvoted a paper 6 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

updated a collection 6 months ago

Favorites

8 items • Updated Sep 25, 2024

liked a model 7 months ago

ProsusAI/finbert

Text Classification • Updated May 23, 2023 • 2.82M • • 839

updated a collection 9 months ago

Leaderboards

1 item • Updated Jul 15, 2024