Kiran Kamble

kiranr

ki6an

AI & ML interests

nlp,llm

Recent Activity

reacted to wassemgtk's post with 🔥 about 1 month ago

# GESAL: Real-Time Adaptation for LLMs

We’re excited to unveil **Graph-Enhanced Singular Adaptive Learning (GESAL)**, a framework that lets LLMs like `meta-llama/Llama-3.2-1B` adapt in real time using user feedback. Check out the code and white paper on GitHub!

🔗 **Code**: [https://github.com/writer/AI-Adaptive-Learning-GESAL](https://github.com/writer/AI-Adaptive-Learning-GESAL)

---

## Why GESAL?

Static LLMs struggle to adapt without heavy retraining. GESAL solves this with:

- **SVF**: Adapts weights via \( W' = U (\Sigma \cdot z) V^T \), using few parameters.
- **Graph Memory**: Stores adaptations in nodes for scalability.
- **RL**: Updates via \( J(z) = \mathbb{E}[\log \pi_z(y|x) r] \) based on feedback.

---

## How It Works

Ask "How many R’s in ‘strawberry’?" If it says "2" and you say "no," GESAL learns to say "3" next time, avoiding repeats.

---

## Try It

Built with Hugging Face’s `transformers`:

```bash
pip install transformers torch numpy
python "Adaptive_Learning_(GESAL).py"
```

Needs a Hugging Face token for Llama-3.2-1B.

---

## Results

GESAL hits 95% accuracy after 5 rounds of feedback vs. LoRA’s 70%. It’s efficient (~0.5M params) and scalable.
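The two formulas quoted in the post, \( W' = U (\Sigma \cdot z) V^T \) and \( J(z) = \mathbb{E}[\log \pi_z(y|x) r] \), can be illustrated with a small NumPy sketch. This is not the GESAL code from the linked repository. It assumes a toy setup: a single random weight matrix, a softmax policy over its output logits, and a one-sample REINFORCE-style update. The helper names (`svf_weight`, `reinforce_step`), the learning rate, and the ±1 rewards are hypothetical choices for illustration, and graph memory is left out entirely.

```python
import numpy as np

def svf_weight(U, sigma, Vt, z):
    """Rebuild W' = U @ diag(sigma * z) @ Vt; only the vector z is trainable."""
    return (U * (sigma * z)) @ Vt

def log_softmax(logits):
    shifted = logits - logits.max()          # subtract max for numerical stability
    return shifted - np.log(np.exp(shifted).sum())

def log_prob(U, sigma, Vt, z, x, y):
    """log pi_z(y | x) for a toy policy whose logits are W'(z) @ x."""
    return log_softmax(svf_weight(U, sigma, Vt, z) @ x)[y]

def reinforce_step(U, sigma, Vt, z, x, y, reward, lr=0.01):
    """One-sample gradient ascent on J(z) = E[log pi_z(y|x) * r]."""
    probs = np.exp(log_softmax(svf_weight(U, sigma, Vt, z) @ x))
    one_hot = np.zeros_like(probs)
    one_hot[y] = 1.0
    # Column j of the Jacobian d(logits)/dz is U[:, j] * sigma[j] * (Vt @ x)[j].
    jac = U * (sigma * (Vt @ x))
    grad_log_pi = jac.T @ (one_hot - probs)
    return z + lr * reward * grad_log_pi

# Toy data (assumption): a 6x4 "layer", a unit-norm input, and answer index 2.
rng = np.random.default_rng(0)
W = rng.standard_normal((6, 4))
U, sigma, Vt = np.linalg.svd(W, full_matrices=False)
z0 = np.ones(4)                              # z = 1 reproduces the original W
x = rng.standard_normal(4)
x /= np.linalg.norm(x)
y = 2

z_pos = reinforce_step(U, sigma, Vt, z0, x, y, reward=+1.0)  # user accepts answer y
z_neg = reinforce_step(U, sigma, Vt, z0, x, y, reward=-1.0)  # user says "no"
```

In this toy version the trainable state is just the four entries of `z`, one per singular value, which is the sense in which the adaptation uses few parameters. A positive reward nudges `z` so that answer `y` becomes more likely for input `x`, and a negative reward makes it less likely.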

new activity about 1 month ago

Writer/palmyra-large:Adding `safetensors` variant of this model

authored a paper about 2 months ago

Expect the Unexpected: FailSafe Long Context QA for Finance

kiranr's activity

upvoted a paper about 2 months ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 131

upvoted 2 papers 2 months ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26 • 63

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 69

upvoted a paper 7 months ago

Writing in the Margins: Better Inference Pattern for Long Context Retrieval

Paper • 2408.14906 • Published Aug 27, 2024 • 142

upvoted 3 papers 8 months ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 60

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 42

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 58

upvoted a collection 9 months ago

DCLM

Collection

DCLM Models + Datasets • 6 items • Updated Oct 4, 2024 • 25

upvoted 6 papers about 1 year ago

ReALM: Reference Resolution As Language Modeling

Paper • 2403.20329 • Published Mar 29, 2024 • 21

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 81

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Paper • 2402.11131 • Published Feb 16, 2024 • 43

BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Paper • 2402.04291 • Published Feb 6, 2024 • 50

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 114

Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

Paper • 2402.04248 • Published Feb 6, 2024 • 32

upvoted a collection about 1 year ago

Papers about model merging

Collection

referenced in the mergekit repo: https://github.com/cg123/mergekit • 4 items • Updated Feb 13, 2024 • 14

upvoted a collection over 1 year ago

Llamafied Yi

Collection

Yi base models converted to Llama architecture. • 4 items • Updated Nov 14, 2023 • 9

upvoted 2 papers over 1 year ago

Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

Paper • 2310.09520 • Published Oct 14, 2023 • 12

Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 76

upvoted a paper almost 2 years ago

Personality Traits in Large Language Models

Paper • 2307.00184 • Published Jul 1, 2023 • 20