Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
5
Rutam Risaldar
thugCodeNinja
Follow
0 followers
ยท
1 following
https://www.datascienceportfol.io/rutamrisaldar
AI & ML interests
Machine Learning, Deep Learning, NLP, LLM, Explainable AI
Recent Activity
liked
a model
16 days ago
thugCodeNinja/robertafinetune
new
activity
16 days ago
thugCodeNinja/robertafinetune:
Adding `safetensors` variant of this model
reacted
to
singhsidhukuldeep
's
post
with ๐
2 months ago
Groundbreaking Research Alert: Rethinking RAG with Cache-Augmented Generation (CAG) Researchers from National Chengchi University and Academia Sinica have introduced a paradigm-shifting approach that challenges the conventional wisdom of Retrieval-Augmented Generation (RAG). Instead of the traditional retrieve-then-generate pipeline, their innovative Cache-Augmented Generation (CAG) framework preloads documents and precomputes key-value caches, eliminating the need for real-time retrieval during inference. Technical Deep Dive: - CAG preloads external knowledge and precomputes KV caches, storing them for future use - The system processes documents only once, regardless of subsequent query volume - During inference, it loads the precomputed cache alongside user queries, enabling rapid response generation - The cache reset mechanism allows efficient handling of multiple inference sessions through strategic token truncation Performance Highlights: - Achieved superior BERTScore metrics compared to both sparse and dense retrieval RAG systems - Demonstrated up to 40x faster generation times compared to traditional approaches - Particularly effective with both SQuAD and HotPotQA datasets, showing robust performance across different knowledge tasks Why This Matters: The approach significantly reduces system complexity, eliminates retrieval latency, and mitigates common RAG pipeline errors. As LLMs continue evolving with expanded context windows, this methodology becomes increasingly relevant for knowledge-intensive applications.
View all activity
Organizations
None yet
spaces
8
Sort:ย Recently updated
Runtime error
5
Deepfakedetect
๐จ
No application file
ElderlyCareCompanion
๐ข
Sleeping
Coupon Reommder
๐
Sleeping
Finetune
๐
Runtime error
2
SpoofDetection
๐
Sleeping
1
ChatGPTtextdetction
๐ป
Expand 8 spaces
models
4
Sort:ย Recently updated
thugCodeNinja/robertafinetune
Zero-Shot Classification
โข
Updated
16 days ago
โข
26
โข
1
thugCodeNinja/robertatemp
Text Classification
โข
Updated
Mar 30, 2024
โข
6
thugCodeNinja/DeepFakeDetect
Video Classification
โข
Updated
Mar 7, 2024
โข
2
thugCodeNinja/SpoofDetection
Audio Classification
โข
Updated
Feb 2, 2024
datasets
None public yet