Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+ Apr 26 • 11
Revolutionizing Video Transcription: Unveiling Gemma-2b-it and Langchain in the Era of Transformers Mar 12 • 3
Uniting Forces: Integrating Hugging Face with Langchain for Enhanced Natural Language Processing Dec 18, 2023 • 4
Hearing is Believing: Revolutionizing AI with Audio Classification via Computer Vision Oct 22, 2023 • 1
Fine-Tuning
Direct Judgement Preference Optimization Paper • 2409.14664 • Published 29 days ago
Ankush Collection Transformer Articles
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention Paper • 2309.14327 • Published Sep 25, 2023 • 21
MambaVision: A Hybrid Mamba-Transformer Vision Backbone Paper • 2407.08083 • Published Jul 10 • 27
Memory^3: Language Modeling with Explicit Memory Paper • 2407.01178 • Published Jul 1 • 3
Teaching Transformers Causal Reasoning through Axiomatic Training Paper • 2407.07612 • Published Jul 10 • 2
Andyrasika/vit-base-patch16-224-in21k-finetuned-lora-food101 Image Classification • Updated Mar 7 • 7 • 2