Ali Dadsetan's picture

16 4

Ali Dadsetan

dadsetan

·

@alidadsetan

AI & ML interests

NLP

Organizations

None yet

dadsetan's activity

upvoted 2 papers 4 months ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27 • 121

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20 • 52

upvoted a collection 5 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 16 days ago • 637

upvoted a paper 6 months ago

HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

Paper • 2406.19280 • Published Jun 27 • 61

upvoted 2 papers 7 months ago

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Paper • 2405.08707 • Published May 14 • 27

upvoted 2 papers 9 months ago

Localizing Paragraph Memorization in Language Models

Paper • 2403.19851 • Published Mar 28 • 13

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11 • 90

upvoted 3 papers 10 months ago

What do we learn from inverting CLIP models?

Paper • 2403.02580 • Published Mar 5 • 3

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 603

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27 • 190

upvoted 5 papers 11 months ago

In-Context Principle Learning from Mistakes

Paper • 2402.05403 • Published Feb 8 • 14

StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis

Paper • 2401.17093 • Published Jan 30 • 19

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26 • 69

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Paper • 2401.12070 • Published Jan 22 • 43

Patchscope: A Unifying Framework for Inspecting Hidden Representations of Language Models

Paper • 2401.06102 • Published Jan 11 • 20