Mert Inan

merterm

http://merterm.github.io/

AI & ML interests

multimodal dialogue, conversational ai, NeuroRoboNLP, sign language processing

Recent Activity

liked a dataset 16 days ago

wellesley-easel/StudentEval

updated a Space 3 months ago

merterm/Learning-Games-Experiment

upvoted a collection 3 months ago

OpenCoder

View all activity

Organizations

None yet

merterm's activity

upvoted a collection 3 months ago

OpenCoder

Collection

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 79

upvoted a paper 6 months ago

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 42

upvoted a collection 7 months ago

OLMo Suite

Collection

Artifacts for the first set of OLMo models. • 18 items • Updated 6 days ago • 71

upvoted 2 collections 8 months ago

Core ML Gallery Models

Collection

7 items • Updated Oct 4, 2024 • 34

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Jan 17 • 162

upvoted a paper 8 months ago

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 58

upvoted 2 collections 9 months ago

PaliGemma Release

Collection

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Dec 13, 2024 • 145

Granite Code Models

Collection

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated Dec 18, 2024 • 182

upvoted an article 9 months ago

Article

Releasing Swift Transformers: Run On-Device LLMs in Apple Devices

Aug 8, 2023

• 31

upvoted a collection 10 months ago

OpenELM Instruct Models

Collection

4 items • Updated Oct 4, 2024 • 117

upvoted 2 papers 11 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 138

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 126

upvoted a paper 12 months ago

Flamingo: a Visual Language Model for Few-Shot Learning

Paper • 2204.14198 • Published Apr 29, 2022 • 14

upvoted a collection 12 months ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated Dec 13, 2024 • 330

upvoted 4 papers about 1 year ago

upvoted a collection about 1 year ago

Mamba

Collection

Mamba SSM Models with hf_integration. • 7 items • Updated Dec 28, 2023 • 7

upvoted a paper over 1 year ago

AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn

Paper • 2306.08640 • Published Jun 14, 2023 • 26