Tanimazsin Tanimazsinoglu PRO

tanimazsin130

AI & ML interests

None yet

Organizations

tanimazsin130's activity

upvoted 2 papers 2 months ago

Fully Open Source Moxin-7B Technical Report

Paper • 2412.06845 • Published Dec 8, 2024 • 11

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published Dec 6, 2024 • 50

upvoted a collection 3 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 205

upvoted 2 collections 4 months ago

Pangea

A Fully Open Multilingual Multimodal LLM for 39 Languages • 26 items • Updated 19 days ago • 18

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Jan 17 • 153

upvoted a paper 7 months ago

Nemotron-4 15B Technical Report

Paper • 2402.16819 • Published Feb 26, 2024 • 43

upvoted 3 collections 7 months ago

INT4 LLMs for vLLM

Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! • 18 items • Updated Sep 26, 2024 • 8

INT8 LLMs for vLLM

Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 50 items • Updated Sep 26, 2024 • 15

FP8 LLMs for vLLM

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17, 2024 • 66

upvoted a paper 8 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 97

upvoted an article 8 months ago

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

Article • Published Jul 9, 2024 • 43

upvoted 2 collections 8 months ago

DeepSeekCoder-V2

6 items • Updated Sep 5, 2024 • 93

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Jan 17 • 162

upvoted 3 papers about 1 year ago

LLaMA Beyond English: An Empirical Study on Language Capability Transfer

Paper • 2401.01055 • Published Jan 2, 2024 • 54

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1, 2024 • 22

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Paper • 2310.08659 • Published Oct 12, 2023 • 27