Welcome to matlok

matlok

https://matlok.ai

matlok-ai

AI & ML interests

Welcome! We share large, open-source multimodal datasets for training and fine-tuning AI to write Python and build AI models. We also curate collections of guides, papers, datasets, models, and tools for techniques like frankenmerging AI models.

Recent Activity

updated a collection about 2 months ago

Models - Text - Math - GRPO

liked a model about 2 months ago

QuantFactory/deepseek-math-7b-instruct-GGUF

updated a collection about 2 months ago

Papers - Text - Classification - FastText

Organizations

None yet

matlok's activity

upvoted 5 papers about 2 months ago

FastText.zip: Compressing text classification models

Paper • 1612.03651 • Published Dec 12, 2016 • 1

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 113

upvoted 6 papers 2 months ago

Zero Bubble Pipeline Parallelism

Paper • 2401.10241 • Published Nov 30, 2023 • 24

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 11

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7, 2025 • 77

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published Nov 12, 2024 • 30

HunyuanVideo: A Systematic Framework For Large Video Generative Models

Paper • 2412.03603 • Published Dec 3, 2024 • 8

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 373

upvoted a paper 3 months ago

Memory Layers at Scale

Paper • 2412.09764 • Published Dec 12, 2024 • 3

upvoted a collection 3 months ago

Code Evaluation

Collection

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 15

upvoted 7 papers 3 months ago

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

Paper • 2102.04664 • Published Feb 9, 2021 • 2

Deep Data Flow Analysis

Paper • 2012.01470 • Published Nov 21, 2020 • 1

Classical Sorting Algorithms as a Model of Morphogenesis: self-sorting arrays reveal unexpected competencies in a minimal model of basal intelligence

Paper • 2401.05375 • Published Dec 15, 2023 • 1

Compiling C to Safe Rust, Formalized

Paper • 2412.15042 • Published Dec 19, 2024 • 1

MrT5: Dynamic Token Merging for Efficient Byte-level Language Models

Paper • 2410.20771 • Published Oct 28, 2024 • 3

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 95

wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 6