Hao Jiang's picture

Hao Jiang

TechxGenus

·

https://techxgenus.github.io/

TechxGenus

AI & ML interests

Code Intelligence; Large Language Model; AI Alignment; Efficient Inference

Recent Activity

liked a model about 16 hours ago

GSAI-ML/LLaDA-8B-Instruct

liked a model 3 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

liked a model 3 days ago

meta-llama/Llama-4-Maverick-17B-128E-Instruct

View all activity

Organizations

None yet

TechxGenus's activity

upvoted a paper 3 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 6 days ago • 43

upvoted a collection 13 days ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 13 days ago • 80

upvoted a collection 28 days ago

Gemma 3 Release

17 items • Updated 6 days ago • 317

upvoted a paper about 2 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 179

upvoted a collection about 2 months ago

The Ultimate Collection of Code Classifiers

🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated Feb 20 • 11

upvoted a paper about 2 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 136

upvoted 2 papers 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 374

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 96

upvoted a collection 3 months ago

DeepSeek-V3

4 items • Updated 15 days ago • 235

upvoted 3 collections 4 months ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 141

DeepSeek-V2.5

2 items • Updated Dec 10, 2024 • 40

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 26 days ago • 78

upvoted 2 papers 5 months ago

MagicQuill: An Intelligent Interactive Image Editing System

Paper • 2411.09703 • Published Nov 14, 2024 • 75

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 124

upvoted 2 collections 5 months ago

OpenCoder Datasets

OpenCoder datasets! • 6 items • Updated Nov 15, 2024 • 40

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 81

upvoted a paper 5 months ago

LLM-Assisted Code Cleaning For Training Accurate Code Generators

Paper • 2311.14904 • Published Nov 25, 2023 • 4

upvoted 2 collections 6 months ago

Text to SVG papers

7 items • Updated Apr 30, 2024 • 5

SVG generation

6 items • Updated Apr 30, 2024 • 6

upvoted an article 6 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 229