Hao Jiang
TechxGenus
AI & ML interests
Code Intelligence; Large Language Model; AI Alignment; Efficient Inference
Recent Activity
liked
a model
1 day ago
tencent/Hunyuan-7B-Instruct
upvoted
a paper
3 days ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
updated
a dataset
5 days ago
TechxGenus/deepseek_r1_code_1k
Organizations
None yet
TechxGenus's activity
full tool support to chat template
#5 opened 4 months ago
by
qwopqwop
Update max_position_embeddings to 128k context size instead of 32k
#4 opened 4 months ago
by
qwopqwop
Adding chat_template to tokenizer_config.json file
1
#3 opened 6 months ago
by
singhsidhukuldeep
Script request
3
#1 opened 6 months ago
by
singhsidhukuldeep
The model can be started using vllm, but no dialogue is possible.
3
#2 opened 6 months ago
by
SongXiaoMao
update license in readme.md
#1 opened 7 months ago
by
Chen-01AI
update license in readme.md
#1 opened 7 months ago
by
Chen-01AI
update license in readme.md
#1 opened 7 months ago
by
Chen-01AI
Update README.md with license information
#1 opened 7 months ago
by
Chen-01AI
Update README.md with license information
1
#1 opened 7 months ago
by
Chen-01AI
Bad generated text using tgi
1
#1 opened 9 months ago
by
erfanium
Could you please share the initial weights of one of the experts from jamba?
3
#4 opened 10 months ago
by
danielpark
Example Code for Initializing from Scratch
1
#3 opened 10 months ago
by
tanimazsin130
Question about MoA
#2 opened 10 months ago
by
TechxGenus
example code missing `import torch`
1
#1 opened 10 months ago
by
Lyte
[Request] Potential Release Of Training Code?
2
#1 opened 10 months ago
by
Lyte
Smaller version to ease implementation experiments?
7
#12 opened 10 months ago
by
compilade
Coding performance of base model?
4
#11 opened 10 months ago
by
rombodawg
AQLM please for this model
1
#5 opened 10 months ago
by
AiModelsMarket