Hao Jiang
TechxGenus
AI & ML interests
Code Intelligence; Large Language Model; AI Alignment; Efficient Inference
Recent Activity
liked
a model
2 days ago
meta-llama/Llama-4-Scout-17B-16E-Instruct
liked
a model
2 days ago
meta-llama/Llama-4-Maverick-17B-128E-Instruct
liked
a model
2 days ago
Dream-org/Dream-v0-Instruct-7B
Organizations
None yet
TechxGenus's activity
full tool support to chat template
#5 opened 6 months ago
by
qwopqwop
Update max_position_embeddings to 128k context size instead of 32k
#4 opened 6 months ago
by
qwopqwop
Adding chat_template to tokenizer_config.json file
1
#3 opened 8 months ago
by
singhsidhukuldeep

Script request
3
#1 opened 9 months ago
by
singhsidhukuldeep

The model can be started using vllm, but no dialogue is possible.
3
#2 opened 9 months ago
by
SongXiaoMao

update license in readme.md
#1 opened 9 months ago
by
Chen-01AI
update license in readme.md
#1 opened 9 months ago
by
Chen-01AI
update license in readme.md
#1 opened 9 months ago
by
Chen-01AI
Update README.md with license information
#1 opened 9 months ago
by
Chen-01AI
Update README.md with license information
1
#1 opened 10 months ago
by
Chen-01AI
Bad generated text using tgi
1
#1 opened 12 months ago
by
erfanium

Could you please share the initial weights of one of the experts from jamba?
3
#4 opened about 1 year ago
by
danielpark
Example Code for Initializing from Scratch
1
#3 opened about 1 year ago
by
tanimazsin130
Question about MoA
#2 opened about 1 year ago
by
TechxGenus

example code missing `import torch`
1
#1 opened about 1 year ago
by
Lyte

[Request] Potential Release Of Training Code?
2
#1 opened about 1 year ago
by
Lyte

Smaller version to ease implementation experiments?
7
#12 opened about 1 year ago
by
compilade

Coding performance of base model?
4
#11 opened about 1 year ago
by
rombodawg

AQLM please for this model
1
#5 opened about 1 year ago
by
AiModelsMarket