Hao Jiang's picture

Hao Jiang

TechxGenus

·

https://techxgenus.github.io/

TechxGenus

AI & ML interests

Code Intelligence; Large Language Model; AI Alignment; Efficient Inference

Recent Activity

liked a model 2 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

liked a model 2 days ago

meta-llama/Llama-4-Maverick-17B-128E-Instruct

liked a model 2 days ago

Dream-org/Dream-v0-Instruct-7B

View all activity

Organizations

None yet

TechxGenus's activity

New activity in TechxGenus/Mistral-Large-Instruct-2407-AWQ 6 months ago

full tool support to chat template

#5 opened 6 months ago by

Update max_position_embeddings to 128k context size instead of 32k

#4 opened 6 months ago by

New activity in TechxGenus/Mistral-Large-Instruct-2407-AWQ 8 months ago

Adding chat_template to tokenizer_config.json file

#3 opened 8 months ago by

singhsidhukuldeep

Script request

#1 opened 9 months ago by

singhsidhukuldeep

New activity in TechxGenus/Mistral-Large-Instruct-2407-AWQ 9 months ago

The model can be started using vllm, but no dialogue is possible.

#2 opened 9 months ago by

New activity in TechxGenus/Yi-9B-200K-AWQ 9 months ago

update license in readme.md

#1 opened 9 months ago by

New activity in TechxGenus/Yi-9B-200K-GPTQ 9 months ago

update license in readme.md

#1 opened 9 months ago by

New activity in TechxGenus/Yi-9B-AWQ 9 months ago

update license in readme.md

#1 opened 9 months ago by

New activity in TechxGenus/Yi-9B-Coder 9 months ago

Update README.md with license information

#1 opened 9 months ago by

New activity in TechxGenus/Yi-9B-GPTQ 10 months ago

Update README.md with license information

#1 opened 10 months ago by

New activity in TechxGenus/starcoder2-7b-AWQ 12 months ago

Bad generated text using tgi

#1 opened 12 months ago by

New activity in TechxGenus/Mini-Jamba-v2 12 months ago

Could you please share the initial weights of one of the experts from jamba?

#4 opened about 1 year ago by

Example Code for Initializing from Scratch

#3 opened about 1 year ago by

New activity in jetmoe/jetmoe-8b about 1 year ago

Question about MoA

#2 opened about 1 year ago by

New activity in TechxGenus/Mini-Jamba-v2 about 1 year ago

Fast Mamba kernels are not available. Make sure to they are installed and that the mamba module is on a CUDA device

#2 opened about 1 year ago by

New activity in TechxGenus/Mini-Jamba about 1 year ago

example code missing `import torch`

#1 opened about 1 year ago by

New activity in TechxGenus/Mini-Jamba-v2 about 1 year ago

[Request] Potential Release Of Training Code?

#1 opened about 1 year ago by

New activity in ai21labs/Jamba-v0.1 about 1 year ago

Smaller version to ease implementation experiments?

#12 opened about 1 year ago by

Coding performance of base model?

#11 opened about 1 year ago by

New activity in TechxGenus/starcoder2-15b-instruct about 1 year ago

AQLM please for this model

#5 opened about 1 year ago by