Mayank Mishra

mayank-mishra

AI & ML interests

Large Language Models, Distributed Training and Inference

mayank-mishra's activity

New activity in ibm-granite/granite-8b-code-instruct about 13 hours ago

Model template

3
#1 opened about 22 hours ago by alex0dd
New activity in ibm-granite/granite-3b-code-base 1 day ago

Context length

5
#3 opened 3 days ago by mrfakename
New activity in ibm-granite/granite-3b-code-base 2 days ago

Licensing

4
#4 opened 2 days ago by tonylek

Release GGUF models?

1
#5 opened 2 days ago by CosmicSound
New activity in ibm-granite/granite-3b-code-base 3 days ago

Question

3
#2 opened 4 days ago by mrfakename
New activity in ibm-granite/granite-3b-code-base 8 days ago

Initial model card version

#1 opened 10 days ago by amezasor
New activity in blog-explorers/README about 1 month ago

[Support] Community Articles

28
#5 opened about 2 months ago by victor
New activity in ibm/MoLFormer-XL-both-10pct about 1 month ago
New activity in aurora-m/aurora-m-biden-harris-redteamed about 2 months ago

Update README.md

1
#1 opened 3 months ago by cabbage972
New activity in tiiuae/falcon-180B 7 months ago

Is Gigatron open source?

#6 opened 8 months ago by mayank-mishra
New activity in mayank-mishra/starcoder-GPTQ-4bit-128g 11 months ago
New activity in mosaicml/mpt-7b 12 months ago
New activity in mayank-mishra/starcoderbase-GPTQ-8bit-128g 12 months ago

Running this on consumer hardware

2
#1 opened about 1 year ago by piratos
New activity in bigcode/starcoder about 1 year ago

What are 0..7.bin?

2
#14 opened about 1 year ago by lozhnikov
New activity in bigcode/starcoderbase about 1 year ago

KeyError: 'gpt_bigcode'

1
#4 opened about 1 year ago by Bilibili
New activity in bigcode/gpt_bigcode-santacoder about 1 year ago
New activity in bigcode/santacoder about 1 year ago
New activity in bigscience/bloom over 1 year ago
New activity in bigscience/bloom almost 2 years ago