Surya Bhupatiraju
suryabhupa
AI & ML interests
part of the Gemma Team -- Language models, Reinforcement Learning
Organizations
suryabhupa's activity
inquiry for gemma-7b : d_model
1
#61 opened 12 months ago
by
seongwoon
Hallucinations, misspellings etc. Something seems broken?
21
#10 opened 8 months ago
by
sam-paech
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65ad56b4c2eef2ba1154618c/CRD6nLAX-34Dmqn1D4xbg.png)
tokenizer chat_template has no role system
2
#9 opened 8 months ago
by
wnma3mz
Citation URL redirects to Gemma-1
1
#8 opened 8 months ago
by
yumemio
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6631f0f5992449cefe17e068/EXkQ3_IA16xGtP1yEetc9.png)
Asking same thing twice or thrice in hugging face chat breaks it , same thing on ollama
1
#7 opened 8 months ago
by
Jayakumark
transformers load fails?
7
#6 opened 8 months ago
by
bdambrosio
![](https://cdn-avatars.huggingface.co/v1/production/uploads/641b67291911d3be67457cea/1idJEo5LHZ_XKJJUKssen.jpeg)
Flash attention 2 is not working
3
#9 opened 8 months ago
by
nalf3in2
Unable to reproduce the score of gemma_2b at pass@1 in humaneval.
3
#53 opened 10 months ago
by
ChiYuqi
What do they mean by maj@1 ?
3
#44 opened 9 months ago
by
joserass
Fine-Tune a gemma model for question answering
17
#62 opened 12 months ago
by
Iamexperimenting
save, loading and inferencing the Gemma model
13
#64 opened 12 months ago
by
Iamexperimenting
Need info on pre-training and instruction-tuning data
3
#64 opened 12 months ago
by
markding
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d1218684bfbee86b6ee521/BpXX_XUP80IfdGAvbs_VI.png)
Inference with RTX 3090 got OOM
3
#89 opened 10 months ago
by
kathylee
Weird Performance Issue with Gemma-7b compared to Gemma-2b with Qlora
6
#91 opened 10 months ago
by
UserDAN
What's the context window for this model?
6
#73 opened 12 months ago
by
siddheshgunjal
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65479a8ae3486f8a5eb961aa/mzNEwPjvGBNBUe210AzgH.png)
pretraining Gemma for domain dataset
8
#41 opened 11 months ago
by
Iamexperimenting
gemma -2b with multi-gpu
3
#44 opened 11 months ago
by
Iamexperimenting
<pad> spam issue
13
#40 opened 12 months ago
by
Zewsic
evaluation loss not calculated during during?
2
#43 opened 11 months ago
by
Iamexperimenting
Dont download, google scuttled this model
16
#77 opened 11 months ago
by
Tom-Neverwinter