Dimitris Roussis
droussis
AI & ML interests
All things data for LLMs, NMT, evaluation, safety, alignment, and more
Recent Activity
upvoted
an
article
2 days ago
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM
updated
a model
3 days ago
ilsp/Llama-Krikri-8B-Instruct-GGUF
updated
a model
3 days ago
ilsp/Llama-Krikri-8B-Instruct
Organizations
droussis's activity
Thinking token generation
1
#2 opened 17 days ago
by
thtang
What languages were you trained in?
2
#7 opened 9 days ago
by
NickyNicky

Bug on the tokenizer, using the code that you provided for the inference.
6
#2 opened 30 days ago
by
Ptrnk
Seems very promising
3
#1 opened about 1 month ago
by
gstrat88
Is this the same as Kurage?
2
#2 opened 4 months ago
by
droussis

Context extension?
1
#4 opened 5 months ago
by
droussis

Dataset fails loading
#2 opened 6 months ago
by
droussis

Dataset fails loading
#1 opened 7 months ago
by
droussis

About context size and difference in quality
3
#1 opened 10 months ago
by
droussis

Future plans (Llama 3?)
1
#3 opened 11 months ago
by
velocity

LLama-Factory inference issue
14
#2 opened 12 months ago
by
ianss
Regarding quality assessment
2
#1 opened about 1 year ago
by
droussis

Community request: more languages
8
#1 opened about 1 year ago
by
emre

Which part of HC3?
#1 opened over 1 year ago
by
droussis

Difference between 40K and 395K dataset
#2 opened over 1 year ago
by
droussis

The model output is totally corrupted
4
#5 opened over 1 year ago
by
fernandofernandes

Fix weights by putting the right value in `lm_head.weight`
3
#3 opened over 1 year ago
by
sgugger

The model output is totally corrupted
4
#5 opened over 1 year ago
by
fernandofernandes
