VincentN's picture

41 1 3

VincentN

vince62s

·

https://www.linkedin.com/in/vincentnguyenngoc/

AI & ML interests

None yet

Recent Activity

new activity 5 days ago

utter-project/EuroLLM-9B-Instruct:Training data

new activity 15 days ago

HuggingFaceTB/smollm-corpus:Going multilingual

new activity 17 days ago

HuggingFaceFW/fineweb-edu:Most of the data is duplicated?

View all activity

Organizations

vince62s's activity

New activity in utter-project/EuroLLM-9B-Instruct 5 days ago

Training data

#2 opened 4 months ago by

New activity in HuggingFaceTB/smollm-corpus 15 days ago

Going multilingual

#16 opened 15 days ago by

New activity in HuggingFaceFW/fineweb-edu 17 days ago

Most of the data is duplicated?

#7 opened 10 months ago by

New activity in utter-project/EuroLLM-9B about 1 month ago

bigger model

#7 opened about 1 month ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-8B 2 months ago

When it comes to maths, the model does not generate correct Latex tags which triggers awful outputs.

#6 opened 3 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Llama-8B 3 months ago

missing special_tokens_map.json file

#2 opened 3 months ago by

New activity in vince62s/wmt22-cometkiwi-da-roberta-large 3 months ago

Adding `safetensors` variant of this model

#1 opened 4 months ago by

New activity in HuggingFaceFW/fineweb 4 months ago

Text Clusters

#55 opened 4 months ago by

New activity in Unbabel/TowerInstruct-Mistral-7B-v0.2 6 months ago

Results discussion

#6 opened 6 months ago by

New activity in utter-project/EuroLLM-1.7B-Instruct 6 months ago

Clarification on the way the tokenizer should be used

#6 opened 6 months ago by

New activity in eole-nlp/cometkiwi-xxl-eole 6 months ago

Quality Estimation

#1 opened 6 months ago by

New activity in utter-project/EuroLLM-1.7B 7 months ago

Dataset for pretraining ?

#2 opened 7 months ago by

New activity in Unbabel/TowerInstruct-Mistral-7B-v0.2 7 months ago

Question on the usage of TowerBlocks to train this model

#3 opened 7 months ago by

New activity in Unbabel/TowerInstruct-7B-v0.2 7 months ago

Bug with the HF Tokenizer

#7 opened 7 months ago by

New activity in vince62s/wmt23-cometkiwi-da-roberta-xl 9 months ago

IndexError: index out of range in self

#2 opened 10 months ago by

Will there be a XXL version released in the future ?

#3 opened 9 months ago by

New activity in vince62s/wmt23-cometkiwi-da-roberta-xl 10 months ago

Half precision

#1 opened 10 months ago by

New activity in Unbabel/wmt23-cometkiwi-da-xl 10 months ago

Can we run this in FP16 instead of FP32 ?

#3 opened about 1 year ago by

New activity in rhysjones/phi-2-orange-v2 about 1 year ago

TruthfulQA score look odd, no ?

#3 opened about 1 year ago by

New activity in google/gemma-2b about 1 year ago

torch.cuda.OutOfMemoryError

#26 opened about 1 year ago by