41 7 39

Huu Nguyen

huu-ontocord

AI & ML interests

None yet

Recent Activity

updated a dataset about 22 hours ago

ontocord/MixtureVitae

updated a dataset 2 days ago

ontocord/interleaved_seed2_obelics

updated a dataset 2 days ago

ontocord/megawiki_with_gov_docs

View all activity

Organizations

huu-ontocord's activity

New activity in LLM360/TxT360 26 days ago

URLs -> cluster label

#12 opened 26 days ago by

huu-ontocord

commented a paper about 1 month ago

Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

Paper • 2502.19261 • Published Feb 26 • 7 •

New activity in Qwen/Qwen2.5-7B-Instruct-1M 2 months ago

Did the base 1M start from qwen2.5-7b?

#4 opened 2 months ago by

huu-ontocord

New activity in HuggingFaceTB/smoltalk 2 months ago

Update SmolTalk distilabel pipelines link.

#6 opened 2 months ago by

tranhd95

New activity in HuggingFaceFV/finevideo 4 months ago

Cleanup TTS

#16 opened 6 months ago by

huu-ontocord

New activity in lmms-lab/muchomusic 4 months ago

Can you add a description of dataset and license?

#3 opened 4 months ago by

huu-ontocord

New activity in ontocord/VALID 4 months ago

Wedataset and other format

#2 opened 4 months ago by

huu-ontocord

New activity in SwayStar123/preprocessed_commoncatalog-cc-by 5 months ago

Hi - is this florence2 of common catalog cc-by?

#2 opened 5 months ago by

huu-ontocord

New activity in mlfoundations/MINT-1T-PDF-CC-2024-18 5 months ago

can you explain how you associate multiple images to the same text? is it by filename-*.tiff?

#4 opened 5 months ago by

huu-ontocord

commented a paper 5 months ago

A Survey of Small Language Models

Paper • 2410.20011 • Published Oct 25, 2024 • 44 •

New activity in Magpie-Align/MagpieLM-DPO-Data-v0.1 5 months ago

could you add a license to this dataset please?

#2 opened 5 months ago by

huu-ontocord

New activity in Magpie-Align/Magpie-Qwen2-Pro-200K-Chinese 5 months ago

can you please add a license, preferably cc-by or Apache?

#4 opened 5 months ago by

huu-ontocord

New activity in openbmb/UltraInteract_pair 7 months ago

Can you add the observation and critique data for each step?

#3 opened 7 months ago by

huu-ontocord

New activity in open-llm-leaderboard/open_llm_leaderboard 8 months ago

phi-3-small-128k MATH Lvl 5 is 0

#897 opened 8 months ago by

huu-ontocord

New activity in xiaociwei/YFCC15M-LLaVA-Cap 8 months ago

license

#2 opened 8 months ago by

huu-ontocord

commented 2 papers 10 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 93 •

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 93 •

New activity in ajibawa-2023/Children-Stories-Collection 10 months ago

Information and DOI request

#1 opened 11 months ago by

wiirginia

New activity in common-canvas/commoncatalog-cc-by-sa 10 months ago

releasing just the captions and alt?

#2 opened 10 months ago by

huu-ontocord

New activity in microsoft/Phi-3-medium-128k-instruct 11 months ago

about multilingual label

#8 opened 11 months ago by

StatPan