Tomer Ronen
tomer-nv
Recent Activity
Authored a paper about 1 month ago:
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
tomer-nv's activity
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#19 opened 3 months ago by tomer-nv
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#18 opened 3 months ago by tomer-nv
fixed cache over-alloc bug
#17 opened 3 months ago by tomer-nv