Daniel Huynh PRO
dhuynh95
AI & ML interests
None yet
Articles
Organizations
dhuynh95's activity
upvoted
an
article
11 days ago
Article
Improving Prompt Consistency with Structured Generations
•
35
upvoted
a
paper
15 days ago
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models
Paper
•
2404.02575
•
Published
•
46
CodeEditorBench: Evaluating Code Editing Capability of Large Language Models
Paper
•
2404.03543
•
Published
•
15
Training LLMs over Neurally Compressed Text
Paper
•
2404.03626
•
Published
•
21
Long-context LLMs Struggle with Long In-context Learning
Paper
•
2404.02060
•
Published
•
32
Poro 34B and the Blessing of Multilinguality
Paper
•
2404.01856
•
Published
•
12
Octopus v2: On-device language model for super agent
Paper
•
2404.01744
•
Published
•
52
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
Paper
•
2403.18421
•
Published
•
20
Long-form factuality in large language models
Paper
•
2403.18802
•
Published
•
23
The Unreasonable Ineffectiveness of the Deeper Layers
Paper
•
2403.17887
•
Published
•
74
Can large language models explore in-context?
Paper
•
2403.15371
•
Published
•
30
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Paper
•
2403.15246
•
Published
•
8
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Paper
•
2403.14624
•
Published
•
50
When Do We Not Need Larger Vision Models?
Paper
•
2403.13043
•
Published
•
24
Reverse Training to Nurse the Reversal Curse
Paper
•
2403.13799
•
Published
•
12
RAFT: Adapting Language Model to Domain Specific RAG
Paper
•
2403.10131
•
Published
•
58
Larimar: Large Language Models with Episodic Memory Control
Paper
•
2403.11901
•
Published
•
30
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Paper
•
2403.12968
•
Published
•
20
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper
•
2403.03507
•
Published
•
172
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Paper
•
2402.17193
•
Published
•
23
Do Large Language Models Latently Perform Multi-Hop Reasoning?
Paper
•
2402.16837
•
Published
•
24
Watermarking Makes Language Models Radioactive
Paper
•
2402.14904
•
Published
•
21
Beyond Language Models: Byte Models are Digital World Simulators
Paper
•
2402.19155
•
Published
•
44
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper
•
2402.14658
•
Published
•
77
upvoted
a
collection
3 months ago
upvoted
a
paper
3 months ago
Fine-tuning Language Models for Factuality
Paper
•
2311.08401
•
Published
•
26
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Paper
•
2311.12022
•
Published
•
22
Orca 2: Teaching Small Language Models How to Reason
Paper
•
2311.11045
•
Published
•
68
Instruction-Following Evaluation for Large Language Models
Paper
•
2311.07911
•
Published
•
17
Tuna: Instruction Tuning using Feedback from Large Language Models
Paper
•
2310.13385
•
Published
•
8
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Paper
•
2310.13332
•
Published
•
14
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Paper
•
2310.13127
•
Published
•
10