Daniel Huynh PRO

dhuynh95

AI & ML interests

None yet

Articles

Automatic Hallucination detection with SelfCheckGPT NLI

Nov 27, 2023

• 1

StarCoder Memorization Experiment Highlights Privacy Risks of Fine-Tuning On Code

Nov 2, 2023

Introducing BlindChat, an open-source and privacy-by-design Conversational AI fully in-browser

Sep 22, 2023

AI Total Cost of Ownership Calculator: Evaluate the cost of in-house AI deployment vs AI APIs

Sep 20, 2023

• 1

dhuynh95's activity

upvoted an article 17 days ago

Article

Improving Prompt Consistency with Structured Generations

19 days ago

• 41

upvoted a paper 21 days ago

Flamingo: a Visual Language Model for Few-Shot Learning

Paper • 2204.14198 • Published Apr 29, 2022 • 13

upvoted 2 papers 29 days ago

Compression Represents Intelligence Linearly

Paper • 2404.09937 • Published Apr 15 • 27

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published about 1 month ago • 51

upvoted 6 papers about 1 month ago

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Paper • 2404.02575 • Published Apr 3 • 46

upvoted 11 papers about 2 months ago

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text

Paper • 2403.18421 • Published Mar 27 • 20

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27 • 23

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26 • 75

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22 • 30

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Paper • 2403.15246 • Published Mar 22 • 8

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21 • 50

When Do We Not Need Larger Vision Models?

Paper • 2403.13043 • Published Mar 19 • 24

Reverse Training to Nurse the Reversal Curse

Paper • 2403.13799 • Published Mar 20 • 12

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15 • 60

Larimar: Large Language Models with Episodic Memory Control

Paper • 2403.11901 • Published Mar 18 • 30

LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

Paper • 2403.12968 • Published Mar 19 • 20

upvoted 6 papers 2 months ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 172

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23

Do Large Language Models Latently Perform Multi-Hop Reasoning?

Paper • 2402.16837 • Published Feb 26 • 24

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22 • 21

Beyond Language Models: Byte Models are Digital World Simulators

Paper • 2402.19155 • Published Feb 29 • 44

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22 • 77

upvoted 3 papers 3 months ago

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19 • 15

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 73

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 90

upvoted a collection 3 months ago

LLM Hallucination Detection Papers

Collection

Collection of LLM hallucination and evaluation papers that I've been exploring and implementing. Some of them have my comments and annotated doodles. • 12 items • Updated Feb 20 • 12

upvoted a paper 3 months ago

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

Paper • 2303.08896 • Published Mar 15, 2023 • 4

upvoted 4 papers 6 months ago

Fine-tuning Language Models for Factuality

Paper • 2311.08401 • Published Nov 14, 2023 • 26

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Paper • 2311.12022 • Published Nov 20, 2023 • 22

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 68

Instruction-Following Evaluation for Large Language Models

Paper • 2311.07911 • Published Nov 14, 2023 • 17

upvoted 3 papers 7 months ago

Tuna: Instruction Tuning using Feedback from Large Language Models

Paper • 2310.13385 • Published Oct 20, 2023 • 8

Democratizing Reasoning Ability: Tailored Learning from Large Language Model

Paper • 2310.13332 • Published Oct 20, 2023 • 14

Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models

Paper • 2310.13127 • Published Oct 19, 2023 • 10

upvoted a collection 7 months ago

Tiny Series

Collection

Tiny datasets that empower the foundation of Small Language Models! • 11 items • Updated Jan 26 • 31