- Preference Datasets for DPO (collection): curated preference datasets for DPO fine-tuning aimed at intent alignment of LLMs. 7 items, updated Apr 4.
- WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation (paper, arXiv:2312.14187, published Dec 20, 2023)
- Gemini: A Family of Highly Capable Multimodal Models (paper, arXiv:2312.11805, published Dec 19, 2023)
- Orca 2: Teaching Small Language Models How to Reason (paper, arXiv:2311.11045, published Nov 18, 2023)
- Nemotron 3 8B (collection): the Nemotron 3 8B family of models, optimized for building production-ready generative AI applications for the enterprise. 5 items, updated Feb 19.
- Eureka: Human-Level Reward Design via Coding Large Language Models (paper, arXiv:2310.12931, published Oct 19, 2023)
- In-Context Pretraining: Language Modeling Beyond Document Boundaries (paper, arXiv:2310.10638, published Oct 16, 2023)
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models (paper, arXiv:2309.12307, published Sep 21, 2023)
- A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis (paper, arXiv:2307.12856, published Jul 24, 2023)