charliewaters (Charlie Waters)

upvoted 30 papers 2 months ago

UniVTG: Towards Unified Video-Language Temporal Grounding

Paper • 2307.16715 • Published Jul 31, 2023 • 10

AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?

Paper • 2307.16368 • Published Jul 31, 2023 • 11

LP-MusicCaps: LLM-Based Pseudo Music Captioning

Paper • 2307.16372 • Published Jul 31, 2023 • 37

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

Paper • 2407.16607 • Published Jul 23 • 21

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28 • 19

3D Question Answering for City Scene Understanding

Paper • 2407.17398 • Published Jul 24 • 21

ATHAR: A High-Quality and Diverse Dataset for Classical Arabic to English Translation

Paper • 2407.19835 • Published Jul 29 • 20

SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain

Paper • 2407.19584 • Published Jul 28 • 60

A Large Encoder-Decoder Family of Foundation Models For Chemical Language

Paper • 2407.20267 • Published Jul 24 • 31

Adapting Safe-for-Work Classifier for Malaysian Language Text: Enhancing Alignment in LLM-Ops Framework

Paper • 2407.20729 • Published Jul 30 • 25

JaColBERTv2.5: Optimising Multi-Vector Retrievers to Create State-of-the-Art Japanese Retrievers with Constrained Resources

Paper • 2407.20750 • Published Jul 30 • 21

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Paper • 2404.03653 • Published Apr 4 • 33

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 89

WalkTheDog: Cross-Morphology Motion Alignment via Phase Manifolds

Paper • 2407.18946 • Published Jul 11 • 12

Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification

Paper • 2407.19340 • Published Jul 27 • 56

Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians

Paper • 2403.17898 • Published Mar 26 • 14

Improving Text-to-Image Consistency via Automatic Prompt Optimization

Paper • 2403.17804 • Published Mar 26 • 15

2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Paper • 2403.17888 • Published Mar 26 • 26

sDPO: Don't Use Your Data All at Once

Paper • 2403.19270 • Published Mar 28 • 38

LITA: Language Instructed Temporal-Localization Assistant

Paper • 2403.19046 • Published Mar 27 • 17

Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation

Paper • 2403.19319 • Published Mar 28 • 11

SHIC: Shape-Image Correspondences with no Keypoint Supervision

Paper • 2407.18907 • Published Jul 26 • 39

upvoted a collection 2 months ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 16 days ago • 676

upvoted 8 papers 2 months ago

Mosaic-SDF for 3D Generative Models

Paper • 2312.09222 • Published Dec 14, 2023 • 15

Pixel Aligned Language Models

Paper • 2312.09237 • Published Dec 14, 2023 • 14

StemGen: A music generation model that listens

Paper • 2312.08723 • Published Dec 14, 2023 • 47

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

Paper • 2311.13073 • Published Nov 22, 2023 • 56

LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes

Paper • 2311.13384 • Published Nov 22, 2023 • 50

PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction

Paper • 2311.12024 • Published Nov 20, 2023 • 18

PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

Paper • 2311.12198 • Published Nov 20, 2023 • 22

MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer

Paper • 2311.12052 • Published Nov 18, 2023 • 32

upvoted 4 papers 3 months ago

Internal Consistency and Self-Feedback in Large Language Models: A Survey

Paper • 2407.14507 • Published Jul 19 • 45

Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition

Paper • 2407.13559 • Published Jul 18 • 12

VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding

Paper • 2407.12594 • Published Jul 17 • 19

SciCode: A Research Coding Benchmark Curated by Scientists

Paper • 2407.13168 • Published Jul 18 • 13

Charlie Waters

AI & ML interests

Organizations

charliewaters's activity