-
meta-llama/Llama-2-7b-hf
Text Generation • Updated • 1.12M • 1.4k -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 173 -
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
Paper • 2401.04398 • Published • 18 -
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models
Paper • 2402.01118 • Published • 28
Alina
iblub
·
AI & ML interests
None yet
Organizations
None yet
Collections
2
models
17
iblub/a2c-PandaReachDense-v2
Reinforcement Learning
•
Updated
iblub/detr-finetuned-balloon
Object Detection
•
Updated
•
22
iblub/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
iblub/ppo-lunar-lander-week8
Reinforcement Learning
•
Updated
iblub/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
24
iblub/ppo-Pyramid
Reinforcement Learning
•
Updated
•
12
iblub/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
iblub/SnowballTarget1
Reinforcement Learning
•
Updated
•
15
iblub/Reinforce-PixelCopter-01
Reinforcement Learning
•
Updated
iblub/Reinforce-01
Reinforcement Learning
•
Updated
datasets
None public yet