Knut Jägersberg
KnutJaegersberg
AI & ML interests
NLP, opinion mining, narrative intelligence
Articles
Organizations
KnutJaegersberg's activity
upvoted
a
paper
about 8 hours ago
upvoted
an
article
4 days ago
Article
Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+
By
•
•
4upvoted
a
paper
8 days ago
upvoted
a
paper
23 days ago
upvoted
a
paper
26 days ago
upvoted
a
collection
27 days ago
upvoted
a
paper
about 1 month ago
upvoted
a
collection
about 2 months ago
upvoted
a
paper
about 2 months ago
upvoted
a
collection
3 months ago
Weaver: Foundation Models for Creative Writing
Paper
•
2401.17268
•
Published
•
39
Quantifying the Carbon Emissions of Machine Learning
Paper
•
1910.09700
•
Published
•
3
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
Paper
•
2401.12070
•
Published
•
40
ChatQA: Building GPT-4 Level Conversational QA Models
Paper
•
2401.10225
•
Published
•
27
ReFT: Reasoning with Reinforced Fine-Tuning
Paper
•
2401.08967
•
Published
•
26
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Paper
•
2401.08417
•
Published
•
25
TeleChat Technical Report
Paper
•
2401.03804
•
Published
•
7
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper
•
2401.01335
•
Published
•
61
aMUSEd: An Open MUSE Reproduction
Paper
•
2401.01808
•
Published
•
26
Improving Text Embeddings with Large Language Models
Paper
•
2401.00368
•
Published
•
72
Supervised Knowledge Makes Large Language Models Better In-context Learners
Paper
•
2312.15918
•
Published
•
8
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation
Paper
•
2312.14187
•
Published
•
48
MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications
Paper
•
2310.15777
•
Published
•
2
upvoted
a
paper
8 months ago
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper
•
2307.09288
•
Published
•
233
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Paper
•
2306.08568
•
Published
•
26
Stay on topic with Classifier-Free Guidance
Paper
•
2306.17806
•
Published
•
26
Instruction Mining: High-Quality Instruction Data Selection for Large Language Models
Paper
•
2307.06290
•
Published
•
9
Extending Context Window of Large Language Models via Positional Interpolation
Paper
•
2306.15595
•
Published
•
52