Peter Szemraj

pszemraj

AI & ML interests

metallic intuition

pszemraj's activity

New activity in HuggingFaceFW/fineweb-edu-classifier 8 days ago

fix example code

1
#2 opened 8 days ago by pszemraj
New activity in pszemraj/Llama-3-6.3b-v0.1 9 days ago

Excellent Approach

2
#1 opened 10 days ago by 1littlecoder
New activity in pszemraj/Mistral-v0.3-6B 17 days ago
New activity in open-llm-leaderboard/open_llm_leaderboard 25 days ago

ALL Jamba models failing

16
#690 opened about 2 months ago by devingulliver
New activity in EleutherAI/pile-t5-large about 2 months ago

why UMT5

5
#1 opened 2 months ago by pszemraj
New activity in BEE-spoke-data/smol_llama-101M-GQA about 2 months ago

Link to code repository

1
#3 opened about 2 months ago by ewre324
New activity in BEE-spoke-data/smol_llama-220M-openhermes about 2 months ago

What is the model architecture?

1
#2 opened about 2 months ago by ewre324
New activity in amazingvince/Not-WizardLM-2-7B about 2 months ago

add link to colab example

#1 opened about 2 months ago by pszemraj
New activity in pszemraj/qmsum-cleaned 2 months ago

Empty test cases

2
#2 opened 2 months ago by StDestiny
New activity in pszemraj/led-large-book-summary 2 months ago

Hardware requirements

1
#19 opened 3 months ago by Arunmass
New activity in ai21labs/Jamba-v0.1 3 months ago

Jambaleo

#10 opened 3 months ago by pszemraj
New activity in BEE-spoke-data/gutenberg-en-v1-clean 3 months ago
New activity in postbot/gpt-neo-1.3B-emailgen 3 months ago
New activity in pszemraj/distilgpt2-HC3 3 months ago
New activity in BEE-spoke-data/TinyLlama-3T-1.1bee 3 months ago
New activity in BEE-spoke-data/smol_llama-220M-GQA 3 months ago
New activity in pszemraj/pegasus-x-large-book-summary 4 months ago

Billsum Evaluation

1
#6 opened over 1 year ago by mlkorra
New activity in TuringsSolutions/NYTWritingStyleGuide 5 months ago

Rename Main to data.json

3
#1 opened 5 months ago by davanstrien
New activity in upstage/SOLAR-10.7B-v1.0 5 months ago

Data for Continued Pre-Training

6
#8 opened 6 months ago by pszemraj

Not able to test it.

2
#21 opened 6 months ago by JESUSCOLIN
New activity in pszemraj/flan-t5-large-grammar-synthesis 6 months ago

Grammar explanation

2
#14 opened 6 months ago by Ejentos
New activity in jbochi/coedit-base 6 months ago

cc-by-nc vs dataset license

2
#1 opened 6 months ago by pszemraj
New activity in BEE-spoke-data/smol_llama-101M-GQA 6 months ago

GPU used for training

1
#2 opened 6 months ago by Locutusque
New activity in jinaai/jina-embeddings-v2-base-en 6 months ago

Is there a ETA for large version?

6
#31 opened 7 months ago by kk3dmax
New activity in abidlabs/GPT-Baker 6 months ago
New activity in BEE-spoke-data/govdocs1-image 7 months ago
New activity in BEE-spoke-data/code-tutorials-en 7 months ago
New activity in google/switch-c-2048 7 months ago

Thank you

23
#9 opened 7 months ago by ehartford
New activity in BEE-spoke-data/scientificbeekeeping 7 months ago
New activity in pszemraj/midjourney-messages-cleaned 7 months ago
New activity in BEE-spoke-data/smol_llama-81M-tied 7 months ago
New activity in postbot/pythia-160m-hq-emails 7 months ago
New activity in BEE-spoke-data/smol_llama-101M-GQA 7 months ago
New activity in postbot/gpt2-medium-emailgen 7 months ago
New activity in BEE-spoke-data/code_contests_instruct 7 months ago