Falcon models slow inference

10
#59 opened about 1 year ago by mikeytrw

I need an API of Falcon

8
#56 opened about 1 year ago by JustMe4Real

Extracting attention maps

#49 opened about 1 year ago by roeehendel

Fix the kv-cache dimensions

1
#47 opened about 1 year ago by cchudant

Multi GPU inference issue

1
#39 opened about 1 year ago by eastwind

Fine-tuning on a new language

4
#35 opened about 1 year ago by AliMirlou

Flash attention

2
#34 opened about 1 year ago by utensil

About evaluating on HumanEval

#33 opened about 1 year ago by dongZheX

Finetune on "uncensored" dataset?

1
#32 opened about 1 year ago by sivarajan

Tokenizer Details

#31 opened about 1 year ago by kye

Import dataset and chat with it

2
#27 opened about 1 year ago by phdykd

Request: DOI

#16 opened about 1 year ago by Huanghai

Finetune with QLoRA please

7
#14 opened about 1 year ago by supercharge19

How to set trust_remote_code to true?

13
#9 opened about 1 year ago by gmjolt

[Bug] Does not work

58
#3 opened about 1 year ago by catid