Falcon models slow inference

10
#59 opened about 1 year ago by mikeytrw

I need an API of Falcon

8
#56 opened about 1 year ago by JustMe4Real

Extracting attention maps

#49 opened about 1 year ago by roeehendel

Fix the kv-cache dimensions

1
#47 opened about 1 year ago by cchudant

Multi GPU inference issue

1
#39 opened over 1 year ago by eastwind

Fine-tuning on a new language

4
#35 opened over 1 year ago by AliMirlou

Flash attention

2
#34 opened over 1 year ago by utensil

About evaluating on HumanEval

#33 opened over 1 year ago by dongZheX

Finetune on "uncensored" dataset?

1
#32 opened over 1 year ago by sivarajan

Tokenizer Details

#31 opened over 1 year ago by kye

Import dataset and chat with it

2
#27 opened over 1 year ago by phdykd

Request: DOI

#16 opened over 1 year ago by Huanghai

Finetune with QLoRA please

7
#14 opened over 1 year ago by supercharge19

[Bug] Does not work

58
#3 opened over 1 year ago by catid