Gustavo de Rosa

#64 opened 5 months ago by

philschmid

New activity in microsoft/Phi-3-mini-128k-instruct 4 months ago

Add attention_bias to make TGI work

#68 opened 5 months ago by

philschmid

New activity in microsoft/Phi-3-small-128k-instruct 5 months ago

Upload 3 files

#1 opened 5 months ago by

bapatra

New activity in microsoft/Phi-3-small-8k-instruct 5 months ago

Upload 3 files

#1 opened 5 months ago by

bapatra

New activity in microsoft/Phi-3-mini-128k-instruct-onnx 5 months ago

error in tokenizer

#5 opened 5 months ago by

Algaliarept

New activity in microsoft/Phi-3-mini-128k-instruct 5 months ago

Upload 25 files

#59 opened 5 months ago by

jf8441

New activity in microsoft/Phi-3-mini-4k-instruct 5 months ago

Fine-tuning is not improving the domain knowledge? it is very complicated, could you help?

5

#50 opened 5 months ago by

aaditya

New activity in microsoft/Phi-3-mini-128k-instruct 5 months ago

Model can't stop from explaining itself

14

#53 opened 5 months ago by

fedeparra

Pipeline tokenizer does not get initialized when loading from string

#50 opened 5 months ago by

connorboyle

model destroyed by deleting tokens?

#52 opened 5 months ago by

NickyNicky

Phi-3 instruct model continuously repeating the answer

#55 opened 5 months ago by

bitmman-nch

Get target_modules

#56 opened 5 months ago by

danny3

New activity in microsoft/Phi-3-mini-4k-instruct 5 months ago

System prompts ignored in chat completions

18

#51 opened 5 months ago by

joshuaturner

Phi-3 support for GPU training with MPS on Mac

#52 opened 5 months ago by

chaishirin

Leaking Training Data

3

#40 opened 5 months ago by

qunitindk

New activity in microsoft/Phi-3-mini-128k-instruct 5 months ago

fix(tokenizer_config): Adjusts `rstrip` of special tokens.

#54 opened 5 months ago by

gugarosa

New activity in microsoft/Phi-3-mini-4k-instruct 5 months ago

fix(tokenizer_config): Adjusts `rstrip` of special tokens.

#53 opened 5 months ago by

gugarosa

Instruct model needs work

8

#35 opened 5 months ago by

Nafnlaus

New activity in microsoft/Phi-3-mini-128k-instruct 5 months ago

gguf

30

#24 opened 6 months ago by

LaferriereJC

Very good model for its size (7b and 14b are gonna be amazing)

4

#43 opened 5 months ago by

rombodawg

Lmsys Leaderboard Update! Impressive MMLU Scores

#44 opened 5 months ago by

saishf

Probably Memory Leak issue？

#46 opened 5 months ago by

Tieni

clarification on the usage of `short_factor` and `long_factor`?

#49 opened 5 months ago by

J22

Could you aid in getting Llama.cpp to support the 128K version?

#48 opened 5 months ago by

BoscoTheDog

Tokenizer

#23 opened 6 months ago by

vschindepalli

New activity in microsoft/Phi-3-mini-4k-instruct 5 months ago

the phi3 model not learning from training data using orpo.

3

#44 opened 5 months ago by

Imran1

is the '\n' after `'<|end|>'`?

#43 opened 5 months ago by

J22

Cant download the model in databricks

#36 opened 5 months ago by

Buchhibabu

ImportError: This modeling file requires the following packages that were not found in your environment: flash_attn. Run `pip install flash_attn`

#48 opened 5 months ago by

Sulilili

Loading SPM tokenizer shows 32000 vocab size instead of 32064

4

#47 opened 5 months ago by

jrc

How to configure config.json properly to suppress noise?

#49 opened 5 months ago by

don412

New activity in microsoft/Phi-3-mini-4k-instruct-gguf 5 months ago

New gguf chat template ignores the 'system' role when preparing chat completions

3

#11 opened 5 months ago by

joshuaturner

Model breaks if I use word "unani".

#9 opened 5 months ago by

mdmujtabaraza

More quants?

4

#5 opened 6 months ago by

MoonRide

Can't reproduce

5

#3 opened 6 months ago by

ayyylol

New activity in microsoft/phi-1_5 5 months ago

Something broken on last update

7

#85 opened 6 months ago by

Nayjest

New activity in microsoft/Phi-3-mini-128k-instruct 5 months ago

No base model?

#45 opened 5 months ago by

ucalyptus

Unable to deploy this model on databricks due to transformers version

#47 opened 5 months ago by

enkefalos

New activity in microsoft/Phi-3-mini-4k-instruct-onnx 5 months ago

What is the difference between "cpu-int4-rtn-block-32-acc-level-4" and "cpu-int4-rtn-block-32"?

#3 opened 5 months ago by

Zhubarb

New activity in microsoft/Phi-3-mini-4k-instruct 5 months ago

Is the system message irrelevant for Phi3? I don't see any system message in the chat template of tokenizer_config.json.

#46 opened 5 months ago by

Snehith749

New activity in microsoft/Phi-3-mini-4k-instruct-gguf 5 months ago

general.architecture = 'llama' in .gguf metadata

#6 opened 6 months ago by

mattjcly

New activity in microsoft/Phi-3-mini-128k-instruct 5 months ago

Adding Evaluation Results

#41 opened 5 months ago by

leaderboard-pr-bot

Update sample_finetune.py

#42 opened 5 months ago by

wwwaj

New activity in microsoft/Phi-3-mini-4k-instruct 5 months ago

Adding Evaluation Results

#41 opened 5 months ago by

leaderboard-pr-bot

Update sample_finetune.py

#42 opened 5 months ago by

wwwaj

is phi-3 foundation model was released or only instruct fine tuned models relasesd

#37 opened 5 months ago by

naveen237

Update tokenizer_config.json

#39 opened 5 months ago by

mrm8488

Update config.json

#38 opened 5 months ago by

mrm8488

[AUTOMATED] Model Memory Requirements

#33 opened 5 months ago by

[AUTOMATED] Model Memory Requirements

#32 opened 5 months ago by

[AUTOMATED] Model Memory Requirements

#31 opened 5 months ago by

[AUTOMATED] Model Memory Requirements

#30 opened 5 months ago by

[AUTOMATED] Model Memory Requirements

#29 opened 6 months ago by

[AUTOMATED] Model Memory Requirements

#28 opened 6 months ago by

Why is EOS token added at the end of chat template?

#26 opened 6 months ago by

mspanrin

Is sliding window used or not?

#25 opened 6 months ago by

J22

你好呀

#24 opened 6 months ago by

dddfhrs

[AUTOMATED] Model Memory Requirements

#23 opened 6 months ago by

[AUTOMATED] Model Memory Requirements

#22 opened 6 months ago by

[AUTOMATED] Model Memory Requirements

#21 opened 6 months ago by