tokenizer gives unexpected results
#23 opened 1 day ago by jpiabrantes
Does it support a system prompt?
#22 opened 2 days ago by lucasjin
Upload triton_flash_blocksparse_attn.py
#21 opened 3 days ago by barcelosallan
"TypeError: phi3 isn't supported yet ". Could not quantize the phi3 model using the AWQ quantization method.
2
#20 opened 11 days ago
by
rabinghimire737
Device mismatch in the multi-GPU case while finetuning.
3 comments · #19 opened 18 days ago by Satandon1999
CheckpointError in `triton_flash_blocksparse_attn.py` while finetuning
1 comment · #18 opened 19 days ago by FremyCompany
Fix for FlashAttention RuntimeError and Triton multi-GPU issue.
1 comment · #17 opened 26 days ago by Satandon1999
Out of resource: shared memory
1 comment · #16 opened 28 days ago by iszhaoxin
RuntimeError: FlashAttention only support fp16 and bf16 data type
1 comment · #15 opened 30 days ago by Satandon1999
Adding sample_finetune.py to the small model
1 comment · #9 opened about 1 month ago by hackint0sh
The attention mask and the pad token id were not set (a minimal workaround sketch follows this list)
#8 opened about 1 month ago by hamidpalangi
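
The last entry refers to the standard `transformers` generation warning. A minimal sketch of the usual remedy, assuming a stock `transformers` causal LM (the checkpoint name below is a placeholder, not necessarily this repo's): pass the tokenizer's `attention_mask` through to `generate` and set `pad_token_id` explicitly.

```python
# Minimal sketch: avoiding "The attention mask and the pad token id were not set".
# Assumption: a stock transformers causal LM; the checkpoint name is a placeholder.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "microsoft/Phi-3-mini-4k-instruct"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)

# Tokenizing with return_tensors="pt" yields both input_ids and attention_mask.
inputs = tokenizer("Hello, world!", return_tensors="pt")

outputs = model.generate(
    **inputs,                             # forwards input_ids AND attention_mask
    pad_token_id=tokenizer.eos_token_id,  # explicit pad id silences the warning
    max_new_tokens=32,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```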