Gustavo de Rosa
gugarosa
AI & ML interests
None yet
Organizations
gugarosa's activity
Add attention_bias to make TGI work
1
#64 opened 5 months ago
by
philschmid
Add attention_bias to make TGI work
1
#68 opened 5 months ago
by
philschmid
Upload 3 files
#1 opened 5 months ago
by
bapatra
Upload 3 files
#1 opened 5 months ago
by
bapatra
error in tokenizer
1
#5 opened 5 months ago
by
Algaliarept
Upload 25 files
#59 opened 5 months ago
by
jf8441
Fine-tuning is not improving the domain knowledge? it is very complicated, could you help?
5
#50 opened 5 months ago
by
aaditya
Model can't stop from explaining itself
14
#53 opened 5 months ago
by
fedeparra
Pipeline tokenizer does not get initialized when loading from string
1
#50 opened 5 months ago
by
connorboyle
model destroyed by deleting tokens?
1
#52 opened 5 months ago
by
NickyNicky
Phi-3 instruct model continuously repeating the answer
2
#55 opened 5 months ago
by
bitmman-nch
Get target_modules
1
#56 opened 5 months ago
by
danny3
System prompts ignored in chat completions
18
#51 opened 5 months ago
by
joshuaturner
Phi-3 support for GPU training with MPS on Mac
1
#52 opened 5 months ago
by
chaishirin
Leaking Training Data
3
#40 opened 5 months ago
by
qunitindk
fix(tokenizer_config): Adjusts `rstrip` of special tokens.
#54 opened 5 months ago
by
gugarosa
fix(tokenizer_config): Adjusts `rstrip` of special tokens.
#53 opened 5 months ago
by
gugarosa
Instruct model needs work
8
#35 opened 5 months ago
by
Nafnlaus
gguf
30
#24 opened 6 months ago
by
LaferriereJC
Very good model for its size (7b and 14b are gonna be amazing)
4
#43 opened 5 months ago
by
rombodawg
Lmsys Leaderboard Update! Impressive MMLU Scores
1
#44 opened 5 months ago
by
saishf
Probably Memory Leak issue?
2
#46 opened 5 months ago
by
Tieni
clarification on the usage of `short_factor` and `long_factor`?
1
#49 opened 5 months ago
by
J22
Could you aid in getting Llama.cpp to support the 128K version?
1
#48 opened 5 months ago
by
BoscoTheDog
Tokenizer
1
#23 opened 6 months ago
by
vschindepalli
the phi3 model not learning from training data using orpo.
3
#44 opened 5 months ago
by
Imran1
is the '\n' after `'<|end|>'`?
1
#43 opened 5 months ago
by
J22
Cant download the model in databricks
1
#36 opened 5 months ago
by
Buchhibabu
ImportError: This modeling file requires the following packages that were not found in your environment: flash_attn. Run `pip install flash_attn`
1
#48 opened 5 months ago
by
Sulilili
Loading SPM tokenizer shows 32000 vocab size instead of 32064
4
#47 opened 5 months ago
by
jrc
How to configure config.json properly to suppress noise?
2
#49 opened 5 months ago
by
don412
New gguf chat template ignores the 'system' role when preparing chat completions
3
#11 opened 5 months ago
by
joshuaturner
Model breaks if I use word "unani".
2
#9 opened 5 months ago
by
mdmujtabaraza
More quants?
4
#5 opened 6 months ago
by
MoonRide
Can't reproduce
5
#3 opened 6 months ago
by
ayyylol
Something broken on last update
7
#85 opened 6 months ago
by
Nayjest
No base model?
2
#45 opened 5 months ago
by
ucalyptus
Unable to deploy this model on databricks due to transformers version
1
#47 opened 5 months ago
by
enkefalos
What is the difference between "cpu-int4-rtn-block-32-acc-level-4" and "cpu-int4-rtn-block-32"?
1
#3 opened 5 months ago
by
Zhubarb
general.architecture = 'llama' in .gguf metadata
1
#6 opened 6 months ago
by
mattjcly
Adding Evaluation Results
#41 opened 5 months ago
by
leaderboard-pr-bot
Update sample_finetune.py
#42 opened 5 months ago
by
wwwaj
Adding Evaluation Results
#41 opened 5 months ago
by
leaderboard-pr-bot
Update sample_finetune.py
#42 opened 5 months ago
by
wwwaj
is phi-3 foundation model was released or only instruct fine tuned models relasesd
1
#37 opened 5 months ago
by
naveen237
Update tokenizer_config.json
2
#39 opened 5 months ago
by
mrm8488
Update config.json
1
#38 opened 5 months ago
by
mrm8488
[AUTOMATED] Model Memory Requirements
#33 opened 5 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#32 opened 5 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#31 opened 5 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#30 opened 5 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#29 opened 6 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#28 opened 6 months ago
by
model-sizer-bot
Why is EOS token added at the end of chat template?
2
#26 opened 6 months ago
by
mspanrin
Is sliding window used or not?
1
#25 opened 6 months ago
by
J22
[AUTOMATED] Model Memory Requirements
#23 opened 6 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#22 opened 6 months ago
by
model-sizer-bot
[AUTOMATED] Model Memory Requirements
#21 opened 6 months ago
by
model-sizer-bot