Instrct version models keep missing the first few letters of the answer

#43
by cooldog - opened

I just tried the Inference API on the model page, and I noticed that both 2b/7b-it appear to miss the first few letters in their answers, why is that?

example:

<start_of_turn>user
Who is Steve Jobs?<end_of_turn>
<start_of_turn>model
ure, Steve Jobs was an American entrepreneur and businessman who co-founded Apple Inc. in 1976 and revolutionized the personal computer industry with the Apple Mac and the iPhone. He was born in 1955 and passed away in 2011.
<start_of_turn>user
Then who are the other co-founders of Apple?<end_of_turn>
<start_of_turn>model
ure, Steve Wozniak and John Sculley were the other co-founders of Apple Inc.
Google org

hmm, I have not seen this before, or in others' issues, are you doing any extra postprocessing by chance?

Google org

This might be a bug on Hugging Face with the widget with the conversational models. We're releasing a new conversational widget soon!

cc @Wauplin @mishig @Xenova for visibility

cc @sbrandeis as well, who is working on it :)

hmm, I have not seen this before, or in others' issues, are you doing any extra postprocessing by chance?

No. Actually it is reproducible I assume. I tried again it still skip the "S" in "Sure".

Acutally with the default example "how does the brain work" it also happened. The first "*" was missing.

<start_of_turn>user
How does the brain work?<end_of_turn>
<start_of_turn>model
*The brain is a complex organ that plays a vital role in many aspects of human cognition and behavior.** It is made up of billions of neurons, which are specialized cells that communicate with each other through electrical and chemical signals.
Google org
edited Feb 26

This is most likely due to a prefix space NOT being added/generated, but the decoding still removing the (non-existent) prefix space. Either way, this should be fixed with the updated version of the conversational widget. Thanks for reporting!

Google org

Can you confirm the problem is fixed @cooldog ?

Can you confirm the problem is fixed @cooldog ?

Hi, was any change made to the widget? I just tried it, and it seems a bit weird.

  1. if I dont add the line breaker at the end of model, it returned as below: no line breaker added automatically, but the word "Sure" was complete.
    image.png

  2. If I add a line breaker, it returned as below: extra breaker added, and "Sure" missing the first two letters.
    image.png

I assume this can be reproduced on your browser too?

Google org

I don't see this issue locally, but I have a difficult time clearing the example box, @osanseviero is there a fix here incoming?

Google org
This comment has been hidden

Sign up or log in to comment