Sample code using HF

#29
by vanshils - opened

Would it be possible to provide a sample code for inference of chat completion request using AutoModelForCausalLM and AutoTokenizer which gives same behaviour as mistral-chat?

Thanks for updating the model card @ybelkada .
If possible could you please update the example/or provide a new one with usage of chat template. Currently the template is a little bit hard to find as we have to dive in mistral-common codebase to see how they perform encode_chat_completion_request.
Once again thanks for the example.

Sign up or log in to comment