Loading the model without any webUI

#5
by MrGobbs - opened

I want to use the model in Python code with PyTorch. I don't want a web UI, just the plain old command terminal. How can I do that?

Cognitive Computations org

llama.cpp or Python

Cognitive Computations org

I have a little guide for Vicuna here; you can do the same with my models, using the GGML files that TheBloke published.

https://erichartford.com/vicuna

Just import transformers, load the model with the settings you want (or just don't do sampling), and you're good to go.
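A minimal sketch of what that looks like with plain transformers and no web UI. The model name is a placeholder for whichever repo or local path you downloaded, and the Vicuna-style prompt format follows the guide linked above; heavy imports are kept inside the function so the sketch stays importable without them.

```python
def build_prompt(user_msg: str) -> str:
    """Vicuna-style prompt, per the guide linked above."""
    return f"USER: {user_msg}\nASSISTANT:"

def generate(prompt: str, model_name: str = "your-model-repo-or-path") -> str:
    """Load the model and run greedy (non-sampling) generation."""
    # Imported lazily so the file can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    # do_sample=False = greedy decoding, i.e. "just don't do sampling"
    out = model.generate(**inputs, max_new_tokens=128, do_sample=False)
    return tokenizer.decode(out[0], skip_special_tokens=True)

# Example (requires the model to be downloaded first):
# print(generate(build_prompt("Hello!")))
```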

Can I use this from Python with llama.cpp (llama-cpp-python)?
So would this be correct:
LLM = Llama(MODEL, verbose=False, n_ctx=2048)
with MODEL replaced by the quantized bin file,
and n_ctx=161984? Or is there anything else that needs to be done?
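For reference, a hedged sketch of that call with llama-cpp-python. The model path is a placeholder for your quantized bin file, and note that n_ctx is normally set to the model's trained context window (2048 for LLaMA-1-family models), not an arbitrarily large value like 161984.

```python
def llama_kwargs(model_path: str, n_ctx: int = 2048) -> dict:
    """Keyword arguments for llama_cpp.Llama.

    n_ctx should match the model's trained context window
    (2048 for LLaMA-1-family models).
    """
    return {"model_path": model_path, "verbose": False, "n_ctx": n_ctx}

def run(prompt: str, model_path: str) -> str:
    """Load the quantized model and return a completion."""
    # Imported lazily; install with: pip install llama-cpp-python
    from llama_cpp import Llama

    llm = Llama(**llama_kwargs(model_path))
    out = llm(prompt, max_tokens=64)
    return out["choices"][0]["text"]

# Example (model_path is a placeholder for your quantized file):
# print(run("USER: Hello!\nASSISTANT:", "path/to/quantized-model.bin"))
```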
