torch
gradio
transformers
sentence-transformers
qdrant-client
llama-cpp-python
einops