streamlit pandas torch transformers openpyxl datasets rapidfuzz scikit-learn