Text Generation
Transformers
PyTorch
Chinese
bloom
text-generation-inference
Inference Endpoints

Why can't the model be called via the Inference API?

#1
by tianhaotiger - opened

Could not load model xyz-nlp/XuanYuan2.0 with any of the following classes: (<class 'transformers.models.bloom.modeling_bloom.BloomForCausalLM'>,).

transformers version 4.29.1

The Compute widget on the right side of the page doesn't work either. Could not load model xyz-nlp/XuanYuan2.0 with any of the following classes: (<class 'transformers.models.bloom.modeling_bloom.BloomForCausalLM'>,).

What is max_tokens per conversation? How recent is the latest training data? Does it include real-time financial market data?

Could not load model xyz-nlp/XuanYuan2.0 with any of the following classes: (<class 'transformers.models.bloom.modeling_bloom.BloomForCausalLM'>,).

This is now supported, but it takes a very long time to run on the hosted CPU, and you may see the following message: The model xyz-nlp/XuanYuan2.0 is too large to be loaded automatically (352GB > 10GB). For commercial use please use PRO spaces (https://huggingface.co/spaces) or Inference Endpoints (https://huggingface.co/inference-endpoints).

xyz-nlp changed discussion status to closed

Tried it and got nothing.

Did you run into a problem? If so, please report it so we can look into it.

How do I get access to an API?

Please download the model and use it locally; there is currently no API available to call.
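
For reference, a minimal sketch of loading the downloaded checkpoint locally with transformers (the local path, dtype, and device settings below are illustrative assumptions, not official guidance from the model authors; loading the full 352GB checkpoint requires substantial memory, and device_map="auto" needs the accelerate package installed):

```python
# Minimal sketch: load a locally downloaded copy of the BLOOM-based
# xyz-nlp/XuanYuan2.0 checkpoint and run a short generation.
# Assumes the weights have already been downloaded to ./XuanYuan2.0
# and that enough GPU/CPU memory is available.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_path = "./XuanYuan2.0"  # local path to the downloaded weights (assumption)

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.float16,  # half precision to reduce memory use
    device_map="auto",          # spread layers across available devices (needs accelerate)
)

prompt = "问题:什么是货币政策?\n回答:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```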