shareAI
/

llama3.1-8b-instruct-dpo-zh

Question Answering

Model card Files Files and versions Community

llama3.1-8b-instruct-dpo-zh / README.md

Baicai003's picture

Update README.md

b730de1 verified about 2 months ago

|

No virus

1.53 kB

	---
	license: apache-2.0
	datasets:
	- shareAI/DPO-zh-en-emoji
	language:
	- zh
	- en
	pipeline_tag: question-answering
	tags:
	- dpo
	- llama3.1
	- llama3
	- chat
	---
	# llama3.1-8b-instruct 中文DPO版
	### 模型介绍
	像原版instruct一样，喜欢用有趣中文和表情符号回答问题。
	Github：https://github.com/CrazyBoyM/llama3-Chinese-chat

	特点：偏好中文和emoji表情，且不损伤原instruct版模型能力。实测中文DPO版问答性能体验超过现在市面上任何llama3.1中文微调版（微调会大面积破坏llama3.1原版能力，导致遗忘）
	![Alt text](https://modelscope.cn/api/v1/models/baicai003/Llama3-Chinese-instruct-DPO-beta0.5/repo?Revision=master&FilePath=image.png&View=true)

	### 训练细节
	DPO(beta 0.5) + lora rank128, alpha256 + 打开"lm_head", "input_layernorm", "post_attention_layernorm", "norm"层训练.

	### 模型部署
	网页脚本文件：https://github.com/CrazyBoyM/llama3-Chinese-chat/blob/main/deploy/web_streamlit_for_instruct_v2.py
	已经在模型文件夹里内置了一份web.py，可以直接使用。
	```
	pip install streamlit
	pip install transformers==4.40.1
	streamlit run web.py ./llama3.1-8b-instruct-dpo-zh
	```

	### 模型下载
	SDK下载
	```bash
	#安装ModelScope
	pip install modelscope
	```
	```python
	#SDK模型下载
	from modelscope import snapshot_download
	model_dir = snapshot_download('shareAI/llama3.1-8b-instruct-dpo-zh')
	```
	Git下载
	```
	#Git模型下载
	git clone https://www.modelscope.cn/shareAI/llama3.1-8b-instruct-dpo-zh.git
	```