File size: 2,022 Bytes
13e5966
baffbec
 
 
 
 
 
 
 
13e5966
baffbec
13e5966
baffbec
13e5966
055922a
 
13e5966
baffbec
13e5966
baffbec
 
13e5966
baffbec
 
 
13e5966
baffbec
 
 
 
13e5966
baffbec
 
 
 
 
 
13e5966
baffbec
055922a
baffbec
 
 
 
 
 
 
13e5966
baffbec
13e5966
baffbec
 
 
 
 
 
13e5966
baffbec
13e5966
baffbec
13e5966
baffbec
13e5966
baffbec
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
---
license: apache-2.0
language:
- ja
- en
tags:
- japanese
- causal-lm
inference: false
---
# CyberAgentLM3-22B-Chat (CALM3-22B-Chat)

## Model Description

CyberAgentLM3 is a decoder-only language model pre-trained on 2.0 trillion tokens from scratch.  
CyberAgentLM3-Chat is a fine-tuned model specialized for dialogue use cases.

## Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer

model = AutoModelForCausalLM.from_pretrained("cyberagent/calm3-22b-chat", device_map="auto", torch_dtype="auto")
tokenizer = AutoTokenizer.from_pretrained("cyberagent/calm3-22b-chat")
streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)

messages = [
    {"role": "system", "content": "あなたは親切なAIアシスタントです。"},
    {"role": "user", "content": "AIによって私たちの暮らしはどのように変わりますか?"}
]

input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
output_ids = model.generate(input_ids,
                            max_new_tokens=1024,
                            temperature=0.5,
                            streamer=streamer)
```

## Prompt Format
CALM3-Chat uses ChatML as the prompt format.
```
<|im_start|>system
あなたは親切なAIアシスタントです。<|im_end|>
<|im_start|>user
AIによって私たちの暮らしはどのように変わりますか?<|im_end|>
<|im_start|>assistant
```

## Model Details

* **Model size**: 22B
* **Context length**: 16384
* **Model type**: Transformer-based Language Model
* **Language(s)**: Japanese, English
* **Developed by**: [CyberAgent, Inc.](https://www.cyberagent.co.jp/)
* **License**: Apache-2.0

## Author

[Ryosuke Ishigami](https://huggingface.co/rishigami)

## How to cite

```tex
@misc{cyberagent-calm3-22b-chat,
      title={cyberagent/calm3-22b-chat},
      url={https://huggingface.co/cyberagent/calm3-22b-chat},
      author={Ryosuke Ishigami},
      year={2024},
}
```