Update README.md
README.md
CHANGED
@@ -21,6 +21,8 @@ This is NanoLM-70M-Instruct-v1. The model currently supports **English only**.
 
 The tokenizer and model architecture of NanoLM-70M-Instruct-v1 are the same as [SmolLM-135M](https://huggingface.co/HuggingFaceTB/SmolLM-135M), but the number of layers has been reduced from 30 to 12.
 
+Essentially, it is a pure LLaMA architecture, specifically LlamaForCausalLM.
+
 As a result, NanoLM-70M-Instruct-v1 has only 70 million parameters.
 
 Despite this, NanoLM-70M-Instruct-v1 still demonstrates instruction-following capabilities.
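
The claim in this hunk, that cutting a SmolLM-135M-shaped LLaMA model from 30 layers to 12 leaves roughly 70 million parameters, can be checked with a little arithmetic. The sketch below is not taken from the README. It assumes the dimensions of the published SmolLM-135M configuration (hidden size 576, MLP size 1536, 9 attention heads, 3 key/value heads, vocabulary 49152, tied input/output embeddings), none of which appear in this diff, and the helper name `llama_param_count` is hypothetical.

```python
# Rough parameter count for a LLaMA-style decoder (LlamaForCausalLM layout).
# ASSUMPTION: default dimensions mirror the SmolLM-135M config; they are not
# stated in the diff itself and should be checked against the actual config.json.

def llama_param_count(
    num_layers: int,
    hidden: int = 576,
    intermediate: int = 1536,
    heads: int = 9,
    kv_heads: int = 3,
    vocab: int = 49152,
    tie_embeddings: bool = True,
) -> int:
    head_dim = hidden // heads
    kv_dim = kv_heads * head_dim  # grouped-query attention shrinks K and V

    # Attention: q_proj and o_proj are hidden x hidden, k_proj and v_proj are hidden x kv_dim.
    attn = 2 * hidden * hidden + 2 * hidden * kv_dim
    # Gated MLP: gate_proj, up_proj, down_proj (no biases).
    mlp = 3 * hidden * intermediate
    # Two RMSNorm weight vectors per layer.
    norms = 2 * hidden
    per_layer = attn + mlp + norms

    embed = vocab * hidden
    lm_head = 0 if tie_embeddings else vocab * hidden
    final_norm = hidden
    return embed + num_layers * per_layer + final_norm + lm_head


if __name__ == "__main__":
    for layers in (30, 12):
        print(f"{layers} layers: {llama_param_count(layers):,} parameters")
    # 30 layers: 134,515,008 parameters
    # 12 layers: 70,793,280 parameters
```

Under these assumptions, the embedding table alone accounts for about 28 million of the 70 million parameters, which is why dropping 60% of the layers removes only about half of the total.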