Mxode commited on
Commit
2ee8f54
1 Parent(s): cce4818

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -21,6 +21,8 @@ This is NanoLM-70M-Instruct-v1. The model currently supports **English only**.
21
 
22
  The tokenizer and model architecture of NanoLM-70M-Instruct-v1 are the same as [SmolLM-135M](https://huggingface.co/HuggingFaceTB/SmolLM-135M), but the number of layers has been reduced from 30 to 12.
23
 
 
 
24
  As a result, NanoLM-70M-Instruct-v1 has only 70 million parameters.
25
 
26
  Despite this, NanoLM-70M-Instruct-v1 still demonstrates instruction-following capabilities.
 
21
 
22
  The tokenizer and model architecture of NanoLM-70M-Instruct-v1 are the same as [SmolLM-135M](https://huggingface.co/HuggingFaceTB/SmolLM-135M), but the number of layers has been reduced from 30 to 12.
23
 
24
+ Essentially, it is a pure LLaMA architecture, specifically LlamaForCausalLM.
25
+
26
  As a result, NanoLM-70M-Instruct-v1 has only 70 million parameters.
27
 
28
  Despite this, NanoLM-70M-Instruct-v1 still demonstrates instruction-following capabilities.