Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
THUDM
/
glm-4-9b-chat
like
479
Transformers
Safetensors
Chinese
English
chatglm
glm
thudm
custom_code
arxiv:
2406.12793
License:
glm-4 (other)
Model card
Files
Files and versions
Community
68
Train
Use this model
flash_attention_2
#51
by
zxdu20
- opened
28 days ago
base:
refs/heads/main
←
from:
refs/pr/51
Discussion
Files changed
+345
-232
Add eager and sdpa attention implementations
835c7179
Add support for flash attention 2
a7eaddd0
Merge branch 'main' into attention
29038ea1
zxdu20
Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University org
28 days ago
No description provided.
zxdu20
changed pull request status to
open
28 days ago
zxdu20
changed pull request status to
merged
28 days ago
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment