handraise-dev/gguf-inference
Text Generation
multilingual
nlp
code
Inference Endpoints
License: mit
Branch: main
Commit History
001e799 | change handler | syberWolf | committed on Jul 4
96a0103 | simplify handler | syberWolf | committed on Jul 4
6d33a5e | changes for testin | syberWolf | committed on Jul 4
e126c73 | update endpoint | syberWolf | committed on Jul 4
52afa01 | forgot comma | syberWolf | committed on Jul 4
eff3ac4 | update handler | syberWolf | committed on Jul 4
e8628b3 | fix bug | syberWolf | committed on Jul 4
1e592e3 | try to actually use the GPU | syberWolf | committed on Jul 4
b47e2d8 | test qwen | syberWolf | committed on Jul 4
5d540d6 | fix handler bug | syberWolf | committed on Jul 4
7e91a22 | updates for phi | syberWolf | committed on Jul 4
f96aa72 | update handler and add flash attention | syberWolf | committed on Jul 4
c676380 | update device map | syberWolf | committed on Jul 4
c1cb360 | push updates to handler | syberWolf | committed on Jul 4
53b72cd | add flash-attn | syberWolf | committed on Jun 19
1ac35d8 | fix reqs | syberWolf | committed on Jun 19
1aeb34c | load new phi | syberWolf | committed on Jun 19
f345890 | updates with cls | syberWolf | committed on Jun 19
e870084 | revert some sstuff | syberWolf | committed on Jun 19
ca5a36d | fix bug | syberWolf | committed on Jun 19
a60a771 | updates to handler again | syberWolf | committed on Jun 19
b3aebd1 | dd handler change | syberWolf | committed on Jun 19
c145e37 | update reqs | syberWolf | committed on Jun 19
87f7af9 | fix | syberWolf | committed on Jun 19
ef898a1 | undo reqs and fix handler | syberWolf | committed on Jun 19
ef4c960 | change reqs | syberWolf | committed on Jun 19
17c9dd0 | update handler | syberWolf | committed on Jun 18
fe371ad | add handler | syberWolf | committed on Jun 18
5e521f7 | llamacpp quants | syberWolf | committed on Jun 18
e8149d7 | initial commit | handraise-dev | committed on Jun 18 | verified