Collaboration?

#10

by dnhkng - opened 7 days ago

Discussion

dnhkng

7 days ago

https://x.com/zhouwenmeng/status/1834899729165304198

Qwen2.5 + RYS + Calme?

MaziyarPanahi

Owner 6 days ago

We've been waiting for that since we've heard the news! I'll make some RLHF and do some merging.

These models change sometimes, old pipelines may not work as best, so let's evaluate them first then we do the rest by upscaling it

dnhkng

6 days ago

Yeah, it will take about ~4 days to run RYS on Qwen2.5. Did you also try to fine tune RYS-Large-Base (the model based on Qwen2)?

That might be a better process than the previous Qwen2 -> Calme -> Calme-RYS -> Calme-RYS-2.4.

prudant

about 7 hours ago

any chance to get a qwen 2.5 32b and 14b versions ? it would be amazing! how much gpu power need to tune those models? I have 4x3090= 96 gb vram, but I think its too litle for that tuning task.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment