Collaboration?

#10
by dnhkng - opened

We've been waiting for that since we've heard the news! I'll make some RLHF and do some merging.

These models change sometimes, old pipelines may not work as best, so let's evaluate them first then we do the rest by upscaling it

Yeah, it will take about ~4 days to run RYS on Qwen2.5. Did you also try to fine tune RYS-Large-Base (the model based on Qwen2)?

That might be a better process than the previous Qwen2 -> Calme -> Calme-RYS -> Calme-RYS-2.4.

any chance to get a qwen 2.5 32b and 14b versions ? it would be amazing! how much gpu power need to tune those models? I have 4x3090= 96 gb vram, but I think its too litle for that tuning task.

Sign up or log in to comment