DeepSeek-R1-Distill-Qwen-14B
A model based on Qwen 14B and fine-tuned on outputs from DeepSeek’s R1 model, published as part of the DeepSeek R1 release.