Hacker News

Cool idea! You mentioned the model struggling with Chinese a bit. Have you tried any Chinese models, e.g. DeepSeek or GLM? I imagine they probably have a lot more Chinese in the pretraining. (And their English is certainly fine too!)




I have personally had success using Kimi for Chinese creative writing, working from the same assumption: that Moonshot, as a Chinese company, has more/better Mandarin-language pretraining data.


