Show HN: GEITje-7B – A New Large Open Dutch Language Model https://ift.tt/W1HsYdy

Show HN: GEITje-7B – A New Large Open Dutch Language Model Tried my hand at training a large open Dutch language model, for which the pickings are slim right now. GEITje is a large open Dutch language model with 7 billion parameters, based on Mistral 7B. I've continued pretraining it on 10 billion tokens of Dutch text. This has improved its Dutch language skills and increased its knowledge of Dutch topics. There's an experimental chat-finetuned model too, called GEITje-chat. I have to say the experience of gathering a dataset and training a model has been very educational for me. Being forced to deal with every detail yourself really deepens your understanding of a subject. It's all in the hands of the community now. Can't wait to see what they do with it! Want to try it out? There's a demo live at Hugging Face Spaces right now: https://ift.tt/QTVf1LY https://ift.tt/rXDldwB December 15, 2023 at 11:55PM

Komentar

Postingan populer dari blog ini

Show HN: Interactive exercises for GNU grep, sed and awk https://ift.tt/OxeFwah

Show HN: My Book Bulletproof TLS and PKI (Second Edition) Is Out https://ift.tt/5PZ9mxF