Show HN: GEITje-7B – A New Large Open Dutch Language Model https://ift.tt/W1HsYdy

Tried my hand at training a large open Dutch language model, for which the pickings are slim right now.

GEITje is a large open Dutch language model with 7 billion parameters, based on Mistral 7B. I've continued pretraining it on 10 billion tokens of Dutch text. This has improved its Dutch language skills and increased its knowledge of Dutch topics. There's an experimental chat-finetuned model too, called GEITje-chat.

I have to say the experience of gathering a dataset and training a model has been very educational for me. Being forced to deal with every detail yourself really deepens your understanding of a subject.

It's all in the hands of the community now. Can't wait to see what they do with it!

Want to try it out? There's a demo live at Hugging Face Spaces right now: https://ift.tt/QTVf1LY

https://ift.tt/rXDldwB

December 15, 2023 at 11:55PM
