Show HN: DistilKitPlus, a distillation framework between any LLMs https://ift.tt/HaM2s5D

Show HN: DistilKitPlus, a distillation framework between any LLMs Over the past few months, I have built a distillation toolkit that supports cross-tokenizer distillation (e.g., distilling from LLaMA to Qwen vocab, or others). This approach has worked well on reasoning datasets like AIME, and we’ve validated on models like Phi and Qwen. We’ve also integrated Modal for quick deployment (with $30/month credits to try it out). Would love any feedback! GitHub: https://ift.tt/81wFjdG Docs: https://ift.tt/u3RQGY4 https://ift.tt/81wFjdG May 5, 2025 at 11:12PM

Komentar

Postingan populer dari blog ini

Show HN: Guish – A GUI for constructing and executing Unix pipelines https://ift.tt/HrXz5ub

Twin Peaks for All: Survey Results

Launch HN: Stacker (YC S20) – Create Apps from Airtable or Google Sheets https://ift.tt/3i3ZJso