Show HN: Papermusic (draw an instrument, then play it) https://ift.tt/jcF6EBR

Show HN: Papermusic (draw an instrument, then play it) This was a fun experiment to try PaliGemma (open vision-language model). I found that PaliGemma performed better than Gemini Flash for this type of specific image task, especially around latency. (~0.9 seconds for PaliGemma inference on a VM, vs. 3-4 seconds for Gemini Flash.) Would love feedback on ways to potentially improve this setup. https://ift.tt/TpRLEd8 June 17, 2024 at 11:26PM

Komentar

Postingan populer dari blog ini

Twin Peaks for All: Survey Results

Show HN: Guish – A GUI for constructing and executing Unix pipelines https://ift.tt/HrXz5ub

Launch HN: Riot (YC W20) – Phishing training for your team https://ift.tt/2QIueZL