Show HN: VRE Dataset generation for MultiTask vision models training from videos https://ift.tt/KAX0gWD

Oktober 09, 2024

Show HN: VRE Dataset generation for MultiTask vision models training from videos Been working on this tool for my PhD which involves training multi task vision models using various pre-trained models as inputs or pseudolabels in order to improve generalization. I work mostly on UAV datasets, but it should work okay on indoor scenes or self driving (at least Marigold and Mask2Former). For example, this dataset was generated using this tool: https://ift.tt/jR0qwBu I'm quite aggressively trying to "just get the nn.Module" from the public repos that other researchers put up in their overly convoluted frameworks. A simple `forward(rgb_input: torch.Tensor) -> torch.Tensor` is nice, having 100 imports from a generic framework that has versions incompatibilities with everything else is not. PS: most mains are standalone runnable too, i.e. - https://ift.tt/cwXFySR... or - https://ift.tt/cwXFySR... https://ift.tt/XSyUsqA October 10, 2024 at 12:39AM

Cari Blog Ini

BlogViral

Show HN: VRE Dataset generation for MultiTask vision models training from videos https://ift.tt/KAX0gWD

Komentar

Posting Komentar

Postingan populer dari blog ini

Show HN: Guish – A GUI for constructing and executing Unix pipelines https://ift.tt/HrXz5ub

Launch HN: Wide Open School https://ift.tt/2WY1nob

Launch HN: PillarPlus (YC W20) – Automatically create construction blueprints https://ift.tt/2yet5m3