Postingan

Show HN: Understudy – Teach a desktop agent by demonstrating a task once https://ift.tt/tVbvoLO

Show HN: Understudy – Teach a desktop agent by demonstrating a task once I built Understudy because a lot of real work still spans native desktop apps, browser tabs, terminals, and chat tools. Most current agents live in only one of those surfaces. Understudy is a local-first desktop agent runtime that can operate GUI apps, browsers, shell tools, files, and messaging in one session. The part I'm most interested in feedback on is teach-by-demonstration: you do a task once, the agent records screen video + semantic events, extracts the intent rather than coordinates, and turns it into a reusable skill. Demo video: https://www.youtube.com/watch?v=3d5cRGnlb_0 In the demo I teach it: Google Image search -> download a photo -> remove background in Pixelmator Pro -> export -> send via Telegram. Then I ask it to do the same for Elon Musk. The replay isn't a brittle macro: the published skill stores intent steps, route options, and GUI hints only as a fallback. In this examp...

Show HN: Verge Browser a self-hosted isolated browser sandbox for AI agents https://ift.tt/TnZ49Vv

Show HN: Verge Browser a self-hosted isolated browser sandbox for AI agents Built this because I wanted a better browser runtime for Openclaw, which can run on any server no only on Mac mini, emm. When it needs me to login or perform some operations, I can simply use noVNC to operate, and then leave everything else to it. https://ift.tt/03DbzAy March 12, 2026 at 11:57PM

Show HN: OneCLI – Vault for AI Agents in Rust https://ift.tt/rNmyOsS

Show HN: OneCLI – Vault for AI Agents in Rust We built OneCLI because AI agents are being given raw API keys. And it's going about as well as you'd expect. We figured the answer isn't "don't give agents access," it's "give them access without giving them secrets." OneCLI is an open-source gateway that sits between your AI agents and the services they call. You store your real credentials once in OneCLI's encrypted vault, and give your agents placeholder keys. When an agent makes an HTTP call through the proxy, OneCLI matches the request by host/path, verifies the agent should have access, swaps the placeholder for the real credential, and forwards the request. The agent never touches the actual secret. It just uses CLI or MCP tools as normal. Try it in one line: docker run --pull always -p 10254:10254 -p 10255:10255 -v onecli-data:/app/data ghcr.io/onecli/onecli The proxy is written in Rust, the dashboard is Next.js, and secrets are AES-256-G...

Show HN: A2Apex – Test, certify, and discover trusted A2A agents https://ift.tt/FnEfHYN

Show HN: A2Apex – Test, certify, and discover trusted A2A agents Hey HN, I built A2Apex ( https://a2apex.io ) — a testing and reputation platform for AI agents built on Google's A2A protocol. The problem: AI agents are everywhere, but there's no way to verify they actually work. No standard testing. No directory of trusted agents. No reputation system. What A2Apex does: - Test — Point it at any A2A agent URL. We run 50+ automated compliance checks: agent card validation, live endpoint testing, state machine verification, streaming, auth, error handling. - Certify — Get a 0-100 trust score with Gold/Silver/Bronze badges you can embed in your README or docs. - Get Listed — Every tested agent gets a public profile page in the Agent Directory with trust scores, skills, test history, and embeddable badges. Think of it as SSL Labs (testing) + npm (directory) + LinkedIn (profiles) — for AI agents. Stack: Python/FastAPI, vanilla JS, SQLite. No frameworks, no build tools. Runs on a Mac ...

Show HN: I built an ISP infrastructure emulator from scratch with a custom vBNG https://ift.tt/lqLpTbn

Show HN: I built an ISP infrastructure emulator from scratch with a custom vBNG Demo: https://ift.tt/7jF62IX GitHub: https://ift.tt/m17yiSv Aether is a multi-BNG (Broadband Network Gateway) ISP infrastructure lab built almost from scratch that emulates IPoE IPv4 subscriber management end-to-end. It supports IPoE/Ipv4 networks and runs a python-based vBNG with RADIUS AAA, per-subscriber traffic shaping, and traffic simulation emulated on Containerlab. It is also my first personal networking project, built roughly over a month. Motivations behind the project I'm a CS sophomore. About three years ago, I was assigned, as an intern, to build a OSS/BSS platform for a regional ISP by myself without mentoring. Referencing demo.splynx.com , I developed most of the BSS side ( bookkeeping, accounting, inventory management ), but, in terms of networking, I managed to install and setup RADIUS and that was about it. I didn't have anyone to mentor me or ask questions to, so I had given up the...

Show HN: Satellite imagery object detection using text prompts https://ift.tt/DFYdS5K

Show HN: Satellite imagery object detection using text prompts I built a browser-based tool for detecting objects in satellite imagery using vision-language models (VLMs). You draw a polygon on the map and enter a text prompt such as "swimming pools", "oil tanks", or "buses". The system scans the selected area tile-by-tile and returns detections projected back onto the map as GeoJSON. Pipeline: select area and zoom level, split the region into mercantile tiles, run each tile with the prompt through a VLM, convert predicted bounding boxes to geographic coordinates (WGS84), and render the results back on the map. It works reasonably well for distinct structures in a zero-shot setting. occluded objects are still better handled by specialized detectors like YOLO models. There is a public demo and no login required. I am mainly interested in feedback on detection quality, performance tradeoffs between VLMs and specialized detectors, and potential real-world use...

Show HN: Agentic Data Analysis with Claude Code https://ift.tt/bXINaLg

Show HN: Agentic Data Analysis with Claude Code Hey HN, as a former data analyst, I’ve been tooling around trying to get agents to do my old job. The result is this system that gets you maybe 80% of the way there. I think this is a good data point for what the current frontier models are capable of and where they are still lacking (in this case — hypothesis generation and general data intuition). Some initial learnings: - Generating web app-based reports goes much better if there are explicit templates/pre-defined components for the model to use. - Claude can “heal” broken charts if you give it access to chart images and run a separate QA loop. Would either feedback from the community or to hear from others that have tried similar things! https://ift.tt/CKlfiA7 March 10, 2026 at 11:44PM