Postingan

Menampilkan postingan dari April, 2026

Show HN: AgentSearch – Self-hosted search and MCP for AI agents, no API keys https://ift.tt/GTFCQO1

Show HN: AgentSearch – Self-hosted search and MCP for AI agents, no API keys https://ift.tt/c6fXlV7 April 24, 2026 at 01:25AM

Show HN: Turning a Gaussian Splat into a videogame https://ift.tt/KZCsNc0

Show HN: Turning a Gaussian Splat into a videogame https://ift.tt/5FyHviA April 23, 2026 at 09:18PM

Show HN: Core – open-source AI butler that clears your backlog without you https://ift.tt/uNo7JlG

Show HN: Core – open-source AI butler that clears your backlog without you Hi HN, we're Manik, Manoj and Harshith, and we're building CORE ( https://ift.tt/WAf92Ct ), an open source AI butler that acts and clears out your backlog. Write `[ ] Fix the search auth bug` in a scratchpad. Three minutes later, without you at the keyboard, CORE picks it up, pulls the relevant context from your codebase, drafts a plan in the task description, and spins up a Claude Code session in the background to do the work. You review the output in the task chat and unblock it when it gets stuck. Every AI tool today is reactive. You open a chat, brief the agent, it responds. Before anything moves, you've already done the real work: opened the Sentry error, found the commit, read the Slack thread, grabbed the Linear ticket, and stitched it all together into a prompt. The model isn't the bottleneck. You are. Demo Video: https://www.youtube.com/watch?v=PFk4RJvQg1Y CORE removes you from that loop...

Show HN: Netlify for Agents https://ift.tt/1aivtdn

Show HN: Netlify for Agents I launched Netlify with a Show HN more than 11 years today, for humans. Today we're launching our Agent first version of Netlify. Super early days for this, but I expect it to become as important as our original launch over time. It's as hard to perfect these flows as it was to perfect some of the initial human DX flows, since the agents are non-deterministic and keeps changing and evolving, and we'll have more to show soon on our eval tooling for this. Try it out with an agent, and we would love feedback on what works and what doesn't as we keep iterating on making Netlify better for our new agent friends. https://netlify.ai April 22, 2026 at 11:57PM

Show HN: A free tool for non-technical folks to easily publish a website https://ift.tt/DJvz1Qa

Show HN: A free tool for non-technical folks to easily publish a website It's easier than ever for anyone to make a website, even without paying for a drag-and-drop builder like Squarespace. But there are still too many barriers for your average non-technical person to publish a site on the web. I'd bet most people don't know there are free ways to host a website, and even if they find an explainer, technical platforms like Cloudflare and GitHub (let alone the command line) can be intimidating. So I made weejur, which is basically a super simple UI front-end for GitHub Pages. You log in with OAuth, and then you can just paste HTML or upload files to publish a website. If you don't have a GitHub account, you can sign up right in the OAuth flow. It's completely free, and you can view the source here [1]. My hope is this makes it easier for people who don't know anything about web hosting to create and share their own websites. Feel free to try it out and please sh...

Show HN: Ctx – a /resume that works across Claude Code and Codex https://ift.tt/LI2GFPh

Show HN: Ctx – a /resume that works across Claude Code and Codex ctx is a local SQLite-backed skill for Claude Code and Codex that stores context as a persistent workstream that can be continued across agent sessions. Each workstream can contain multiple sessions, notes, decisions, todos, and resume packs. It essentially functions as a /resume that can work across coding agents. Here is a video of how it works: https://ift.tt/oqwLz7j I initially built ctx because I wanted to try a workstream that I started on Claude and continue it from Codex. Since then, I’ve added a few quality of life improvements, including the ability to search across previous workstreams, manually delete parts of the context with, and branch off existing workstreams.. I’ve started using ctx instead of the native ‘/resume’ in Claude/Codex because I often have a lot of sessions going at once, and with the lists that these apps currently give, it’s not always obvious which one is the right one to pick back up. ctx g...

Show HN: Einlang, a math-intuitive language with lots of good stuff https://ift.tt/vSb2Z7B

Show HN: Einlang, a math-intuitive language with lots of good stuff With Einlang, you can write codes as let x = [ [[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0], [7.0, 8.0]] ]; let row_sums[..batch, i] = sum[j](x[..batch, i, j]); let loss = sum[..batch, i](row_sums[..batch, i] * row_sums[..batch, i]); let dloss_dx = @loss / @x; Einlang also supports recurrence. You can write codes like let alpha = 0.25; let x[0] = 8.0; let x[k in 1..6] = { let prev = x[k - 1]; let loss = prev * prev; let g = @loss / @prev; prev - alpha * g }; https://ift.tt/I1J9Ph7 April 20, 2026 at 11:03PM

Show HN: Tmux-bar – One-tap switching between windows in current tmux session https://ift.tt/ZY9RimH

Show HN: Tmux-bar – One-tap switching between windows in current tmux session I revived an old idea I had, a small native macOS menu bar app that shows your tmux windows as Touch Bar buttons, so switching windows is one tap away. It runs quietly in the menu bar, watches which terminal is focused (Terminal, iTerm2, Ghostty), and refreshes the Touch Bar with your current tmux windows. This was a fun “vibe coding” side project, but also a practical tool I wanted for my own workflow. Hope it can be useful to someone else. https://ift.tt/S6ZXTbe April 20, 2026 at 10:54PM

Show HN: Themeable HN https://ift.tt/ouBrOyl

Gambar
Show HN: Themeable HN Hi HN, I needed a distraction from the new and scary thing I was working on this weekend, so I grabbed HN's stylesheet, extracted CSS variables for every colour it defined, then re-applied them back onto HN using its own selectors (plus some extras for bits of HN which are styled inline, and some separation between header and content styling), allowing it to be themed with variables. After using them to implement a dark mode (and a pure black variant for OLED), plugging everything into my existing HN browser extension which already lets you apply custom CSS, and making it handle theme switching via attributes on , custom styles and theming are now manageable using it. There are a few examples in the friendly release notes above, with screenshots and copy-pasteable CSS. If you just want to grab the stylesheet which sets up the CSS variables and application rules to use in your own thing, it's here [1], but not _everything_ is themeable without a bit more wo...

Show HN: Agentkit-CLI, one canonical context file for AI coding agents https://ift.tt/IUGHr3P

Show HN: Agentkit-CLI, one canonical context file for AI coding agents https://mikiships.github.io/agentkit-cli/ April 20, 2026 at 10:04PM

Show HN: A privacy-first, local-LLM note app for iOS (Google Keep alternative) https://ift.tt/bpu4PAO

Show HN: A privacy-first, local-LLM note app for iOS (Google Keep alternative) https://ift.tt/yKE4pRm April 19, 2026 at 11:59PM

Show HN: Free PDF redactor that runs client-side https://ift.tt/yC5sAqP

Show HN: Free PDF redactor that runs client-side I recently needed to verify past employment and to do so I was going to upload paystubs from a previous employer, however I didn't want to share my salary in that role. I did a quick search online and most sites required sign-up or weren't clear about document privacy. I conceded and signed up for a free trial of Adobe Acrobat so I could use their PDF redaction feature. I figured there should be a dead simple way of doing this that's private, so I decided to create it myself. What this does is rasterize each page to an image with your redactions burned in, then it rebuilds the PDF so the text layer is permanently destroyed and not just covered up and easily retrievable. I welcome any and all feedback as this is my first live tool, thanks! https://redactpdf.net April 20, 2026 at 01:39AM

Show HN: Faceoff – A terminal UI for following NHL games https://ift.tt/zDX9bo5

Show HN: Faceoff – A terminal UI for following NHL games Faceoff is a TUI app written in Python to follow live NHL games and browse standings and stats. I got the inspiration from Playball, a similar TUI app for MLB games that was featured on HN. The app was mostly vibe-coded with Claude Code, but not one-shot. I added features and fixed bugs by using it, as I spent way too much time in the terminal over the last few months. Try it out with `uvx faceoff` (requires uv). https://ift.tt/WYfJAhT April 20, 2026 at 12:44AM

Show HN: Google Gemini Is Scanning Your Photos – and the EU Said No https://ift.tt/37vrP4X

Show HN: Google Gemini Is Scanning Your Photos – and the EU Said No Google has expanded its Personal Intelligence feature so that Gemini can now access your Google Photos face data, Gmail, YouTube history, and search activity to generate personalized AI images — live for US paid subscribers as of April 2026. https://ift.tt/U3tOxob... April 19, 2026 at 11:36PM

Show HN: AI Subroutines – Run automation scripts inside your browser tab https://ift.tt/417CnEw

Show HN: AI Subroutines – Run automation scripts inside your browser tab We built AI Subroutines in rtrvr.ai. Record a browser task once, save it as a callable tool, replay it at: zero token cost, zero LLM inference delay, and zero mistakes. The subroutine itself is a deterministic script composed of discovered network calls hitting the site's backend as well as page interactions like click/type/find. The key architectural decision: the script executes inside the webpage itself, not through a proxy, not in a headless worker, not out of process. The script dispatches requests from the tab's execution context, so auth, CSRF, TLS session, and signed headers get added to all requests and propagate for free. No certificate installation, no TLS fingerprint modification, no separate auth stack to maintain. During recording, the extension intercepts network requests (MAIN-world fetch/XHR patch + webRequest fallback). We score and trim ~300 requests down to ~5 based on method, timing re...

Show HN: Praxis – Lab data to publication-ready figures in one Python package https://ift.tt/bTpfJrM

Show HN: Praxis – Lab data to publication-ready figures in one Python package https://ift.tt/XHbxSs9 April 19, 2026 at 01:15AM

Show HN: I built Panda to get up to 99% token savings https://ift.tt/mHAaxRz

Show HN: I built Panda to get up to 99% token savings https://ift.tt/ClNSxvd April 18, 2026 at 05:00PM

Show HN: Waputer – The WebAssembly Computer https://ift.tt/6kYAnuv

Show HN: Waputer – The WebAssembly Computer Waputer is an operating system that runs entirely in the browser. When you visit the website at https://waputer.app , a kernel written in JavaScript sets up a filesystem and launches a WebAssembly program, which in turn talks to the kernel to handle the display and input. A purely terminal-based version is at https://waputer.dev . My original intention was to create programs that run in the browser that have a lot more in common with the desktop. The traditional "hello world" program is not really suited for the web. Waputer changes that. The GitHub repo at https://ift.tt/YjgZcXE gives a very brief overview of compiling a C program and running it on Waputer. There is a blog available from the main site that has a long-form explanation of Waputer and my motivations if you want some additional reading. https://waputer.app April 18, 2026 at 12:46AM

Show HN: Bird, a CLI for Tired Brains https://ift.tt/SJC0oQE

Show HN: Bird, a CLI for Tired Brains https://ift.tt/FS4HZVB April 18, 2026 at 12:13AM

Show HN: PanicLock – Close your MacBook lid disable TouchID –> password unlock https://ift.tt/KLnVqbG

Show HN: PanicLock – Close your MacBook lid disable TouchID –> password unlock https://ift.tt/LZFdG8k April 17, 2026 at 11:38PM

Show HN: Stage – Putting humans back in control of code review https://ift.tt/jYkKsr7

Show HN: Stage – Putting humans back in control of code review Hey HN! We're Charles and Dean, and we're building Stage: a code review tool that guides you through reading a PR step by step, instead of piecing together a giant diff. Here's a demo video: https://ift.tt/anDedrW . You can play around with some example PRs here: https://ift.tt/OHvftMp . Teams are moving faster than ever with AI these days, but more and more engineers are merging changes that they don't really understand. The bottleneck isn't writing code anymore, it's reviewing it. We're two engineers who got frustrated with GitHub's UI for code review. As coding agents took off, we saw our PR backlog pile up faster than we could handle. Not only that, the PRs themselves were getting larger and harder to understand, and we found ourselves spending most of our time trying to build a mental model of what a PR was actually doing. We built Stage to make reviewing a PR feel more like reading chap...

Show HN: Arrow. Point your phone, walk where it says, find out where https://ift.tt/sKAgU1X

Show HN: Arrow. Point your phone, walk where it says, find out where https://kouh.me/arrow April 16, 2026 at 11:36PM

Show HN: I built a music theory course with games and spaced repetition https://ift.tt/X6UxKS5

Show HN: I built a music theory course with games and spaced repetition I’ve spent a year building a theory learning path that starts from scratch and goes all the way up to topics like Secondary Dominants and Borrowed Chords. It uses a combination of games, interactive lessons and spaced repetition to help you understand and remember concepts. Not just learn something new and forget it in a few days. I’m trying to figure out: 1. Is the progression logical? 2. What am I missing that you’d like to see in there? 3. Where does it get confusing and could use more clarification? https://ift.tt/X0dpWjt April 16, 2026 at 10:29PM

Show HN: CodeBurn – Analyze Claude Code token usage by task https://ift.tt/e8IT0Uj

Show HN: CodeBurn – Analyze Claude Code token usage by task Built this after realizing I was spending ~$1400/week on Claude Code with almost no visibility into what was actually consuming tokens. Tools like ccusage give a cost breakdown per model and per day, but I wanted to understand usage at the task level. CodeBurn reads the JSONL session transcripts that Claude Code stores locally (~/.claude/projects/) and classifies each turn into 13 categories based on tool usage patterns (no LLM calls involved). One surprising result: about 56% of my spend was on conversation turns with no tool usage. Actual coding (edits/writes) was only ~21%. The interface is an interactive terminal UI built with Ink (React for terminals), with gradient bar charts, responsive panels, and keyboard navigation. There’s also a SwiftBar menu bar integration for macOS. Happy to hear feedback or ideas. https://ift.tt/x2bXt6I April 14, 2026 at 05:57AM

Show HN: MCP server gives your agent a budget (save tokens, get smarter results) https://ift.tt/ykMrxQ7

Show HN: MCP server gives your agent a budget (save tokens, get smarter results) As a consultant I foot my own Cursor bills, and last month was $1,263. Opus is too good not to use, but there's no way to cap spending per session. After blowing through my Ultra limit, I realized how token-hungry Cursor + Opus really is. It spins up sub-agents, balloons the context window, and suddenly, a task I expected to cost $2 comes back at $8. My bill kept going up, but was I really going to switch to a worse model? No. So I built l6e: an MCP server that gives your agent the ability to budget. It works with Cursor, Claude Code, Windsurf, Openclaw, and every MCP-compatible application. Saving money was why I built it, but what surprised me was that the process of budgeting changed the agent's behavior. An agent that understands the limitations of the resources doesn't try to speculatively increase the context window with extra files. It doesn't try to reach every possible API. The age...

Show HN: A Claude Code–driven tutor for learning algorithms in Go https://ift.tt/V5J8HSi

Show HN: A Claude Code–driven tutor for learning algorithms in Go https://ift.tt/yRlDxCJ April 15, 2026 at 12:41AM

Show HN: LangAlpha – what if Claude Code was built for Wall Street? https://ift.tt/QVt7SUP

Show HN: LangAlpha – what if Claude Code was built for Wall Street? Some technical context on what we ran into building this. MCP tools don't really work for financial data at scale. One tool call for five years of daily prices dumps tens of thousands of tokens into the context window. And data vendors pack dozens of tools into a single MCP server, schemas alone can eat 50k+ tokens before the agent does anything useful. So we auto-generate typed Python modules from the MCP schemas at workspace init and upload them into the sandbox. The agent just imports them like a normal library. Only a one-line summary per server stays in the prompt. We have around 80 tools across our servers and the prompt cost is the same whether a server has 3 tools or 30. This part isn't finance-specific, it works with any MCP server. The other big thing was making research actually persist across sessions. Most agents treat a single deliverable (a PDF, a spreadsheet) as the end goal. In investing that...

Show HN: pg_grpc – Call gRPC services directly from PostgreSQL https://ift.tt/C9uaTW7

Show HN: pg_grpc – Call gRPC services directly from PostgreSQL https://ift.tt/ifnY4rb April 14, 2026 at 12:50AM

Show HN: 15 yrs of Django in prod: patterns I keep using (agent skills) https://ift.tt/kDZNKUR

Show HN: 15 yrs of Django in prod: patterns I keep using (agent skills) https://ift.tt/9y8p3Rg April 13, 2026 at 10:16PM

Show HN: Rekal – Long-term memory for LLMs in a single SQLite file https://ift.tt/6fOuzy5

Show HN: Rekal – Long-term memory for LLMs in a single SQLite file I got tired of repeating myself to my LLM every session. rekal is an MCP server that stores memories in SQLite and retrieves them with hybrid search (BM25 + vectors + recency decay). One file, local embeddings, no API keys. https://ift.tt/BmMPwiu April 13, 2026 at 04:25AM

Show HN: Claudraband – Claude Code for the Power User https://ift.tt/iecGOW1

Show HN: boringBar – a taskbar-style dock replacement for macOS https://ift.tt/jUbixO9

Show HN: boringBar – a taskbar-style dock replacement for macOS Hi HN! I recently switched from a Fedora/GNOME laptop to a MacBook Air. My old setup served me well as a portable workstation, but I’ve started traveling more while working remotely and needed something with similar performance but better battery life. The main thing I missed was a simple taskbar that shows the windows in the current workspace instead of a Dock that mixes everything together. I built boringBar so I would not have to use the Dock. It shows only the windows in the current Space, lets you switch Spaces by scrolling on the bar, and adds a desktop switcher so you can jump directly to any Space. You can also hide the system Dock, pin apps, preview windows with thumbnails, and launch apps from a searchable menu (I keep Spotlight disabled because for some reason it uses a lot of system resources on my machine). I’ve been dogfooding it for a few months now, and it finally felt polished enough to share. It’s for peo...

Show HN: T4 – a versioned datastore with branching and time-travel (S3-backed) https://ift.tt/hZRmXQe

Show HN: T4 – a versioned datastore with branching and time-travel (S3-backed) Hi HN, I built t4, a datastore that stores its WAL and snapshots in S3. Instead of traditional storage, it writes append-only segments to object storage and reconstructs state from checkpoints + WAL. A side effect of this model is that the database becomes naturally versioned: - you can restore any past state - branch from any point (with copy-on-write) - replay history I started this as an experiment to replace etcd in Kubernetes, but it’s evolving into a general-purpose versioned state store. Curious what people think about: - using object storage as the primary persistence layer - whether branching/time-travel is actually useful in practice https://ift.tt/aSDs3VU April 13, 2026 at 12:22AM

Show HN: We scanned uscis.gov for third-party trackers. The results are jarring https://ift.tt/amDKlIv

Show HN: We scanned uscis.gov for third-party trackers. The results are jarring https://ift.tt/q5JioSc April 11, 2026 at 08:43PM

Show HN: OpenDescent, decentralised encrypted messenger, no servers, no accounts https://ift.tt/9yS0VNf

Show HN: OpenDescent, decentralised encrypted messenger, no servers, no accounts https://ift.tt/jWc7ZeL April 11, 2026 at 11:33PM

Show HN: FluidCAD – Parametric CAD with JavaScript https://ift.tt/UPboXO6

Show HN: FluidCAD – Parametric CAD with JavaScript Hello HN users, This is a CAD by code project I have been working on on my free time for more than year now. I built it with 3 goals in mind: - It should be familiar to CAD designers who have used other programs. Same workflow, same terminology. - Reduce the mental effort required to create models as much as possible. This is achieved by: - Provide live rendering and visual guidance as you type. - Allow the user to reference existing edges/faces on the scene instead of having to calculate everything. - Provide interactive mouse helpers for features that are hard to write by code: Only 3 interactive modes for now: Edge trimming, Sketch region extrude, Bezier curve drawing. - Implicit coding whenever possible: e.g: There are sensible defaults for most parameters. The program will automatically fuse intersecting objects together so you do not have to worry about what object needs to be fused with what. - It should be reasonably fast: The ...

Show HN: Figma for Coding Agents https://ift.tt/qhHKBXg

Show HN: Figma for Coding Agents Feels a bit like Figma, but for coding agents. Instead of going back and forth with prompts, you give the agent a DESIGN.md that defines the design system up front, and it generally sticks to it when generating UI. Google Stitch seems to be moving in this direction as a standard, so we put together a small collection of DESIGN.md files based on popular web sites. https://getdesign.md April 10, 2026 at 10:20PM

Show HN: I built a Cargo-like build tool for C/C++ https://ift.tt/h92cANe

Show HN: I built a Cargo-like build tool for C/C++ I love C and C++, but setting up projects can sometimes be a pain. Every time I wanted to start something new I'd spend the first hour writing CMakeLists.txt, figuring out find_package, copying boilerplate from my last project, and googling why my library isn't linking. By the time the project was actually set up I'd lost all momentum. So, I built Craft - a lightweight build and workflow tool for C and C++. Instead of writing CMake, your project configuration goes in a simple craft.toml: [project] name = "my_app" version = "0.1.0" language = "c" c_standard = 99 [build] type = "executable" Run craft build and Craft generates the CMakeLists.txt automatically and builds your project. Want to add dependencies? That's just a simple command: craft add --git https://ift.tt/TXfjvLB --links raylib craft add --path ../my_library craft add sfml Craft will clone the dependency, regenerate the...

Show HN: LLM-Wiki but for Early Founders https://ift.tt/HtwUFZE

Show HN: LLM-Wiki but for Early Founders https://ift.tt/WVO3HvL April 9, 2026 at 11:27PM

Show HN: I built Dirac, Hash Anchored AST native coding agent, costs -64.8 pct https://ift.tt/cLiEqTe

Show HN: I built Dirac, Hash Anchored AST native coding agent, costs -64.8 pct Fully open source, a hard fork of cline. Full evals on the github page that compares 7 agents (Cline, Kilo, Ohmypi, Opencode, Pimono, Roo, Dirac) on 8 medium complexity tasks. Each task, each diff and correctness + cost info on the github Dirac is 64.8% cheaper than the average of the other 6. https://ift.tt/qQZUH5v April 9, 2026 at 07:06PM

Show HN: Homebutler – I manage my homelab from chat. AI never gets raw shell https://ift.tt/sryzeOj

Show HN: Homebutler – I manage my homelab from chat. AI never gets raw shell https://homebutler.dev April 9, 2026 at 07:09PM

Show HN: CSS Studio. Design by hand, code by agent https://ift.tt/rRhzI8C

Show HN: CSS Studio. Design by hand, code by agent Hi HN! I've just released CSS Studio, a design tool that lives on your site, runs on your browser, sends updates to your existing AI agent, which edits any codebase. You can actually play around with the latest version directly on the site. Technically, the way this works is you view your site in dev mode and start editing it. In your agent, you can run /studio which then polls (or uses Claude Channels) an MCP server. Changes are streamed as JSON via the MCP, along with some viewport and URL information, and the skill has some instructions on how best to implement them. It contains a lot of the tools you'd expect from a visual editing tool, like text editing, styles and an animation timeline editor. https://cssstudio.ai April 9, 2026 at 06:23PM

Show HN: OpenMix, open-source computational framework for formulation science https://ift.tt/weV4KSG

Show HN: OpenMix, open-source computational framework for formulation science I built OpenMix because computational chemistry has great tools for individual molecules (RDKit, DeepChem) but nothing for mixtures. pip install openmix. Apache 2.0. Technical blog: https://ift.tt/RWB9Alg... https://ift.tt/RYbW5XC April 9, 2026 at 12:12AM

Show HN: I built a navigation app that displays weather along the route https://ift.tt/L1uJegK

Show HN: I built a navigation app that displays weather along the route Hello HN, I live in northern part of USA where winters are snowy. Whenever I took long trips, I always wondered what the weather along the route is going to be. For those who live in northern USA, you know weather can change frequently, so when you're traveling matters a lot, not just the route. To solve this problem for myself, I built https://navimodo.com/ . NaviModo calculates the route, and then checks weather along the route based on your start time, and displays weather along the route. Change start time, and the whole thing is recalculated. I am not expecting any commercialization for this, just wanted to scratch an itch, and just did it. I have ideas for adding additional features (suggestions for when to take breaks based on bad weather, auto-suggestion of start time to avoid bad weather, etc.,) but will add more as time goes by. Any feedback is welcome! https://navimodo.com/ April 6, 2026 at 08:58PM

Show HN: A reasoning hierarchical robotics pipeline you can run in the browser https://ift.tt/hHdC941

Show HN: A reasoning hierarchical robotics pipeline you can run in the browser This demo combines the flexible task programming and reasoning of Gemini ER (what is the scene, and what should I do?) and classical camera calibration, kinematics, motion controllers. Each layer is independently swappable, and the AI model doesn't need to know anything about the robot's embodiment. This recreates the modularity of a Sense-Plan-Act architecture while retaining the semantic reasoning of a foundation AI model. A writeup explaining the tradeoffs is linked from the page https://ift.tt/3sQPgRD . https://avikde.github.io/vla-pipeline/ April 8, 2026 at 12:35AM

Show HN: Finalrun – Spec-driven testing using English and vision for mobile apps https://ift.tt/aFmJ1jE

Show HN: Finalrun – Spec-driven testing using English and vision for mobile apps I wanted to test mobile apps in plain English instead of relying on brittle selectors like XPath or accessibility IDs. With a vision-based agent, that part actually works well. It can look at the screen, understand intent, and perform actions across Android and iOS. The bigger problem showed up around how tests are defined and maintained. When test flows are kept outside the codebase (written manually or generated from PRDs), they quickly go out of sync with the app. Keeping them updated becomes a lot of effort, and they lose reliability over time. I then tried generating tests directly from the codebase (via MCP). That improved sync, but introduced high token usage and slower generation. The shift for me was realizing test generation shouldn’t be a one-off step. Tests need to live alongside the codebase so they stay in sync and have more context. I kept the execution vision-based (no brittle selectors), b...

Show HN: MCP 2000 – Browser-based drum machine with AI-generated sounds https://ift.tt/N7DIqMo

Show HN: MCP 2000 – Browser-based drum machine with AI-generated sounds https://ift.tt/VstO0o9 April 6, 2026 at 10:35PM

Show HN: A Dad Joke Website https://ift.tt/thEW3jC

Show HN: A Dad Joke Website A dad joke website where you can rate random dad jokes, 1-5 groans. Sourced from 4 different places, all cited, all categorized, and ranked by top voted. Help me create the worlds best dadabase! https://joshkurz.net/ April 6, 2026 at 12:54AM

Show HN: Gecit – DPI bypass using eBPF sock_ops, no proxy or VPN https://ift.tt/yTwPD2A

Show HN: Gecit – DPI bypass using eBPF sock_ops, no proxy or VPN https://ift.tt/1pAi0cN April 5, 2026 at 11:15PM

Show HN: A Common Lisp implementation in development https://ift.tt/kWpfL37

Show HN: A Common Lisp implementation in development https://ift.tt/xQhKU3v April 5, 2026 at 09:56PM

Show HN: Kaoslabs – High-intensity AI video and visual experiments https://ift.tt/dX8NTFi

Show HN: Kaoslabs – High-intensity AI video and visual experiments "I've been building a sandbox on a Linux VPS to push AI video generation and visualization to the extreme. It's a mix of experimental generative art and high-intensity visuals. Built with Python, running on Debian. Check it out and let me know what you think!" https://kaoslabs.org April 4, 2026 at 11:54PM

Show HN: DocMason – Agent Knowledge Base for local complex office files https://ift.tt/VPz7GSi

Show HN: DocMason – Agent Knowledge Base for local complex office files I think everyone has already read Karpathy's Post about LLM Knowledge Bases. Actually for recent weeks I am already working on agent-native knowledge base for complex research (DocMason). And it is purely running in Codex/Claude Code. I call this paradigm is: The repo is the app. Codex is the runtime. During my daily working life, I have tons of office documents with knowledge from all teams, and as an IT Architect, I need to combine them altogether to handle complex deep research (which normal LLM definitely could not help). That is the originally reason I built DocMason, and I am using it in everyday which support me on lots of complex topics. I have already open-sourced this repo. And I think it takes Karpathy's concept a step further for real-world usage in three ways: 1. It could handle most kinds of office docs (pptx, docx, excels, even .eml). And really extract multimodal information from all IT arch...

Show HN: A game where you build a GPU https://ift.tt/0J4G6QH

Show HN: A game where you build a GPU Thought the resources for GPU arch were lacking, so here we are https://ift.tt/b8KjEaG April 4, 2026 at 11:45PM

Show HN: DocMason – AI Agent Knowledge Base for local complex office files https://ift.tt/djyHvQa

Show HN: DocMason – AI Agent Knowledge Base for local complex office files https://ift.tt/TSPHquO April 4, 2026 at 11:41PM

Show HN: Aurion OS, A 1.8MB OS with a browser, try it live (C/x86 ASM) https://ift.tt/OwPZ5xD

Show HN: Aurion OS, A 1.8MB OS with a browser, try it live (C/x86 ASM) I posted Aurion OS a few weeks ago on HN. Since then, the OS has gone from Beta to v1.0 Release with a lot of improvements: Blaze Browser: HTML/CSS/JS rendering with tabs and a developer console (local only, no full http/https support for now) Installer with user account setup and app selection Multi-resolution support (800x600 to 2560x1440, I plan to add 4096x2160 pixels in next versions) Unix-style luka@aurion prompt Serbian keyboard layout Python interpreter and Make build system 50+ terminal commands Window manager improvements and bug fixes 1.8MB ISO (entire OS including the browser and GUI) Supports QEMU, VirtualBox, VMware, and v86 You can try it live in the link above, or grab the ISO from GitHub: https://ift.tt/Y6tBwFE Built solo as a hobby/learning project. I'm 13. I'd love any feedback, suggestions! https://aurionos.vercel.app/ April 3, 2026 at 11:01PM

Show HN: Most products have no idea what their AI agents did yesterday https://ift.tt/lHQxvFo

Show HN: Most products have no idea what their AI agents did yesterday We build collaboration SDKs at Velt (YC W22). Comments, presence, real-time editing (CRDT), recording, notifications. A pattern we keep seeing: products add AI agents that write, edit, and approve things. Human actions get logged. Agent actions don't. Same workflow, different accountability. We shipped Activity Logs to fix this. Same record for humans and AI agents. Immutable by default. Auto-captures collaboration events, plus createActivity() for your own. Curious how others are handling this. https://ift.tt/nkeMbyA April 3, 2026 at 01:25AM

Show HN: I tested 15 free AI models at building real software on a $25/year VPS https://ift.tt/8Hj7gIC

Show HN: I tested 15 free AI models at building real software on a $25/year VPS https://ift.tt/wsWVH5o April 3, 2026 at 12:13AM

Show HN: Portcullis, a review gate for curl|bash https://ift.tt/G5Kl9s2

Show HN: Portcullis, a review gate for curl|bash https://ift.tt/AI2X3BZ April 2, 2026 at 11:39PM

Show HN: Zerobox – Sandbox any command with file and network restrictions https://ift.tt/h9vqwIE

Show HN: Zerobox – Sandbox any command with file and network restrictions I'm excited to introduce Zerobox, a cross-platform, single binary process sandboxing CLI written in Rust. It uses the sandboxing crates from the OpenAI Codex repo and adds additional functionalities like secret injection, SDK, etc. Watch the demo: https://www.youtube.com/watch?v=wZiPm9BOPCg Zerobox follows the same sandboxing policy as Deno which is deny by default. The only operation that the command can run is reading files, all writes and network I/O are blocked by default. No VMs, no Docker, no remote servers. Want to block reads to /etc? zerobox --deny-read=/etc -- cat /etc/passwd cat: /etc/passwd: Operation not permitted How it works: Zerobox wraps any commands/programs, runs an MITM proxy and uses the native sandboxing solutions on each operating system (e.g BubbleWrap on Linux) to run the given process in a sandbox. The MITM proxy has two jobs: blocking network calls and injecting credentials at the n...

Show HN: Max Headbox, a local agent that fits on a Raspberry Pi 5 https://ift.tt/2ERM3iD

Show HN: Max Headbox, a local agent that fits on a Raspberry Pi 5 https://ift.tt/8wuxW9M April 1, 2026 at 09:57PM