Postingan

Show HN: Playwright Best Practices AI SKill https://ift.tt/5ndGKSx

Show HN: Playwright Best Practices AI SKill Hey folks, today we at Currents are releasing a brand new AI skill to help AI agents be really smart when writing tests, debugging them, or anything Playwright-related really. This is a very comprehensive skill, covering everyday topics like fixing flakiness, authentication, or writing fixtures... to more niche topics like testing Electron apps, PWAs, iFrames and so forth. It should make your agent much better at writing, debugging and maintaining Playwright code. for whoever didn't learn about skills yet, it's a new powerful feature that allows you to make the AI agents in your editor/cli (Cursor, Claude, Antigravity, etc) experts in some domain and better at performing specific tasks. (See https://ift.tt/8M6kKQb ) You can install it by running: npx skills add https://ift.tt/TVPtGyS... The skill is open-source and available under MIT license at https://ift.tt/TVPtGyS... -> check out the repo for full documentation and understandin...

Show HN: Viberails – Easy AI Audit and Control https://ift.tt/i0zwJdE

Show HN: Viberails – Easy AI Audit and Control Hello HN. I'm Maxime, founder at LimaCharlie ( https://limacharlie.io ), a Hyperscaler for SecOps (access building blocks you need to build security operations, like AWS does for IT). We’ve engineered a new product on our platform that solves a timely issue acting as a guardrail between your AI and the world: Viberails ( https://ift.tt/j4VSdJg ) This won't be new to folks here, but we identified 4 challenges teams face right now with AI tools: 1. Auditing what the tools are doing. 2. Controlling toolcalls (and their impact on the world). 3. Centralized management. 4. Easy access to the above. To expand: Audit logs are the bread and butter for security, but this hasn't really caught up in AI tooling yet. Being able to look back and say "what actually happened" after the fact is extremely valuable during an incident and for compliance purposes. Tool calls are how LLMs interact with the world, we should be able to exerci...

Show HN: Tabstack Research – An API for verified web research (by Mozilla) https://ift.tt/da1QLrX

Show HN: Tabstack Research – An API for verified web research (by Mozilla) Hi HN, My team and I are building Tabstack to handle the web layer for AI agents. Today we are sharing Tabstack Research, an API for multi-step web discovery and synthesis. https://ift.tt/zZLF8fl In many agent systems, there is a clear distinction between extracting structured data from a single page and answering a question that requires reading across many sources. The first case is fairly well served today. The second usually is not. Most teams handle research by combining search, scraping, and summarization. This becomes brittle and expensive at scale. You end up managing browser orchestration, moving large amounts of raw text just to extract a few claims, and writing custom logic to check if a question was actually answered. We built Tabstack Research to move this reasoning loop into the infrastructure layer. You send a goal, and the system: - Decomposes it into targeted sub-questions to hit different data ...

Show HN: GitHub Browser Plugin for AI Contribution Blame in Pull Requests https://ift.tt/wtFnRYM

Show HN: GitHub Browser Plugin for AI Contribution Blame in Pull Requests https://ift.tt/USt0Tzd February 3, 2026 at 09:35PM

Show HN: TrueLedger – a local-first personal finance app with no cloud back end https://ift.tt/wdicHag

Show HN: TrueLedger – a local-first personal finance app with no cloud back end Hi HN, I built TrueLedger because I didn’t want a personal finance app that requires a cloud account or bank credential access just to work. TrueLedger is a local-first personal finance app. All data stays on the user’s device and works fully offline. Technical choices: - SQLite for local storage across platforms - SQLCipher (AES-256) for encrypted databases - Web version runs entirely client-side using SQLite WASM - Encrypted, deterministic JSON backups for portability without a server Demo (runs fully client-side): https://ift.tt/aMKzhLl Source: https://ift.tt/Za0AYQn Happy to answer questions about local-first design or encryption tradeoffs. https://ift.tt/aMKzhLl February 3, 2026 at 11:36PM

Show HN: C discrete event SIM w stackful coroutines runs 45x faster than SimPy https://ift.tt/Gbxohlt

Show HN: C discrete event SIM w stackful coroutines runs 45x faster than SimPy Hi all, I have built Cimba , a multithreaded discrete event simulation library in C. Cimba uses POSIX pthread multithreading for parallel execution of multiple simulation trials, while the coroutines provide concurrency inside each simulated trial universe. The simulated processes are based on asymmetric stackful coroutines with the context switching hand-coded in assembly. The stackful coroutines make it natural to express agentic behavior by conceptually placing oneself "inside" that process and describing what it does. A process can run in an infinite loop or just as a one-shot customer passing through the system, yielding and resuming execution from any level of its call stack, acting both as an active agent and a passive object as needed. This is inspired by my own experience programming in Simula67, many moons ago, where I found the coroutines more important than the deservedly famous object-...

Show HN: ItemGrid – Free inventory management for single-location businesses https://ift.tt/IPZrMW5

Show HN: ItemGrid – Free inventory management for single-location businesses Hey HN, After building Box QR (personal inventory tracker), I kept hearing "I need this for my business." So I'm exploring ItemGrid - lightweight inventory management that doesn't suck. The problem: Small businesses are stuck between Google Sheets (messy, no mobile scanning) and enterprise software (expensive, overcomplicated). What ItemGrid does: Visual grid interface QR/barcode scanning Multi-location support Free for 1 location forever $8/user when you grow Right now it's just a landing page collecting validation signups. Not building the full product until I hit 50-100 signups to confirm real demand. Would love feedback, especially if you've dealt with inventory headaches. https://itemgrid.io https://itemgrid.io February 3, 2026 at 10:49PM