Posts

Show HN: Statewright – Visual state machines that make AI agents reliable https://ift.tt/nczyieZ

Agentic problem solving in its current state is very brittle. I fell in love with it, but it creates as many problems as it solves. I'm Ben Cochran. I have spent 20+ years in the trenches across full-stack engineering, DevOps, high-performance computing, and ML, with stints at NVIDIA, AMD, and various other organizations, most recently as a Distinguished Engineer. For agents to work reliably, you need either massive parameter counts or massive context windows to keep the solution space workable. Most people are brute-forcing reliability with bigger models and longer prompts. What if I made the problem smaller instead of making the model bigger? I took a different approach: I set smaller models, in the 13-20B parameter range, to solving real SWE-bench problems, and constrained the tool and solution spaces using formal state machines. Each state in the machine defines which tools the model can a...
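To make the idea of a per-state tool allowlist concrete, here is a minimal sketch in Python. It is not Statewright's implementation or API: the state names (`explore`, `edit`, `verify`), the tool names, and every class and function below are hypothetical, chosen only to illustrate how a state machine can shrink the set of actions available to a model at each step.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class State:
    """One node of the machine (hypothetical shape, for illustration only)."""
    name: str
    allowed_tools: frozenset[str]  # tools the model may call in this state
    transitions: frozenset[str]    # states reachable from this state


class ToolNotAllowed(Exception):
    """Raised when the model requests a tool outside the current state's allowlist."""


class AgentStateMachine:
    def __init__(self, states: list[State], start: str) -> None:
        self._states = {s.name: s for s in states}
        self.current = self._states[start]

    def call_tool(self, tool: str, run: Callable[[], object]) -> object:
        # Reject the call before it runs if the current state does not permit it.
        if tool not in self.current.allowed_tools:
            raise ToolNotAllowed(f"{tool!r} is not allowed in state {self.current.name!r}")
        return run()

    def advance(self, target: str) -> None:
        # Only transitions declared on the current state are legal.
        if target not in self.current.transitions:
            raise ValueError(f"no transition {self.current.name!r} -> {target!r}")
        self.current = self._states[target]


def build_demo_machine() -> AgentStateMachine:
    # Hypothetical three-state workflow: read first, then edit, then verify.
    states = [
        State("explore", frozenset({"read_file", "search"}), frozenset({"edit"})),
        State("edit", frozenset({"write_file"}), frozenset({"verify"})),
        State("verify", frozenset({"run_tests"}), frozenset({"edit"})),
    ]
    return AgentStateMachine(states, start="explore")
```

In this sketch, a model in the `explore` state that asks for `write_file` is refused before anything executes, so the set of possible next actions stays small regardless of model size.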

Fix It! Week Is Back May 18, 2026: Taking Care of Muni from the Ground Up

By Sevilla Mann. Fix It! Week subway closures allow us to do maintenance work that makes our system safer and more reliable. The second Fix It! Week of the year is coming up on May 18, 2026, and we’re here to help you prepare. These Fix It! Week closures ensure our crews can move safely as they carry out critical repairs and inspections. Their work helps to keep our system safe and reliable for everyone. The details: When: Market Street subway service will end early at 9:30 p.m. each night from Monday, May 18 through Thursday, May 21. Where: West Portal through Embarcadero stations. Bus shuttles/service: Bus... Published 2026-05-12T00:00:00Z https://ift.tt/3d2gv81

Show HN: Gigacatalyst – Extend your SaaS with an embedded AI builder https://ift.tt/dDpwP9X

Hi HN, I’m Namanyay from Gigacatalyst (link: https://ift.tt/edyx5Ho). Gigacatalyst allows sales, CS, and users to build one-off features, so your SaaS can support long-tail customer workflows and engineers aren’t pulled away from the roadmap. When you sell software to large businesses, you realize that each customer needs their own workflows and features. Traditionally, this means either long engineering roadmaps or customers resorting to workarounds. But what if everyone could build their critical missing features just by talking to an AI? That’s what we do at Gigacatalyst. We provide an AI customization layer for your customers, CS team, and sales team to build these missing critical workflows without needing any engineers at all. Think Lovable, but built on top of YOUR platform. We connect to your product's APIs, learn your data model and design system, and let non-technical users build governed apps via...

Show HN: SyncBank – Self-hosted bank sync for EU banks https://ift.tt/HWouhbB

https://syncbank.app/ May 12, 2026 at 01:02AM

Show HN: TikTok but for Scientific Papers https://ift.tt/RHp7ZFJ

https://andreaturchet.github.io/website/index.html May 11, 2026 at 11:05PM

Show HN: Learn2Burp – Surgery-free solution for R-CPD https://ift.tt/ZNKe4bG

R-CPD (Retrograde Cricopharyngeus Dysfunction) is a condition where a muscle in the throat never learned to relax properly, making it impossible to burp. It affects more people than you'd think and causes significant discomfort, extreme bloating, and social anxiety. The most common medical treatment is a botox injection, but it's expensive and not accessible to everyone. I'm a software engineer from Germany and suffered from R-CPD my entire life before curing myself last year. I wanted to make the self-teaching process easier for everyone who comes after me, so I built Learn2Burp. It walks you through exercises with video guidance, builds a workout plan around your specific situation, and includes a burp tracker. There's also a wiki covering the questions I wish I'd had answers to when I started. If you or someone you know has R-CPD, the dedicated r/noburp community is worth checking out. https://lear...

Show HN: adamsreview – better multi-agent PR reviews for Claude Code https://ift.tt/agrC8Dy

I built adamsreview, a Claude Code plugin that runs deeper, multi-stage PR reviews using parallel sub-agents, validation passes, persistent JSON state, and optional ensemble review via Codex CLI and PR bot comments. On my own PRs, it has been catching dramatically more real bugs than Claude’s built-in /review, /ultrareview, CodeRabbit, Greptile, and Codex’s built-in review, while producing fewer false positives. adamsreview is six Claude Code slash commands packaged as a plugin: review, codex-review, add, promote, walkthrough, and fix. I modeled it after the built-in /review command and extended it meaningfully. Because state is stored in JSON artifacts on disk, with built-in scripts for keeping it updated, you can clear context between review stages. The walkthrough command uses Claude’s AskUserQuestion feature to walk you through uncertain findings or items needing human review one by one. Then, the fix command dispa...
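The point about keeping review state in JSON artifacts on disk can be sketched as follows. This is an illustration of the general pattern only, not adamsreview's actual file layout or scripts: the schema (`stage`, `findings`, `needs_human`) and the function names are assumptions made up for this example.

```python
import json
import os
import tempfile
from pathlib import Path


def load_state(path: Path) -> dict:
    """Read review state from disk, or start fresh if no file exists yet."""
    if not path.exists():
        # Hypothetical default schema, not the plugin's real format.
        return {"stage": "review", "findings": []}
    return json.loads(path.read_text(encoding="utf-8"))


def save_state(path: Path, state: dict) -> None:
    """Write state to a temp file, then swap it into place in one step."""
    path.parent.mkdir(parents=True, exist_ok=True)
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as handle:
        json.dump(state, handle, indent=2)
    # os.replace overwrites the destination, so readers never see a partial file.
    os.replace(tmp_name, path)


def add_finding(path: Path, summary: str, confident: bool) -> dict:
    """Append one finding and persist it, so a later stage can reload it."""
    state = load_state(path)
    state["findings"].append(
        {
            "id": len(state["findings"]) + 1,
            "summary": summary,
            "needs_human": not confident,  # uncertain items are flagged for a person
        }
    )
    save_state(path, state)
    return state
```

Because each stage reloads everything it needs from the file, nothing has to survive in the model's context between stages, which is what makes clearing context safe in a design like this.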