惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

I
InfoQ
宝玉的分享
宝玉的分享
Help Net Security
Help Net Security
Last Week in AI
Last Week in AI
罗磊的独立博客
V
Vulnerabilities – Threatpost
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Security Latest
Security Latest
Scott Helme
Scott Helme
S
SegmentFault 最新的问题
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
K
Kaspersky official blog
T
Tailwind CSS Blog
S
Schneier on Security
雷峰网
雷峰网
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
NISL@THU
NISL@THU
O
OpenAI News
A
About on SuperTechFans
博客园 - 聂微东
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
有赞技术团队
有赞技术团队
W
WeLiveSecurity
G
GRAHAM CLULEY
博客园 - Franky
H
Hacker News: Front Page
Forbes - Security
Forbes - Security
C
Cybersecurity and Infrastructure Security Agency CISA
T
Tenable Blog
人人都是产品经理
人人都是产品经理
N
Netflix TechBlog - Medium
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
T
Threat Research - Cisco Blogs
腾讯CDC
C
Cyber Attacks, Cyber Crime and Cyber Security
Engineering at Meta
Engineering at Meta
大猫的无限游戏
大猫的无限游戏
Recorded Future
Recorded Future
酷 壳 – CoolShell
酷 壳 – CoolShell
MyScale Blog
MyScale Blog
L
LINUX DO - 热门话题
P
Privacy International News Feed
P
Privacy & Cybersecurity Law Blog
Schneier on Security
Schneier on Security
阮一峰的网络日志
阮一峰的网络日志
Martin Fowler
Martin Fowler
博客园 - 司徒正美
L
LINUX DO - 最新话题
D
DataBreaches.Net
量子位

HN's home page

Rainbow Query Language | Hacker News Exec into Node via Kubectl An AI native hedge fund The Seven-Action Documentation Model | Hacker News Package Manager for Kubectl Plugins Tongan Castaways | Hacker News Tech overlords plan for conscious AI to conquer the cosmos. What could go wrong? Data Breach Disclosure Lag Is Getting Worse How LLMs Work | Hacker News I Dropped PRDs for Shape Up Go Experiments Explained | Hacker News FCA's Palantir deal could expose UK financial data to Trump's US, critics fear WebXR BCI for Neural-Adaptive Avatar Control in Mixed Reality The first murder conviction via DNA analysis Tom Interviews Theo de Raadt of the OpenBSD Project (2019) [video] Show HN: Replace shell commands with bun shell typescript scripts Quay.io Is Down | Hacker News AI driven analysis of brokerage account fees in the UK Bill Gates Spent Years Crafting His Image. Now It's Cracking Using LLMs to secure source code Wi-Fi 8 in the Lab [video] The household battery revolution that could change energy bills and the world Is Python Becoming Pinyin? | Hacker News Livia – Executive Assistant | Hacker News FindMyPipe – Query Apple Find My from Linux for AI Agents Show HN: Agent skill for creating product launch videos with Remotion RecruitMyself – AI job search copilot for resumes and applications AI coding agents and the erosion of system understanding The 'Resting' Generation and South Korea's Youth Recession AMD Computex 2026: 10 Years of AM4, AM5 Support Through 2029 Docker Networking Explained | Hacker News Textbooks in Tokenland | Hacker News Key Chemistry Question Answered, No Quantum Computer Required Gifts For Retrocomputing Fans – remix yesterday's tech with a modern spin Miscellany № 49: introducing the quasiquote – Shady Characters Amazon Thinks the Future of Data Centers Is a Technical Problem It Just Solved A brief history of the UUID (2017) Flying High Unpressurized (2016) | Hacker News Five Years of Trying to Add Recursion to Lychee How British comfort food won over the French Blorp Language | Hacker News Decache – you might have the internet's lost media in your PC's cache folders Criminal Activities and Migration | Hacker News A free, open-source library of DESIGN.md files for AI-generated UIs MiniMax M3 | Hacker News People are apparently farming citations on ResearchGate – Chuniversiteit Hacker News Basketeer – a typed TS SDK for your Tesco account, with nutrition data 'Penguin' decays from CERN's Large Hadron Collider experiment hint new physics Emergence World: A Laboratory for Evaluating Long-Horizon Agent Autonomy Homebrew lead Mike McQuaid: Sandboxes and Worktrees - My Secure Agentic AI Setup Lean, Not Backpressure | Hacker News AI Dangers Eclipse Nuclear Weapons at Singapore Defense Forum Open source analytics that answers backbase How turkey hacked the hair-transplant industry How GPT Image 2 Is Transforming Marketing Workflows in 2026 Improve Git monorepo performance with a file system monitor Strava for Claude Code MiniMax M3 on Qubrid AI There's Something Else We Should Be Worrying About Celebrity Profile of an A.I. Actress What Is Windows K2? | Hacker News AI is devoid of meaning and humanity. Its vapid voice suits the political moment Show HN: Interpreto – Live Translation for Travel Taxicab Geometry Sealed classes and interfaces in Java (2025) Show HNs | Hacker News My AI Skill Edited This Video That Explains My AI Skill – Arcturus Labs Amazon Pinpoint End of Support The Mystery of the Backward Index MP/M's Process Dispatcher SlimTide Reviews: A Modern Solution for Metabolism and Energy Learning Lustre: Type-safe front end development with gleam Thomas Mann: Goethe Heartened by Panama (As Suez for English, or Danube-Rhine) How to make Message Log of the Unreal Engine 100 times faster Sum-product, unit distances, and number fields Can Meta Buy Belief? | Hacker News Twenty Years of Bigtable | Hacker News Show HN: Combine WigglyPaint GIFs into Video Show HN: AgentThreatBench – Benchmark for AI Agent Memory Security Genius Spotted in the Wild Napkins: Where Ethernet, Compaq and Facebook’s cool data center got their starts (2011) Moderate caffein use alters sleep-related EEG Nvidia Announces RTX Spark | Hacker News Show HN: Ministry of Everything – CLI agent harness for a single operator CEOs blame AI for layoffs, MIT prof says it fits a pattern to find cover story Bugs I didn't expect while building a zsh cleanup script for macOS dev machines Nvidia jumps into PCs with new chip debuting in laptops from Microsoft, Dell, HP Nvidia unveils PC 'superchip' in challenge to Apple and Intel Show HN: Having fun making mini static site apps Synthea API: Create Synthetic Medical Records as a Service Berkshire Hathaway to buy Taylor Morrison for $6.8B in cash The most complex model we understand [video] SanDisk stock is +4,440.53% in the past year Driftwm: What if your window manager worked like a whiteboard? US Immigration enforcement looks into buying ad data AI Is Creating More Work for Australia's Workplace Tribunal Finding New Biblical Cross-References with Codex Glide: A tiling window manager for macOS Ultra-highly efficient enrichment of uranium from seawater via studtite nanodots (2024)
Miuse: Agents for Guidance with Physical Tasks
pratt3000 · 2026-06-18 · via HN's home page

AI still can’t act in the physical world — but it can see, reason, and guide the human who does. Imagine wearing a small camera while an AI sees what you see and talks you through the job in real time: “turn off that valve,” “use the ⅜-inch wrench,” “that part looks worn — replace it.” Instead of months or years of training, workers can become effective immediately with AI coaching and on-demand access to new skills.

Think of coding agents, but for physical work guidance. You have experts write skill folders (for something like HVAC repair) and allow users to use these skill for doing tasks.

I have already built this and am waiting for appstore approval right now. I have been able to do origami, cooking, mic assembly - sort of (its not smooth enough yet, and also doesnt describe things well enough. saying 3/8th wrench is easy but how do you say fold the paper along the x=y line without sounding confusing and mathematical. models just arent built to describe such things and it shows.)

I am also facing the following problems - agents are slow. The typical plan -> do -> correct -> replan cycle is bottlenecked at the 'do' stage because as opposed to the agent being able to do the task, the human has to do it. But also in general, AGENTS ARE SLOW. thinking is faster in humans, its one shot and at the same time also well thought out in parallel - when needed. I want to figure a way so both thinkings can sort of connect from time to time otherwise it just seems out of sync. + building a gate for when longer thinking should correct the answer given by shorter thinking. - verification is harder than in agents. code pretty much spits out true or false but here is more of a 0 to 1 but its a lot less deterministic in physical tasks. I have been thinking how much of a value add adding more sensors (heat, etc) would be for the product. I am guessing its also very task dependent. - static vs pov camera: Phone vs meta glasses - surprisingly phone has its own advantages (fov is the same so its easier to distinguish between frames, and you can also capture more of the workspace since its not super upclose like your eyes would be. - humans seems to have a 3d representation of the worspace in their mind even though we may not be able to see whats behind us there is something in our heads (not sure how its stored) which tell us that the hammer is behind me and there is a nail box roughly behind my foot or so. Additionally we can also feel stuff not in our view before identifiying it, this isnt being modelled into the agent. - we think while we talk, we think while the other person talks, we also think when noone talks - agents only see to do the 3rd thing. maybe I can do 2 as well, but doing 1 is tricky.

Either way... how does this idea sound? 1. Is it better than watching a youtube tutorial, and is it a significant value add over sending a pic to chatgpt and asking what to do? For coding it seems obvious because there used to be so much back and forth when building something, but to all my hardware folks, how often do you check your version of stack overflow when building something? 2. Probably a better question would be, what is the bottleneck when you are building something. (say a robot, car, etc?). I plan on participating in more robotics hackathons just to get a better idea of this. In coding the bottleneck has been sort of getting x lines of code right so that the thing works. claude code basically does that for you, but whats the equivalent in physical tasks.

I have also been thinking that if robotics works out this becomes moot but 1. Its going to take 15+ years for robotics to get good enough imo - although I do doubt this a lot sometimes. 2. I think people still will do some physical tasks themselves regardless of robotics working.