Putting The Senses In AI
John Werner · 2026-05-24 · via Forbes - Innovation
Image: Digitally generated image of multicolored particles forming an eye shape against a black background. (Getty)

It’s no secret that smart wearables are becoming a big industry, and a major factor in that growth is the “awareness” that hardware gains through its sensory apparatus. Machines that see and experience the world around them, in their own ways, use sensors such as cameras and other analytical tools to feed data into the LLM, or brain, of the system.

I wrote last week about a doubling in the smart glasses sector last year, making that a bigger part of tech retail. Then there are all of these robotic applications to business, with automation making greater inroads into production, service jobs, and even things like janitorial work.

"In a benign scenario, probably none of us will have a job," said Elon Musk, the richest man in the world, according to reporting by Eric Revell at Fox Business. "There will be universal high income – and not universal basic income – universal high income. There'll be no shortage of goods or services."

That’s a pretty rosy projection, but the idea is not lost on many with front row seats to this wave of advancement: that in the end, AGI will become so capable that it can do almost any rote task with a great degree of success.

Analyzing Progress

There’s also the question of how we get there. I saw a panel at this year’s April Imagination in Action event at MIT, where a group of accomplished people discussed how sensory AI is flourishing, and what models businesses use to push the envelope. (Disclosure: April’s IIA event is an annual conference that I help to facilitate.)

In moderating the panel, our own Paul Liang of MIT’s Media Lab asked the group where they think the common approach to AI has most gone off the rails.

“The thing I'm actually most worried about is that as AI integrates with all the sensors that we have in our lives, from our watches to our rings to pens to glasses, it will know everything about us,” said Alvin Graylin of Stanford, “and if that data is not controlled by the user, we will at some point become controlled by whoever controls the platforms that owns that data, and I think our loss of agency is one of the biggest risks that we have as humans, as AI becomes more prevalent and as data becomes more available.”

Cinnamon Sipper, CEO of Godela, had this to say about the path to advancing AI:

“I don't believe that the type of output that looks like, you know, general intelligence and physics reasoning will come about by scaling any one model the same way,” Sipper said. “I think, instead, being able to tackle complex physics problem-solving, bringing true physical reasoning into different AI models or different systems, will require a little bit more of an orchestration of different models, as opposed to any just one master general model.”

James Le talked about how things work at his company, TwelveLabs, where he is Head of Developer Experience. He pointed out how so many firms use a method involving big data and supervised learning that is more mechanical, less agile, and less based on teaching the model to understand.

“Our focus as a company is to take the other direction,” he said, “training the video natively on a lot of video content, building these communities that can understand temporal dimension, how spaces relate to each other through time. To that point about orchestration, I think it's also super-important to view kind of a corpus level, video orchestration that can think about concept objects, activities inside the video frame, how they relate to each other, and then, when you ask questions about any specific entities or activities, you can actually derive the context graph, the knowledge graph.”

Domain Expertise

In going over some of these more sophisticated approaches to AI progress, the panel kept returning to the question of whether to lean more toward explainable AI, or something different.

Sipper mentioned the drawbacks of “black box” systems, suggesting that “pouring a bunch of data into a model, and hoping that it solves all sorts of problems, is a little bit of an intractable trade-off in value and investment right now.”

Le explained combining data labeling, which is a big business, with domain-specific modalities, and Graylin expanded on that, noting the constraints of using video to teach robots:

“When you look at just using video, it's not enough fidelity of information to train robots to do activities,” Graylin said, “because they don't have pressure data, they don't have directional data, they don’t have details.”

He continued:

“There’s a lot of occlusion that happens when things are being done, when things are getting complex, and also very fine-grained positional data of objects and body parts and so forth, so if you're looking at just training systems with a lot of video, it still won't solve those kinds of problems. Having a combination of well-labeled data with alternative multimodal sensing, I think that allows you to then create the more sophisticated learning that you're talking about.”

Le elaborated:

“If you train with language first, you acquire the bias of the text modality,” he said, “and in our domain, for example, the temporal motion part gets extremely important, and adding on video as an afterthought is not effective.”

The Big Brain

Some of the discussion also moved toward comparing smart AI to humans.

“If we learn from biology, humans learn about the physical world before we learn language,” Graylin said, “so it would actually make sense to do a multimodal model of learning, because if we're modeling the brain, then it would make a lot of sense to learn from all modes at the same time. In fact, if you look at children who learn multiple languages, they may be a little bit slower in the beginning, but they’ll automatically be able to translate between all these languages eventually.”

“These arguments are great,” Liang noted, “but empirically, we don't see the evidence that large scale natively multimodal training outperforms first training language models, and only then stapling other modalities on as an afterthought. So, do you think something needs to change in maybe the architectures of these models, the way that they are trained, the way that data is collected and presented for these models?”

In response, Graylin pointed to self-driving technologies, where the earlier efforts started out with a lot of labeled data and better LLMs later brought higher-level inference and processing, and described that shift as progress.

Sipper talked about how her company trains with scalar field outputs of simulation data, and the meshes of objects.

Privacy and Agency

As panelists discussed the necessity for privacy and user agency, Graylin argued for systems that keep data on the device by default and share it only when the user chooses to.

“This has to be the default,” he said, “that a system does not share beyond the device that data is collected in, and it’s only serving the user. If the user would like to share that with different platforms, then it makes sense, but if it's automatically being captured by platforms, or the device manufacturers, or an advertising vendor, then there's going to be significant privacy backlash.”

Le, again, presented this through the lens of how his company works:

“We think about government, national security, defense use cases, and in that industry, privacy and security are even more prominent.”

“There is such a strong demand for on-prem solutions,” Sipper said, “that a lot of people haven't really figured out how that is compatible with an increasingly cloud-based infrastructure, and wanting to own different parts of the stack, and so I think there are very interesting business models evolving. I'm sure there are more philosophical, grand big questions that will come about.”

“How do we keep people from allowing machines to direct everything?” Graylin asked, “Because when we start to have everything being sensed, then the machine will just give you the answers, and it will just be automatic, and more and more we will rely on machines to tell us what to do, where to go. We're already doing that today when we drive, but we're going to do that to all aspects of our lives.”

More Senses

“I am really excited about the sense of touch, and the sense of smell,” Liang said in conclusion. “I think some of you already alluded to this, that we need AI that understands the physical world, and for it to understand the physical world, it must feel and interact with objects like people can. So, how do you build really good sensors to capture the sense of touch? How do you build sensors that capture smells of different objects, and use that as a way of recognizing whether something is good or bad, or whether something is dangerous, right? These are all very interesting questions that aim to extend our human senses and implant them into AI machines. We've built systems that can transmit smells over digital mediums, and have somebody else wearing something, and recreate that smell. There's lots of senses beyond language, obviously video, audio, that are part of the human experience, and are worth investigating.”

This was such an interesting foray into what people are doing with AI now. Touch? Taste? Smell?

What do you think? Drop me a comment and let me know.