惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Hackread – Cybersecurity News, Data Breaches, AI and More
S
Schneier on Security
罗磊的独立博客
Recorded Future
Recorded Future
Hacker News - Newest:
Hacker News - Newest: "LLM"
G
Google Developers Blog
博客园_首页
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
T
The Blog of Author Tim Ferriss
Know Your Adversary
Know Your Adversary
L
Lohrmann on Cybersecurity
C
Cybersecurity and Infrastructure Security Agency CISA
博客园 - 三生石上(FineUI控件)
M
MIT News - Artificial intelligence
B
Blog
T
Tor Project blog
D
Docker
Engineering at Meta
Engineering at Meta
Apple Machine Learning Research
Apple Machine Learning Research
Spread Privacy
Spread Privacy
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
Scott Helme
Scott Helme
MyScale Blog
MyScale Blog
量子位
T
The Exploit Database - CXSecurity.com
小众软件
小众软件
aimingoo的专栏
aimingoo的专栏
IT之家
IT之家
AWS News Blog
AWS News Blog
Google Online Security Blog
Google Online Security Blog
NISL@THU
NISL@THU
D
DataBreaches.Net
Help Net Security
Help Net Security
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Cloudbric
Cloudbric
美团技术团队
W
WeLiveSecurity
H
Hacker News: Front Page
宝玉的分享
宝玉的分享
The Cloudflare Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
爱范儿
爱范儿
N
News and Events Feed by Topic
V
Visual Studio Blog
C
CERT Recently Published Vulnerability Notes
T
Tailwind CSS Blog
MongoDB | Blog
MongoDB | Blog
F
Fortinet All Blogs
B
Blog RSS Feed
S
Security Affairs

Comments for The Eclectic Light Company

Comment on What does Activity Monitor measure? by hoakley An American in Paris: paintings of Henry Ossawa Tanner 1902-1930 An American in Paris: paintings of Henry Ossawa Tanner 1880-1902 Great Ladies of Impressionism: Berthe Morisot 1874-1891 What to do with a hot Mac Deprecations and removals from Golden Gate Brushstrokes: From El Greco to Rembrandt Portraits of trees: Dutch Golden Age How to get the most from SilentKnight 3 Comment on Hero or hooligan: Achilles becomes the warrior by Deborah J. Brasket Solutions to Saturday Mac riddles 365 Comment on SilentKnight 3.0 for Apple silicon Macs running Sequoia and later (full release) by michaelriccioli Comment on Last Week on My Mac: Uncompressed compressed files by fds Comment on Explainer: Memory by jzonedotcom Comment on Colin Campbell Cooper painting America: 1896-1910 by House of Heart Brushstrokes: 16th century What can you do when an app uses too much memory? SilentKnight 3 second beta adds text and JSON reporting In the shadow: Caravaggism What to do with your encrypted HFS+ disks First beta-test version of SilentKnight 3 for Apple silicon Macs Hero or hooligan: Jason and Medea Comment on Fix documents that won’t open as expected using Quarant2 by EcleX Last Week on My Mac: The mystery of Safari’s Web Archives Last Week on My Mac: Why is it so hard to open a document? Explainer: Disk encryption In the shadow: Caravaggio Brushstrokes: innovators of the first century Changing Paintings: 36 Theseus and the Minotaur macOS virtualisation is leaping forward in Golden Gate Why can’t Preview open that PDF? Crossing the Golden Gate, Intel support, and an update to SystHist Reading the Finder’s Get Info dialog In memoriam Mary Cassatt: 2, 1880-81 Last Week on My Mac: What’s in a name? Explainer: Getting a location Get more from Get Info and the Finder’s contextual menu Stop your photos revealing your location Have you saved thousands of versions? Versatility 1.2 might be what you need Apple has released an update to XProtect for all macOS Apple has released macOS Tahoe 26.5.1 Comment on What Location Services do in macOS by Tristan Hubsch Comment on Protect files with the Locked or Immutable flag by markbot2zero Portraits of trees: Introduction Which tasks require mains power? Comment on Online reference to external displays for Apple silicon Macs by Brian What’s in that phishing email? How to search document versions Rubens’ Peace and War Comment on Last Week on My Mac: Syncing metadata in iCloud Drive by hoakley A weekend with Misia: 2 How to search Time Machine backups? Medium and message: Pottery Hero or hooligan: Theseus and the sandals How QuickLook provides thumbnails and previews Hunting extended attributes with an update to xattred Saturday Mac riddles 360 Comment on How to preserve versions, and how to create versioned PDFs by markbot2zero Comment on What gets synced in iCloud Drive? by hoakley Solutions to Saturday Mac riddles 359 Last Week on My Mac: snapshots, the elephant in APFS How to check whether Spotlight is getting the right metadata macOS Tahoe no longer fully supports Time Capsules The bicentenary of Frederic Edwin Church: 1857-77 macOS virtual machines and audio-video syncing Comment on Use Finder tags for categories by Chuck Last Week on My Mac: Dependency and skill fade Comment on Virtualisation on Apple silicon Macs is different by AndyS Painting Pandora and her box: 1883-1919 Mac Easter eggs Painting Spring blossom 2 Comment on The macOS Natural Language framework and Nalaprop by Ingo Comment on The MACL extended attribute by hoakley On Reflection: Cézanne Privacy: Which folders are protected in Tahoe? Last Week on My Mac: Root cause analysis and ClickFix Last Week on My Mac: Root cause analysis and ClickFix Last Week on My Mac: Root cause analysis and ClickFix Last Week on My Mac: Root cause analysis and ClickFix Why you can’t trust Privacy & Security How can I now have two apps named Pages? How to survive the loss of Rosetta Use Fallback Recovery on Apple silicon Macs Clean install macOS
Last Week on My Mac: Spotlight on semantics
Duncan · 2026-06-29 · via Comments for The Eclectic Light Company

You may have noticed one phrase that was repeated throughout much of WWDC earlier this month, semantic search. Although it had appeared occasionally in the past, this year it came up in more than a dozen presentations, starting in the Platforms State of the Union on day 1. Just what is changing in Spotlight that is semantic?

In traditional search of text content, Spotlight discovers in its content indexes each file containing the search term you have provided. When you search for the term cow, it should return only those files containing those exact characters. This is inevitably a bit more complex, as we normally want search to be case-insensitive, and there are other rules we might want to apply, such as whether that should return words like cower where the term is a prefix, or the place-name Cowleaze, where it’s also a capitalised location name. Those are normally determined by a set of language-specific rules for the Unicode collation applied.

Where there are many hits, as occurs when searching the internet, search ranking can be used to return and order those websites that contain the term and are the most frequently visited, or using a more complex ranking algorithm. But that is of limited use when searching local files.

Semantic search is different, in that its matches aren’t as crisp and Boolean. Rather than working like a simple index, it’s more like a thesaurus in effect. This associates the word cow with a meaning, such as a mature female ox of the species Bos taurus, then looks up related concepts. Some will be close matches, like cattle, bovid, or ungulate, others might be related terms like heifer, an immature cow, and other terms with similar or related meaning.

Semantics is heavily dependent on context. If you’re a farmer, you won’t be interested in the females of other species also known as cows, such as elephants and rhinos, which a zoologist would want to include. A more general audience might want its slang association for a disagreeable woman as another of its associations. There are also regional variations: in US English, cow commonly refers to both sexes and all ages of oxen, while in Australian and New Zealand English it can extend to almost anything that’s deemed objectionable.

In the days before AI, this type of search was often referred to as fuzzy, compared to the crisp black-and-white of regular search, as it not only returns hits that contain the specified term, but those for a grey zone of related terms.

One way to envisage this is to represent concepts, encapsulated as tokens, in multi-dimensional space. Each concept can be located by its coordinates, and by calculating the distance between any two concepts you can express how closely related they are. Semantic search thus tries to discover files and other items of similar and related concepts.

Earlier search methods did this using explicit lists of terms. For example, the photo below shows a few Belted Galloway cattle grazing in a field on chalk downs near here.

Traditionally, if I were maintaining my own image library I’d have to enter detailed information about that image to be stored in Exif metadata, a time-consuming task that’s also prone to error. I could get the location or breed wrong, but we now have the benefit of GPS to ensure at least the location is accurate.

More recently we’ve been able to get images analysed automatically, and in that case it returned a set of keywords to identify the contents:
{animal, cow, mammal, ungulates, outdoor, grass, land, sky, cloudy, "blue sky", plant, shrub}
If we then search for images with the keyword of cow, that should appear in the results, but it omits semantically similar words such as cattle or oxen.

Rather than compiling more exhaustive sets of keywords, semantic search can broaden the scope to cope better. And because we can interact through Siri, we can fine-tune our search results by specifying the cattle should be black and white, perhaps, and combining conventional search criteria such as location.

To get this to work effectively, there are some limitations. Because semantics are so contextual and variable, this involves apps and Core Spotlight. That’s a big benefit to user privacy, as Core Spotlight’s indexes are separated by user and stored locally, although in places like ~/Library/Metadata rather than volume-based Spotlight indexes in the existing hidden .Spotlight-V100 folders. And unlike global Spotlight indexing and search, it requires apps to have code to support both tasks, as it can’t just happen by magic.

While I’m sure we’ll all be impressed with many of the results of semantic search, hits that we never expected to find, it’s going to prove harder to assess those that it misses. That’s the more concerning aspect of the performance of all search systems, and in many cases how we will judge their value. Even if you aren’t impressed yet by other advanced AI coming in Golden Gate, semantic search could prove decisive.