慣性聚合 高效追讀感興趣之博客、新聞、科技資訊
閱原文 以慣性聚合開啟

推薦訂閱源

博客园 - 司徒正美
V
V2EX
T
Tailwind CSS Blog
有赞技术团队
有赞技术团队
aimingoo的专栏
aimingoo的专栏
Apple Machine Learning Research
Apple Machine Learning Research
IT之家
IT之家
Blog — PlanetScale
Blog — PlanetScale
A
About on SuperTechFans
月光博客
月光博客
T
The Blog of Author Tim Ferriss
宝玉的分享
宝玉的分享
Martin Fowler
Martin Fowler
博客园 - 聂微东
The GitHub Blog
The GitHub Blog
V
Visual Studio Blog
WordPress大学
WordPress大学
酷 壳 – CoolShell
酷 壳 – CoolShell
Engineering at Meta
Engineering at Meta
GbyAI
GbyAI

DEV Community

Authentication Security Deep Dive: From Brute Force to Salted Hashing (With Java Examples) Why AI Systems Don’t Fail — They Drift Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet" I Replaced Chrome with Safari for AI Browser Automation. Here's What Broke (and What Finally Worked) How Python Borrows Other People's Work The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime Vibe Coding: A Workflow Guide (From Zero to SaaS) Most webhook security guides protect the wrong side. The scary part is delivery. Headless CMS for TanStack Start: Build a Blog with Cosmic EU Age Verification App "Hacked in 2 Minutes" — What Actually Happened Comfy Cloud’s delete function does not actually remove files Running AI Models on GPU Cloud Servers: A Beginner Guide Event-driven media intelligence with AWS Step Functions and Bedrock I scored 500 AI prompts across 8 quality dimensions — here's what broke How to Call Google Gemini API from Next.js (Free Tier, No Backend Needed) The Portal Protocol: Reclaiming Human Connection in the Age of AI How to Fix Your Team's Scattered Knowledge Problem With a Self-Hosted Forum Intro to tc Cloud Functors: A Graph-First Mental Model for the Modern Cloud Designing Multi-Tenant Backends With Both Ownership and Team Access I Built a Neumorphic CSS Library with 77+ Components — Here's What I Learned PostgreSQL Performance Optimization: Why Connection Pooling Is Critical at Scale Cómo construí un SaaS multi-rubro para gestionar expensas en Argentina con FastAPI + Vue 3 🚀 I Built an Ethical Hacking Scanner Tool – Open Source Project I Replaced /usage and /context in Claude Code With a Single Statusline A Pythonic Way to Handle Emails (IMAP/SMTP) with Auto-Discovery and AI-Ready Design I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move EcoTrack AI — Carbon Footprint Tracker & Dashboard Everyone's Using AI. No One Agrees How. 5 self-hosted ebook managers worth trying in 2026 Building Your First AI Agent with LangChain: From Chatbot to Autonomous Assistant Common SOC 2 Failures (Real World) Stop Vibe-Checking Your AI App: A Practical Guide to Evals How to Use SonarQube and SonarScanner Locally to Level Up Your Code Quality Your Next To-Do App Is Dead — I Replaced Mine with an OpenClaw AI Sign a Nostr event in 60 lines of Python using coincurve — no nostr-sdk, no nbxplorer, no rust toolchain ITGC Audit Explained Like You’re in Big 4 Patch Tuesday abril 2026: Microsoft parcha 163 vulnerabilidades y un zero-day en SharePoint Stop scraping everything: a better way to track competitor price changes Listing on MCPize + the Official MCP Registry while routing payments OUTSIDE the marketplace — how I kept 100% of my x402 revenue Building an AI-Powered Risk Intelligence System Using Serverless Architecture Why We Ripped Function Overloading Out of Our AI Toolchain Testing AI-Generated Code: How to Actually Know If It Works SaaS Churn Is Killing Your Business. Here Is What to Do About It (Without a Support Team) The Speed of AI Is No Longer Linear - And Self-Improving Models Are Why How to Implement RBAC for MCP Tools: A Practical Guide for Engineering Teams From Standard Quote to Persuasive Proposal: AI Automation for Arborists I built a CLI that scaffolds complete multi-tenant SaaS apps Axios CVE-2025–62718: The Silent SSRF Bug That Could Be Hiding in Your Node.js App Right Now The dashboard that ended our friendship Data Pipelines Explained Simply (and How to Build Them with Python)
"卫星嵌入"如何将大地化为一寻向量之域
Gérard Cubak · 2026-05-24 · via DEV Community

吾辈探矿之地质学家,卫星之影像,实若金矿(戏言也)。或为水文热蚀之图绘,或为结构断层之辨识,或为险阻之地域踏勘之规划,皆重赖遥感之术。
然则常规之流程,实为技术之苦海:

  1. 下载八位元原始场景(Sentinel、Landsat、ASTER)。
  2. 终日从事大气校正与云/植被掩膜。
  3. 调弄复杂波段比率(如粘土或铁氧化物比率),以显矿物学之妙。

Google 与 DeepMind 乃新近以 AlphaEarth 基础模型撼动此等旧习。今非复处理原始像素,吾辈得用 Satellite Embeddings(卫星嵌入)矣。

吾名 Gérard Cubaka,于是文中,吾将释此技术如何使吾辈查询地球地质,若探文本数据库般简易。


🧠 之理:地质与光谱之印,含六十四维

若君习知大语言模型(如GPT),则知其化字为数学之矢(即嵌入)。AlphaEarth于大地亦然。
是模取多年多传感器之数据:光学影像(反射率)、Sentinel-1雷达数据(地表粗糙度、地形、结构)及气候数据。其将此物理之变,压缩为十米像素之六十四维向量,岁岁更新。
此六十四通道(名曰A00A63 dans Google Earth Engine)非精确光谱带,乃整体语义之印记。

何故此为探查之革命乎?凡二域,其地表地质背景若同——譬如,同属水热蚀变之迹,同种之表土,或相似之伟晶岩出露——则其数学矢量必甚近(余弦距离低),纵使分属二洲。


🛠 数据集技术规格

此全球数据集可于Google Earth Engine(GEE)目录中免费获取,其标识为GOOGLE/SATELLITE_EMBEDDING/V1_ANNUAL.

  • 空间分辨率十米一像素(宜于区域辨识/Greenfield)。
  • 整饬之六十四幅图像(标准化向量)。
  • 频次 : 年度合成(目前可得自二零一七至二零二五)。

💻 事實上:僅數行即可啟動相似性搜索

地质学家最强大的应用场景乃是以例求之。试想,汝于地契之上识得矿化之迹或已知矿脉,可取其向量,命模型寻天下凡具同此数理之印之域。
欲知如何借 Earth Engine 之 Python API 载此数据,请参下文。

import ee
# Initialiser la connexion à Earth Engine
ee.Initialize()

# Charger la collection mondiale de Satellite 
Embeddingsembeddings_collection = ee.ImageCollection("GOOGLE/SATELLITE_EMBEDDING/V1_ANNUAL")

# Filtrer sur les données les plus récentes (ex: 2024)
embeddings_recent = embeddings_collection.filter(ee.Filter.date('2024-01-01', '2024-12-31')).first()

# Afficher les 64 dimensions disponibles (A00 à A63)
print("Bandes d'exploration disponibles :", 
embeddings_recent.bandNames().getInfo())

退出全屏模式

如何融入探索流程?

  1. 快速区域定位:将GEE与BigQuery Vector Search等矢量数据库连接,可计算兴趣区域(模型居所)向量与研究区域其他部分之间的欧氏距离,数秒内生成矿产适宜性地图。

  2. 地貌植被图绘:此模自然涵养土质所引植被之变(生物地球化学应答)。但以无导算法(K-Means)施诸此六十四谱,可无偏见分划地质要域.


⚖️ 地质利弊析

其利✅:

  • 免"数据准备"之阶:云之清理毕,或异时景之繁纹尽去。数据集已备分析之用。
  • 光与雷达之协同:此矢融合光谱之应(物之成)与雷达之应(纹之理、分之裂、地之形),常需手工极繁之工。
  • 适于绿野新开之境。:可于低廉之计算力,扫描广袤之域(如深谷、绿岩带),俟后遣地勤之师。

其❌:

  • 黑箱效应:异于寻常之ASTER波段比,彼处可确知其指目为黄铜矿或高岭土,然此波段A12A45之地质特征,实难以数理明其所以然。
  • 植被之限:此模削云之效,然于密林赤道之域,光学之透犹限於林冠(虽Sentinel-1雷达内置可助其形貌结构之辨)。
  • 年时解析:此非地质之患(地质者,百年不易也),然则此器不适用,不能察矿场日进之功.

🚀 结论

卫星之嵌 乃数据科学与地球科学交汇之枢机。译吾地球之表为向量之域,谷歌为地质勘探者献一宏观定位之利器,其效前所未有,使屏前图像处理之暇锐减,而增实地勘察之功。
君已用人工智能或机器学习以定地质之目标否?何框架(TorchGeo, Rasterio, QGIS)君所偏爱以操弄此等数据?于评论中论之!


若此文君有所感,可留❤️或🦄!循吾DEV.to之页,得更多科技、Python与地理空间人工智能相合之文。

—热拉尔·库巴卡