惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Cyberwarzone
Cyberwarzone
Apple Machine Learning Research
Apple Machine Learning Research
T
The Blog of Author Tim Ferriss
L
LangChain Blog
N
Netflix TechBlog - Medium
O
OpenAI News
罗磊的独立博客
The Last Watchdog
The Last Watchdog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
Google DeepMind News
Google DeepMind News
Stack Overflow Blog
Stack Overflow Blog
www.infosecurity-magazine.com
www.infosecurity-magazine.com
F
Fortinet All Blogs
小众软件
小众软件
B
Blog RSS Feed
H
Help Net Security
L
LINUX DO - 最新话题
S
Security @ Cisco Blogs
Microsoft Azure Blog
Microsoft Azure Blog
S
Secure Thoughts
爱范儿
爱范儿
G
Google Developers Blog
P
Privacy International News Feed
Attack and Defense Labs
Attack and Defense Labs
C
CERT Recently Published Vulnerability Notes
S
Schneier on Security
阮一峰的网络日志
阮一峰的网络日志
MyScale Blog
MyScale Blog
AWS News Blog
AWS News Blog
H
Hackread – Cybersecurity News, Data Breaches, AI and More
Recent Commits to openclaw:main
Recent Commits to openclaw:main
aimingoo的专栏
aimingoo的专栏
U
Unit 42
云风的 BLOG
云风的 BLOG
Recent Announcements
Recent Announcements
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
Project Zero
Project Zero
酷 壳 – CoolShell
酷 壳 – CoolShell
Cloudbric
Cloudbric
The GitHub Blog
The GitHub Blog
C
Cisco Blogs
美团技术团队
D
Docker
I
Intezer
The Cloudflare Blog
Martin Fowler
Martin Fowler
T
Tor Project blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Hacker News - Newest:
Hacker News - Newest: "LLM"

Show HN

暂无文章

offline country lookup (a Trifold library)
jaakl · 2026-06-16 · via Show HN

countrycheck: which country is this point in?

An offline lookup library built on the Trifold grid. The level-10 grid (~7 km cells) classified against country polygons with accurate OSM-derived borders — extended with coastal waters and including X-coded territories like Kosovo and the Caspian Sea — collapses into a 323 KB dataset that names the country anywhere on Earth in microseconds, with a confidence value for every answer. Python and JavaScript give identical results. This page runs the real JS library in your browser; the dataset is embedded right in this HTML file.

Try it on the map

Interactive demo

Load sample points or your own file (CSV lon,lat or GeoJSON points), and every point is resolved to a country in your browser by the bundled library, with no server and no network call per lookup. Each dot is coloured by the country it lands in; open ocean stays grey. The lookups-per-second figure is measured tightly around the classification loop on your machine (map rendering and file parsing excluded), so it is the real library throughput. Use the 100k-random button for a stable number.

Controls

Your own points

Open CSV / GeoJSON…

CSV: lon,lat[,name] per line (or a header naming lat/lon columns in either order). GeoJSON: any FeatureCollection of Points. Files stay on your machine.

Exact border refinement exact polygon test wherever a border crosses a cell (downloads ~19 MB once)

Off: border cells use the bundled best-call with its area share as confidence. On: exact country/country and coastline borders. Watch how the counts, confidence and lookup rate change.

Debug layers source border polygons (needs the refinement layer)

Click anywhere on the map to see the level-10 triangle and its country answer.

one hue per country (256 total)
no country: open ocean (confidence 1.0)
border cell (mixed)
answer changed by refinement

Loading dataset…

Click any classified point for its full answer: country code (GADM gid_0), ISO 2, name, kind, confidence, area share and cell address (computed on the fly for open-ocean points, whose cells are not stored). Switching on exact border refinement makes the source polygons authoritative in every cell a border crosses. Caveats inherited from the source data: coastal waters are an approximate distance-based assignment, not legal EEZ; disputed territories follow GADM (Crimea, Western Sahara…); lakes belong to their surrounding country, except the Caspian Sea, which is its own XCA entry.

User guide

JavaScript (browser or Node)

import { CountryCheck } from "./countrycheck.mjs";

// Node: bundled file · browser: fetch the 323 KB dataset
const cc = await CountryCheck.fromFile();                  // Node
const cc = await CountryCheck.fromUrl("countries_L10.tfcs"); // browser

cc.country(24.7536, 59.437);   // 'EST'  (lon, lat)
cc.check(-0.1276, 51.5072);
// { country: 'GBR', iso2: 'GB', name: 'United Kingdom',
//   kind: 'country', confidence: 1, share: 1,
//   cell: 'TFA95BM', refined: false }

Python (stdlib only)

from countrycheck import CountryCheck

cc = CountryCheck()                    # bundled data
cc.country(24.7536, 59.4370)           # 'EST'
cc.check(-0.1276, 51.5072)
# CountryResult(country='GBR', iso2='GB',
#   name='United Kingdom', kind='country',
#   confidence=1.0, share=1.0, cell='TFA95BM',
#   refined=False)

What the answer means

kindmeaningcountryconfidence
countrycell wholly inside one countrythat country1.0
nonecell absent from the dataset (international waters) null1.0
bordermixed cell; bundled best call decides (may be none) best callarea share
border + refineddecided by the exact source polygon exact0.99

Measured accuracy: 99.82% agreement with exact polygon containment on 30,000 uniform random points. The country and none answers were 100% correct; all residual error lives in border answers, which self-report lower confidence. With the border refinement loaded, agreement reaches 100.0% on the same sample.

Command line

$ python countrycheck/python/countrycheck.py 24.7536 59.4370
EST  iso2=EE  name='Estonia'  kind=country  confidence=1.000  share=1.0  cell=TFAVKGR  refined=False

The CLI loads the border refinement automatically when borders_L10.tfcr is present. Install with pip install countrycheck or npm install countrycheck, or run straight from a repo checkout.

Technical info

Canonical index

Any Trifold cell at level ≤ 10 maps to a contiguous range in the level-10 index space (face·410 + path): a level-l cell covers exactly 410−l consecutive indices. The whole country classification becomes run-length intervals.

TFCS format · 323 KB

222,403 runs as varint(gap), varint(len·2|border) — interior runs carry a country id, border runs a best call plus a 4-bit area share — over a country table of 256 code/iso2/name strings, all zlib-compressed. Level-agnostic.

Lookup path

Pure-float point location (no dependencies, bit-identical to the SDK) descends 10 subdivision levels, then one binary search over the run starts. ~0.6 µs in Node, ~13 µs in pure Python, ~3 µs batched with numpy.

Border refinement · TFCR

Source country polygons clipped to every border cell, quantized to a cell-local 16-bit grid (~0.1 m), one zone per country present with zigzag-varint rings and the even-odd rule. A point-in-polygon test then decides the exact country (or none) in those cells.

The borders come from the timezone-boundary-builder “with oceans” polygons (OSM-derived, already reaching into territorial water); GADM level-0 supplies only the country identity and ISO codes, joined by max land overlap. 256 countries and territories, 6.90M level-10 cells belonging to some country, of which 195,473 are border cells. Full documentation, the one-pass build.py, the PostGIS source build (sql/build_countries_coastal.sql) and the cross-language test suite live in countrycheck/ on GitHub. Roadmap: an L12 (~1.8 km) variant and timezone detection from the same source data.

Accuracy: tested on 57,501 real airports

Ground truth is the OurAirports dump — 57,501 points, each tagged with an ISO country code from an unrelated source. countrycheck places 99.50% of them in the correct country from the bundled 323 KB data, and 99.68% with the border refinement loaded. Interior-country answers are 99.91% correct; the refinement works only on the 729 airports that fall in a border cell, and there it lifts agreement from 80.66% to 95.20% — the payoff of the accurate OSM-derived borders.

99.50% → 99.68%

overall agreement with airport country codes, bundled vs. border-refined. The residual is mostly disputed/border territory the sources map differently, dependencies coded to a parent state, and offshore or placeholder coordinates.

99.995% refined

against exact SQL point-in-polygon containment over the same source polygons (100,000 random points): the refinement resolved all but 5 of the 168 base-mode border disagreements (those 5 are coastal-overlap tie-breaks). Base mode: 99.83%.

Reproduce with scripts/accuracy_countrycheck_airports.py (airports) and scripts/benchmark_countrycheck.py (vs. SQL containment).

Benchmark: 7–97× faster than SQL spatial engines

One workload, four engines: assign a country (gid_0) to 100,000 sphere-uniform random points against the same country polygons. Median of seven warm runs, Apple M5 Pro, June 2026. The refined Trifold answers reproduce exact polygon containment to 99.995% (see above) while running an order of magnitude faster — ~7× PostGIS, ~97× DuckDB Spatial. Called one point at a time, the gap holds.

Batch · 100,000 points per call

Trifold base

452,594 pts/s

Singular · one point per call

DuckDB 1.5.3 and PostGIS 3.6.3 compute exact containment and returned byte-identical answers; BigQuery is documented but was not run in this pass. PostGIS singular includes localhost TCP + Docker transport; DuckDB runs embedded in-process. The live figure in the demo panel is the same throughput measured on your own device. Full methodology, dataset manifest, the airport test and the BigQuery procedure: countrycheck_benchmark.md.