PearsonZero/kodak-pcd0992-statistical-characterization: Per-image principal component decomposition of the Kodak Lossless True Color Image Suite (PCD0992). First published statistical characterization of inter-channel redundancy structure across all 24 images.
2026-04-21 · via Hacker News

Kodak PCD0992 Statistical Profile Series

Per-Image PCA and Inter-Channel Redundancy Analysis of the Kodak Lossless True Color Image Suite

Baetzel, J. (2026)


Overview

This repository contains the first published per-image statistical characterization of all 24 images in the Kodak Lossless True Color Image Suite (PCD0992). Each image is documented as a two-page reference data sheet reporting the complete inter-channel redundancy structure: covariance matrix, eigendecomposition, Pearson correlations, spatial autocorrelation, and derived classification metrics.

All statistics were computed directly from the 8-bit RGB pixel arrays of the standard 768x512 base-resolution PNG distribution. No subjective descriptions appear in any profile. All redundancy classifications are generated programmatically from the computed metrics using fixed thresholds documented in the methodology.


Related Research

Parent Paper: Baetzel, J. (2026). Statistical Characterization of Inter-Channel Redundancy Structure in the Kodak Lossless True Color Image Suite. Per-Image Principal Component Decomposition of PCD0992.

  • Focus: Theoretical framework establishing the first complete per-image PCA decomposition of the Kodak suite. Documents the dimensionality spectrum, blue channel independence range, eigenvector loading patterns, and evidence for deliberate curation across the 24-image collection.
  • Availability: Included in this repository (baetzel_2026_kodak_pca_characterization.pdf).

This Series: Baetzel, J. (2026). Kodak PCD0992 Statistical Profile Series. Per-Image PCA and Inter-Channel Redundancy Analysis.

  • Focus: Individual reference data sheets and machine-readable metric exports for each of the 24 images. Provides the per-image evidence underlying the suite-wide analysis in the parent paper — covariance matrices, eigendecompositions, correlation heatmaps, spatial autocorrelation, and logic-generated redundancy classifications.
  • Availability: /baseline/ directory (24 PDFs + 25 JSON files).

The parent paper establishes why the Kodak suite spans the full spectrum of inter-channel redundancy. The profile series documents what each individual image contributes to that spectrum.


Dataset Specifications

| Property | Value |
| --- | --- |
| Suite | Kodak Lossless True Color Image Suite (PCD0992) |
| Image Count | 24 |
| Resolution | 768x512 or 512x768 |
| Bit Depth | 24-bit (8 bits per channel) |
| Color Space | sRGB |
| Color Mode | RGB |
| Format | PNG (lossless) |
| Provenance | Kodak PCD Film Scanner 2000, 35mm film, PhotoYCC decode to 8-bit RGB |

Computed Metrics Per Image

Each two-page profile reports the following:

Page 1

  • RGB channel distribution (smoothed density curves from pixel data)
  • Per-channel statistics: mean, standard deviation, variance, kurtosis, skewness, min, max
  • Inter-channel correlation heatmap (3x3)
  • Pairwise Pearson correlation coefficients (R-G, R-B, G-B) and suite average
  • Full 3x3 covariance matrix
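
The Page 1 quantities above (per-channel statistics, the Pearson matrix, and the covariance matrix) can be sketched with NumPy. This is a minimal illustration, not the repository's pipeline, which is documented in /docs/methodology.md. The helper name `channel_summary`, the synthetic stand-in image, the use of population (ddof=0) variance, and the reading of "Avg |r|" as the mean absolute pairwise correlation are all assumptions.

```python
import numpy as np

def channel_summary(rgb):
    """Hypothetical helper: per-channel mean/std plus 3x3 covariance and Pearson matrices.

    rgb: array of shape (H, W, 3), dtype uint8, channels ordered R, G, B.
    """
    # Flatten to (N, 3) and promote to float so products cannot overflow uint8.
    px = rgb.reshape(-1, 3).astype(np.float64)
    mean = px.mean(axis=0)
    std = px.std(axis=0)                      # population std (ddof=0), an assumption
    cov = np.cov(px, rowvar=False, ddof=0)    # 3x3 covariance, channels as columns
    corr = np.corrcoef(px, rowvar=False)      # 3x3 Pearson correlation matrix
    return mean, std, cov, corr

# Synthetic stand-in for a 768x512 image: a shared component plus per-channel noise.
rng = np.random.default_rng(0)
base = rng.integers(0, 200, size=(512, 768, 1))
noise = rng.integers(0, 56, size=(512, 768, 3))
img = (base + noise).astype(np.uint8)

mean, std, cov, corr = channel_summary(img)
pairs = {"R-G": corr[0, 1], "R-B": corr[0, 2], "G-B": corr[1, 2]}
avg_abs_r = float(np.mean([abs(r) for r in pairs.values()]))
print(pairs, avg_abs_r)
```

Replacing `img` with a decoded Kodak PNG (loaded by any image library into an (H, W, 3) uint8 array) would give the corresponding values for a real image.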

Page 2

  • Eigendecomposition: eigenvalues, variance explained (%), eigenvector loadings
  • Derived metrics: condition number, eigenvalue ratios, blue channel independence, PC1 dominant channel
  • Dimensionality tier classification
  • Spatial autocorrelation (lag-1, horizontal and vertical)
  • Average local variance (3x3 neighborhood)
  • Redundancy profile (logic-generated from computed metrics)
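
The "average local variance (3x3 neighborhood)" item can be read as the variance of each 3x3 window, averaged over all window positions. The sketch below follows that reading for a single 2-D channel. The function name `avg_local_variance`, the decision to skip border pixels (only fully contained windows are used), and the per-channel treatment are assumptions, since the list does not specify them.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def avg_local_variance(channel, size=3):
    """Hypothetical helper: mean of the variances of all size x size windows."""
    # Windows have shape (H - size + 1, W - size + 1, size, size); borders are skipped.
    windows = sliding_window_view(channel.astype(np.float64), (size, size))
    # Population variance inside each window, then averaged over all positions.
    return float(windows.var(axis=(-2, -1)).mean())

flat = np.full((16, 16), 128, dtype=np.uint8)                    # no local detail
checker = (np.indices((16, 16)).sum(axis=0) % 2).astype(np.uint8)  # maximal local detail

print(avg_local_variance(flat))
print(avg_local_variance(checker))
```

A constant image gives zero, while the 0/1 checkerboard gives 20/81 in every window, because each 3x3 window holds five pixels of one value and four of the other.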

Suite Overview

The 24 images span nearly the full range of inter-channel redundancy configurations achievable through film-based photographic capture. Condition numbers range from 7.55 to 1,739.16 — more than two orders of magnitude — covering color distributions from near-spherical to extremely elongated.

Dimensionality Tiers

| Tier | PC1 Range | Count | Images |
| --- | --- | --- | --- |
| Three-Dimensional (PC1 < 75%) | 69.27-73.37% | 3 | kodim02, kodim03, kodim23 |
| Two-Dimensional (PC1 75-85%) | 81.60% | 1 | kodim14 |
| Weakly One-Dimensional (PC1 85-93%) | 86.87-91.91% | 8 | kodim04, kodim05, kodim07, kodim09, kodim11, kodim18, kodim21, kodim22 |
| Strongly One-Dimensional (PC1 93-97%) | 93.36-96.96% | 7 | kodim01, kodim08, kodim10, kodim12, kodim15, kodim16, kodim19 |
| Near-Degenerate (PC1 > 97%) | 97.36-98.42% | 5 | kodim06, kodim13, kodim17, kodim20, kodim24 |

Eigenvector Loading Patterns

| Pattern | Count | Images |
| --- | --- | --- |
| Green dominant | 7 | kodim03, kodim05, kodim08, kodim09, kodim10, kodim16, kodim17 |
| Green-Blue coupled | 6 | kodim01, kodim04, kodim11, kodim12, kodim15, kodim21 |
| Red dominant | 6 | kodim02, kodim06, kodim14, kodim18, kodim19, kodim23 |
| Balanced | 4 | kodim07, kodim13, kodim20, kodim24 |
| Blue dominant | 1 | kodim22 |

Suite Extremes

| Metric | Low | High |
| --- | --- | --- |
| Avg \|r\| | kodim23: 0.5595 | kodim20: 0.9903 |
| Condition Number | kodim23: 7.55 | kodim20: 1,739.16 |
| PC1 Variance | kodim03: 69.27% | kodim20: 98.42% |
| Blue Independence | kodim15: 2.3% | kodim03: 52.0% |
| Highest Single Pair r | - | kodim20 R-G: 0.9955 |
| Lowest Single Pair r | kodim03 R-B: 0.2890 | - |

How to Read a Profile Sheet

Condition Number (lambda1/lambda3): Ratio of the largest to smallest eigenvalue. High values indicate a needle-like color distribution concentrated along one axis. Low values indicate a more spherical distribution where each channel carries independent information.
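
As a minimal sketch of this ratio, the snippet below takes a 3x3 covariance matrix and divides its largest eigenvalue by its smallest. The function name `condition_number` and the two toy covariance matrices are illustrative assumptions, not values from the suite.

```python
import numpy as np

def condition_number(cov):
    """Hypothetical helper: lambda1 / lambda3 of a symmetric 3x3 covariance matrix."""
    # eigvalsh is for symmetric matrices and returns eigenvalues in ascending order.
    eigvals = np.linalg.eigvalsh(cov)
    return float(eigvals[-1] / eigvals[0])

# Toy covariances (not from the Kodak data): one elongated, one isotropic.
needle = np.diag([1000.0, 10.0, 1.0])
sphere = np.diag([5.0, 5.0, 5.0])

print(condition_number(needle))
print(condition_number(sphere))
```

The elongated case yields a ratio near 1000 and the isotropic case a ratio of 1, matching the needle-like versus spherical reading described above.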

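Blue Channel Independence: The percentage of blue channel variance not captured by the first principal component. Computed as (1 - (blue_loading_PC1^2 x lambda1 / Var(B))) x 100, where blue_loading_PC1 is the blue component of the unit-length first eigenvector, lambda1 is the largest eigenvalue, and Var(B) is the blue diagonal entry of the covariance matrix. Low values indicate the blue channel is almost entirely predictable from the primary variance axis. High values indicate the blue channel carries substantial unique information.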
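
The formula can be sketched directly from a 3x3 covariance matrix. The function name `blue_independence`, the R, G, B channel ordering, and the two toy matrices are assumptions made for illustration.

```python
import numpy as np

def blue_independence(cov):
    """Hypothetical helper: percent of blue variance not captured by PC1."""
    # eigh returns ascending eigenvalues and unit-length eigenvectors as columns.
    eigvals, eigvecs = np.linalg.eigh(cov)
    lam1 = eigvals[-1]                 # largest eigenvalue (PC1)
    blue_loading = eigvecs[2, -1]      # blue component of PC1, channels ordered R, G, B
    var_b = cov[2, 2]                  # Var(B)
    return float((1.0 - blue_loading ** 2 * lam1 / var_b) * 100.0)

# Toy case 1: blue is uncorrelated with a strongly coupled R-G axis.
decoupled = np.array([[100.0, 90.0, 0.0],
                      [90.0, 100.0, 0.0],
                      [0.0, 0.0, 20.0]])
# Toy case 2: all three channels are identical, so PC1 explains blue completely.
identical = np.full((3, 3), 50.0)

print(blue_independence(decoupled))
print(blue_independence(identical))
```

The decoupled matrix gives a value near 100% and the identical-channel matrix a value near 0%, the two ends of the scale described above.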

Dimensionality Tier: Classification based on PC1 variance explained. Thresholds at 75%, 85%, 93%, and 97% produce five tiers from Three-Dimensional to Near-Degenerate, corresponding to distinct regimes of inter-channel redundancy.
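
A threshold classifier of this kind might look like the sketch below. The four cut points come from the text; the function name `dimensionality_tier` and the choice to treat each lower bound as inclusive (so exactly 75% falls in Two-Dimensional) are assumptions, since the boundary handling is not stated here.

```python
def dimensionality_tier(pc1_percent):
    """Hypothetical helper: map PC1 variance explained (percent) to a tier name."""
    # Upper bounds 75, 85, 93, 97 are from the text; lower-bound-inclusive
    # boundary handling is an assumption.
    tiers = [
        (75.0, "Three-Dimensional"),
        (85.0, "Two-Dimensional"),
        (93.0, "Weakly One-Dimensional"),
        (97.0, "Strongly One-Dimensional"),
    ]
    for upper, name in tiers:
        if pc1_percent < upper:
            return name
    return "Near-Degenerate"

# PC1 values listed for kodim03, kodim14, and kodim20 in the Suite Overview.
for image, pc1 in [("kodim03", 69.27), ("kodim14", 81.60), ("kodim20", 98.42)]:
    print(image, dimensionality_tier(pc1))
```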

Eigenvector Pattern: The loading structure of the first principal component. Identifies which channel or channel pair drives the dominant variance axis: balanced (all channels near-equal), coupled (two channels co-load), or dominant (one channel leads).

Spatial Autocorrelation (lag-1): Pearson correlation between each pixel and its immediate neighbor, computed separately for horizontal and vertical directions. Values near 1.0 indicate smooth, spatially coherent image data.
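
One way to compute this, sketched for a single 2-D channel, is to correlate the image with a copy of itself shifted by one pixel. The function name `lag1_autocorrelation`, the per-channel treatment, and the synthetic stripe image are assumptions.

```python
import numpy as np

def lag1_autocorrelation(channel):
    """Hypothetical helper: (horizontal, vertical) lag-1 Pearson correlation."""
    x = channel.astype(np.float64)
    # Horizontal: every pixel paired with its right-hand neighbor.
    horiz = np.corrcoef(x[:, :-1].ravel(), x[:, 1:].ravel())[0, 1]
    # Vertical: every pixel paired with the neighbor directly below.
    vert = np.corrcoef(x[:-1, :].ravel(), x[1:, :].ravel())[0, 1]
    return float(horiz), float(vert)

# Synthetic vertical stripes: random across columns, constant down each column.
rng = np.random.default_rng(0)
columns = rng.integers(0, 256, size=768)
stripes = np.tile(columns, (512, 1)).astype(np.uint8)

h, v = lag1_autocorrelation(stripes)
print(h, v)
```

For the stripe image the vertical value is 1 because each pixel equals the one below it, while the horizontal value stays close to 0 because adjacent columns are unrelated. Natural photographs typically sit near 1.0 in both directions.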


File Structure

/
    README.md
    baetzel_2026_kodak_pca_characterization.pdf
/baseline/
    KODIM01_STATISTICAL_PROFILE.pdf
    kodim01_stats.json
    KODIM02_STATISTICAL_PROFILE.pdf
    kodim02_stats.json
    ...
    KODIM24_STATISTICAL_PROFILE.pdf
    kodim24_stats.json
    kodak_suite_master_stats.json
/docs/
    methodology.md

  • Root: The parent PCA characterization paper and repository README.
  • /baseline/: 24 two-page PDF reference data sheets and 25 JSON files (24 individual + 1 master).
  • /docs/: Computation pipeline documentation for full reproducibility.


References

[1] Eastman Kodak Company. Kodak Publication No. PCD-042, 1992.

[2] Baetzel, J. (2026). “Statistical Characterization of Inter-Channel Redundancy Structure in the Kodak Lossless True Color Image Suite.”

[3] Watanabe, S. “Karhunen-Loeve Expansion and Factor Analysis,” pp. 635-660, 1965.

[4] Giorgianni, E.J. and Madden, T.E. Digital Color Management. Addison-Wesley, 1998.


Citation

Baetzel, J. (2026). Kodak PCD0992 Statistical Profile Series:
Per-Image PCA and Inter-Channel Redundancy Analysis of the
Kodak Lossless True Color Image Suite.

License

Statistical analysis and profile sheets by Jasmine Baetzel (2026). Benchmark images from the Kodak Lossless True Color Image Suite (PCD0992), released by Eastman Kodak Company for unrestricted usage.