惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

D
DataBreaches.Net
T
Threatpost
N
News and Events Feed by Topic
PCI Perspectives
PCI Perspectives
V2EX - 技术
V2EX - 技术
D
Docker
G
Google Developers Blog
Microsoft Security Blog
Microsoft Security Blog
N
News and Events Feed by Topic
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
Google Online Security Blog
Google Online Security Blog
The GitHub Blog
The GitHub Blog
Hacker News - Newest:
Hacker News - Newest: "LLM"
Y
Y Combinator Blog
M
MIT News - Artificial intelligence
Blog — PlanetScale
Blog — PlanetScale
博客园 - 司徒正美
T
Troy Hunt's Blog
Webroot Blog
Webroot Blog
Security Archives - TechRepublic
Security Archives - TechRepublic
量子位
Apple Machine Learning Research
Apple Machine Learning Research
H
Help Net Security
F
Full Disclosure
B
Blog
O
OpenAI News
H
Hackread – Cybersecurity News, Data Breaches, AI and More
博客园_首页
Google DeepMind News
Google DeepMind News
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
Engineering at Meta
Engineering at Meta
大猫的无限游戏
大猫的无限游戏
Forbes - Security
Forbes - Security
Know Your Adversary
Know Your Adversary
B
Blog RSS Feed
MongoDB | Blog
MongoDB | Blog
Scott Helme
Scott Helme
T
The Exploit Database - CXSecurity.com
博客园 - 聂微东
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
The Last Watchdog
The Last Watchdog
Recorded Future
Recorded Future
IT之家
IT之家
Project Zero
Project Zero
Stack Overflow Blog
Stack Overflow Blog
小众软件
小众软件
Attack and Defense Labs
Attack and Defense Labs
L
Lohrmann on Cybersecurity
SecWiki News
SecWiki News
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com

DEV Community

Authentication Security Deep Dive: From Brute Force to Salted Hashing (With Java Examples) Why AI Systems Don’t Fail — They Drift Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet" I Replaced Chrome with Safari for AI Browser Automation. Here's What Broke (and What Finally Worked) How Python Borrows Other People's Work The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime Vibe Coding: A Workflow Guide (From Zero to SaaS) Most webhook security guides protect the wrong side. The scary part is delivery. Headless CMS for TanStack Start: Build a Blog with Cosmic EU Age Verification App "Hacked in 2 Minutes" — What Actually Happened Comfy Cloud’s delete function does not actually remove files Running AI Models on GPU Cloud Servers: A Beginner Guide Event-driven media intelligence with AWS Step Functions and Bedrock I scored 500 AI prompts across 8 quality dimensions — here's what broke How to Call Google Gemini API from Next.js (Free Tier, No Backend Needed) The Portal Protocol: Reclaiming Human Connection in the Age of AI How to Fix Your Team's Scattered Knowledge Problem With a Self-Hosted Forum Intro to tc Cloud Functors: A Graph-First Mental Model for the Modern Cloud Designing Multi-Tenant Backends With Both Ownership and Team Access I Built a Neumorphic CSS Library with 77+ Components — Here's What I Learned PostgreSQL Performance Optimization: Why Connection Pooling Is Critical at Scale Cómo construí un SaaS multi-rubro para gestionar expensas en Argentina con FastAPI + Vue 3 🚀 I Built an Ethical Hacking Scanner Tool – Open Source Project I Replaced /usage and /context in Claude Code With a Single Statusline A Pythonic Way to Handle Emails (IMAP/SMTP) with Auto-Discovery and AI-Ready Design I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move EcoTrack AI — Carbon Footprint Tracker & Dashboard Everyone's Using AI. No One Agrees How. 5 self-hosted ebook managers worth trying in 2026 Building Your First AI Agent with LangChain: From Chatbot to Autonomous Assistant Common SOC 2 Failures (Real World) Stop Vibe-Checking Your AI App: A Practical Guide to Evals How to Use SonarQube and SonarScanner Locally to Level Up Your Code Quality Your Next To-Do App Is Dead — I Replaced Mine with an OpenClaw AI Sign a Nostr event in 60 lines of Python using coincurve — no nostr-sdk, no nbxplorer, no rust toolchain ITGC Audit Explained Like You’re in Big 4 Patch Tuesday abril 2026: Microsoft parcha 163 vulnerabilidades y un zero-day en SharePoint Stop scraping everything: a better way to track competitor price changes Listing on MCPize + the Official MCP Registry while routing payments OUTSIDE the marketplace — how I kept 100% of my x402 revenue Building an AI-Powered Risk Intelligence System Using Serverless Architecture Why We Ripped Function Overloading Out of Our AI Toolchain Testing AI-Generated Code: How to Actually Know If It Works SaaS Churn Is Killing Your Business. Here Is What to Do About It (Without a Support Team) The Speed of AI Is No Longer Linear - And Self-Improving Models Are Why How to Implement RBAC for MCP Tools: A Practical Guide for Engineering Teams From Standard Quote to Persuasive Proposal: AI Automation for Arborists I built a CLI that scaffolds complete multi-tenant SaaS apps Axios CVE-2025–62718: The Silent SSRF Bug That Could Be Hiding in Your Node.js App Right Now The dashboard that ended our friendship Data Pipelines Explained Simply (and How to Build Them with Python) The Hidden Cost of AI Systems Nobody Talks About. undefined vs undeclared, and how typeof behaves Switching from file-based jobs to NATS/Kafka in Rust without changing code io_uring Adventures: Rust Servers That Love Syscalls Why Agentic AI is Killing the Traditional Database The POUR principles of web accessibility for developers and designers Quantum Neural Network 3D — A Deep Dive into Interactive WebGL Visualization How To Install Caveman In Codex On macOS And Windows Automation Pipeline Reliability: Why Your Workflow Breaks When Nobody Is Watching I Built an 'Open World' AI Coding Agent — It Works From ANY Folder From Freelancing to Product: A Tech Service Company's SaaS Transformation China's AI Giants: Adding Tencent Hunyuan & ByteDance Doubao to AI University (74 Providers) On the Vibe Coders and Their Lies clerk: Auto-Summarize Your Claude Code Sessions AI Weekly — 2026/04/10–04/17 | The Model Lockdown Is Here, but the Toolchain Is the Real Battleground AI 週報 — 2026/04/10–2026/04/17 模型封鎖潮來了,但工具鏈才是真戰場 Maybe this is how Open-Source apps are born... 🚀 Fine-Tune LLMs with LoRA and QLoRA: 2026 Guide tRPC v11 + Next.js App Router: End-to-End Type Safety Without the Boilerplate ShadCN UI in 2026: Why I Stopped Installing Component Libraries and Started Owning My Components SaaS Billing in React Server Components: Stripe + Supabase Without a Single `useEffect` Join our DEV Weekend Challenge — $1,000 in Prizes Across TEN winners! Submissions Due April 20 at 6:59 AM UTC. Implementing FSRS Spaced Repetition in Flutter + Supabase — Adding Memory Science to an AI Learning App "I Texted My Localhost From the Train — Claude Code Fixed the Bug Before I Got Home" I Built a Sales Prep AI and It Went Deeper Than Expected Design to Code #2: One JSON, Eleven Outputs Solving the 100M-Row Problem: A Summary Table Pattern for High-Volume Push Notification Logs Flutter Web With Wasm: What Actually Changes For Developers I Built 50 Royalty-Free Soundtracks for My Side Project in a Weekend Using AI Music Generation The Vibe Coding Security Checklist: 7 Things to Check Before You Ship Stop Letting Googlebot Guess Fix Your React App's SEO Right Desconstruindo o Streaming do LinkedIn: Como Criar um Engine de Extração de Vídeo de Alta Performance com HLS e FFmpeg (EDA Part-1) EDA (Exploratory Data Analysis) Explained With Real Life — Why Looking at Your Data Is the Most Important Step in Machine Learning Brand Relationship Management at Scale: Our 4-Touch Outreach System for 200+ Brands Why String.fromEnvironment() Might Return an Empty String in Dart JGuardrails 1.0.0 — Hardening Java LLM Apps Against Jailbreaks, Toxicity, and Prompt Injection Plan and Schedule a Full Week of Threads Content From One Claude Conversation Coding Cat Oran Ep3, Five Tables Changed Everything Updated: BFF Pattern I'm done watching freelancers get buried by 200 proposals. So I'm building the alternative. This is my first post BFS Algorithm in Java Step by Step Tutorial with Examples Tracking LLM Pricing Monthly: An Open Dataset for 22 AI Models How We Measure Content ROI on a Comparison Site: Revenue Attribution Without Perfect Data Introducing Nova AI Ops: The AI-Native Operating System for SRE Teams I built a free desktop video downloader for Windows — Grabbit How Talkie OCR Helps Vision-Impaired & Dyslexic Users Read the World Around Them VRCFaceTracking安装和iPhone面捕配置教程,有bug Even CrowdStrike Can't See Your Agents The Automation Gold Rush: What n8n Workflows and Claude Are Opening Up for Developers Right Now
I Built Skills That Let AI Agents Query 14 European Government Registries
Peter · 2026-04-29 · via DEV Community

TL;DR

  • I built 14 Apify actors that scrape official government registries across Poland, Spain, Austria, and France
  • To make them accessible from AI coding agents, I packaged them as 6 open-source skills in the getregdata repo
  • One install command gives you access across 45+ AI agents - Claude Code, GitHub Copilot, Cline, Codex, Amp, and more
  • The skills provide registry-specific analysis frameworks and let you query live data through the Apify API
  • Install: skills add Nolpak14/getregdata -g -y

Why Government Registry Data Is Hard

If you have ever tried to programmatically access European government registries, you know the pain. Each country has its own set of portals, each with its own quirks:

  • Poland's KRS (National Court Register) deliberately anonymizes board member names in its API - returning L****** instead of full names - while the same data is available unredacted in PDF extracts from the same portal
  • Poland's CRBR (Beneficial Owners Register) has no API at all - just a CAPTCHA-protected web form that accepts one company at a time
  • Poland's EKW (Land Registry) blocks all datacenter IPs - you need residential Polish proxies to access property ownership data
  • Spain's BORME (Corporate Gazette) publishes 500+ corporate acts daily as unstructured gazette text with no machine-readable format
  • Austria's Ediktsdatei (Insolvency Publications) requires an IWG license for the official API - but the public web portal has no such restriction
  • France's Societe.com aggregates data from INSEE, INPI, and BODACC behind aggressive anti-bot measures

Each registry is a separate engineering challenge. Different authentication, different anti-scraping measures, different data formats, different legal frameworks.

The Actor Suite: 14 Registries, 4 Countries

Over the past months I built Apify actors for all of these. Each actor handles the specific technical challenges of its registry and returns clean, structured JSON.

Poland (9 registries)

Registry What You Get Records
KNF - Financial Supervision Authority Payment institutions, e-money issuers, lending companies 75,000+
MSiG - Court Gazette (Monitor Sadowy) Bankruptcy declarations, restructuring, liquidation notices 2001-present
KRS - National Court Register Full non-anonymized board member names from PDF extracts 800,000+ companies
KRZ - National Debtor Registry Bankruptcy, restructuring, enforcement proceedings 9 search modes
eKRS - Financial Statements Balance sheets, P&L, assets, equity, revenue, net profit Official filings
EKW - Land Registry Property ownership, mortgages, easements, restrictions 25 million entries
UOKiK - Consumer Protection Court-ruled prohibited contract clauses 7,500+ clauses
CRBR - Beneficial Owners Ultimate beneficial owners (UBO) for AML compliance All KRS entities
BDO - Waste Registry Waste management entity registrations and permits 674,000+ entities

Spain (2 registries)

Registry What You Get
BORME - Corporate Gazette Daily incorporations, officer appointments, capital changes, dissolutions
Registro Mercantil - Company Directory NIF, officers, CNAE codes, legal form, IRUS, EUID

Austria (2 registries)

Registry What You Get
Ediktsdatei - Insolvency Publications Bankruptcy, restructuring, debt regulation proceedings - no IWG license needed
WKO - Chamber of Commerce Directory 620,000+ businesses with phone, email, website, trade licenses

France (1 registry)

Registry What You Get
Societe.com - Company Data SIREN, directors with roles, financials, shareholders, subsidiaries, brands

All actors are pay-per-result with no subscriptions. Pricing ranges from $0.003 to $0.01 per result depending on the registry. A free Apify account includes $5 credits - enough for 100-1,600 lookups.

From Actors to AI Agent Skills

The actors work great when you know exactly which registry to query. But for many users, the real question is: "I need to check this Polish company - where do I even start?"

That is where skills come in. Skills are modular packages that give AI coding agents specialized knowledge. When you install a skill, your agent learns how to handle domain-specific tasks - in this case, querying European government registries.

I packaged the 14 actors into 6 skills organized by workflow:

Skill What It Does Registries Used
regdata Router - identifies which registry you need All 14
regdata-kyc-aml Beneficial ownership, financial licenses, board members CRBR, KNF, KRS, Societe.com, WKO, Spain Dir
regdata-credit-risk Insolvency proceedings, court gazette, financial statements KRZ, MSiG, eKRS, Ediktsdatei, BORME
regdata-property Polish land registry - ownership, mortgages, encumbrances EKW, KRS, CRBR
regdata-compliance Prohibited contract clauses, waste management registrations UOKiK, BDO
regdata-lead-gen Company directors, business contacts, new incorporations KRS, WKO, Spain Dir, Societe.com, BORME

Each skill includes:

  • Analysis frameworks specific to the registries it covers (e.g., how to interpret Polish land registry sections, what Austrian insolvency proceeding types mean)
  • Data extraction instructions for both MCP mode (direct tool calls) and API mode (curl/SDK)
  • Output interpretation guides so you know what the returned data actually means

Installation

One command installs all 6 skills globally across every supported agent:

skills add Nolpak14/getregdata -g -y

Enter fullscreen mode Exit fullscreen mode

This works with 45+ AI agents including Claude Code, GitHub Copilot, Cline, Codex, Amp, Antigravity, and more. The skills are installed to ~/.agents/skills/ and symlinked to agent-specific locations.

How It Works in Practice

Example 1: Check beneficial owners of a Polish company

You open Claude Code (or any supported agent) and type:

Check who owns the Polish company with NIP 5213103635 in CRBR

Enter fullscreen mode Exit fullscreen mode

The regdata-kyc-aml skill activates. It knows that CRBR is Poland's beneficial owners registry and that NIP is the correct identifier. It guides the extraction:

from apify_client import ApifyClient

client = ApifyClient("YOUR_APIFY_TOKEN")

run = client.actor("regdata/crbr-beneficial-owners-scraper").call(
    run_input={"nip": "5213103635"}
)

for item in client.dataset(run["defaultDatasetId"]).list_items().items:
    print(f"Company: {item['name']}")
    for owner in item.get("beneficialOwners", []):
        print(f"  UBO: {owner['firstName']} {owner['lastName']}")
        print(f"  Ownership: {owner.get('sharePercentage', 'N/A')}%")

Enter fullscreen mode Exit fullscreen mode

If you have the Apify MCP server configured, the agent can run the actor directly through tool calls without writing any code.

Example 2: Monitor Spanish corporate gazette for new incorporations

Search BORME for new company incorporations in Barcelona from last week

Enter fullscreen mode Exit fullscreen mode

The regdata-credit-risk skill activates (BORME corporate acts). It builds the right query:

run = client.actor("regdata/borme-corporate-acts-scraper").call(
    run_input={
        "dateFrom": "2026-04-21",
        "dateTo": "2026-04-28",
        "provinces": ["Barcelona"]
    }
)

Enter fullscreen mode Exit fullscreen mode

Example 3: Extract Polish land registry data

Pull EKW data for property WR1K/00094598/3

Enter fullscreen mode Exit fullscreen mode

The regdata-property skill activates. It knows the KW number format, that EKW requires residential Polish proxies, and how to interpret the four sections (Dzial I-IV):

run = client.actor("regdata/ekw-ksiegi-wieczyste-scraper").call(
    run_input={
        "kwNumbers": ["WR1K/00094598/3"],
        "viewType": "aktualna",
        "sections": ["all"]
    }
)

Enter fullscreen mode Exit fullscreen mode

The skill then explains what each section means - Dzial II for ownership, Dzial III for restrictions and easements, Dzial IV for mortgages - so you can actually interpret the raw data.

Example 4: Get non-anonymized KRS board members

Extract board members for Polish KRS number 0000019193

Enter fullscreen mode Exit fullscreen mode

The regdata-lead-gen skill knows that the official KRS API anonymizes names, and that the actor works around this by extracting full names from PDF court extracts:

run = client.actor("regdata/krs-fullnames-scraper").call(
    run_input={
        "krsNumbers": ["0000019193"],
        "extractType": "aktualny"
    }
)

Enter fullscreen mode Exit fullscreen mode

You get full first and last names of board members, supervisory board, and shareholders - not the L****** that the API returns.

What Makes This Different from a Generic Scraping Skill

The value of domain-specific skills is that they carry registry knowledge that a generic web scraping tool does not have:

  • The skill knows that EKW requires residential Polish proxies (datacenter IPs are blocked)
  • It knows that KRS anonymizes names in the API but not in PDFs
  • It knows that BORME Section A publishes corporate acts while Section B publishes financial data
  • It knows that Austria's Ediktsdatei has Konkursverfahren (bankruptcy) and Sanierungsverfahren (restructuring) and what the difference means
  • It knows that a Polish KW number follows the format CODE/NUMBER/DIGIT (e.g., WR1K/00094598/3)
  • It knows that CRBR defines beneficial ownership as >25% shares or voting rights

This registry-specific knowledge means your AI agent can handle queries correctly without you needing to read documentation for each portal.

Authentication Setup

The skills provide analysis frameworks for free, but extracting live data requires an Apify API token:

export APIFY_TOKEN=apify_api_xxxxx

Enter fullscreen mode Exit fullscreen mode

Get your token by creating a free Apify account - includes $5 credits, enough for hundreds of registry lookups.

Pricing Quick Reference

All actors use pay-per-result pricing. No subscriptions, no minimum commitments.

Registry Cost per Result
KNF, UOKiK, WKO, BORME $0.003
MSiG, BDO $0.004
Ediktsdatei, Spain Dir, Societe.com $0.005
KRZ $0.006
KRS Board, eKRS, CRBR $0.008
EKW $0.01

Typical cost for a full company check (CRBR + KNF + KRS Board): ~$0.019.

Technical Details

The skills repo follows the standard skills ecosystem structure:

getregdata/
  skills/
    regdata/SKILL.md              # Router
    regdata-kyc-aml/SKILL.md      # + references/
    regdata-credit-risk/SKILL.md  # + references/
    regdata-property/SKILL.md     # + references/
    regdata-compliance/SKILL.md   # + references/
    regdata-lead-gen/SKILL.md     # + references/

Enter fullscreen mode Exit fullscreen mode

Each SKILL.md uses YAML frontmatter for trigger matching:

---
name: regdata-kyc-aml
description: "Extract beneficial ownership data from Poland's CRBR registry,
  financial license status from KNF, non-anonymized board members from KRS..."
metadata:
  version: 1.0.0
  author: regdata
  triggers:
    - "CRBR beneficial owner lookup"
    - "check KNF registry"
    - "KRS board members extract"
    - "Polish beneficial owners"
---

Enter fullscreen mode Exit fullscreen mode

Triggers are registry-specific, not generic domain terms. The skill activates when you mention a specific European registry, not when you ask about "KYC" or "compliance" in general.

The references/ subdirectories contain detailed interpretation guides (land registry section breakdowns, insolvency proceeding types across jurisdictions, UOKiK clause categories) that the agent loads on demand.

Source Code and Contributing

The skills are MIT licensed and open source:

GitHub: github.com/Nolpak14/getregdata

The underlying Apify actors are available on the Apify Store under the regdata publisher profile.

If you work with government registries in other EU countries and want to contribute skills for those, PRs are welcome.

What Is Next

  • More registries: Germany (Handelsregister, Insolvenzbekanntmachungen), Italy (Registro Imprese), Netherlands (KVK)
  • Deeper cross-registry workflows: automated multi-country due diligence pipelines
  • Scheduled monitoring: get notified when a company's registry data changes

Install: skills add Nolpak14/getregdata -g -y

Actors: apify.com/regdata

GitHub: github.com/Nolpak14/getregdata


This article is part of the European Registry Data series covering programmatic access to government registries across the EU.