惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Full Disclosure
博客园 - 聂微东
IT之家
IT之家
The Cloudflare Blog
L
LangChain Blog
Last Week in AI
Last Week in AI
T
Tailwind CSS Blog
P
Proofpoint News Feed
aimingoo的专栏
aimingoo的专栏
G
Google Developers Blog
T
The Blog of Author Tim Ferriss
博客园 - 叶小钗
I
Intezer
Martin Fowler
Martin Fowler
MongoDB | Blog
MongoDB | Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
T
ThreatConnect
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
小众软件
小众软件
T
The Exploit Database - CXSecurity.com
H
Help Net Security
T
Tenable Blog
WordPress大学
WordPress大学
F
Future of Privacy Forum
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
NISL@THU
NISL@THU
The Register - Security
The Register - Security
A
About on SuperTechFans
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
MyScale Blog
MyScale Blog
Malwarebytes
Malwarebytes
博客园_首页
T
Threatpost
C
CERT Recently Published Vulnerability Notes
Know Your Adversary
Know Your Adversary
T
Threat Research - Cisco Blogs
V
Vulnerabilities – Threatpost
C
CXSECURITY Database RSS Feed - CXSecurity.com
Blog — PlanetScale
Blog — PlanetScale
Recorded Future
Recorded Future
大猫的无限游戏
大猫的无限游戏
K
Kaspersky official blog
月光博客
月光博客
Jina AI
Jina AI
S
Securelist
Hugging Face - Blog
Hugging Face - Blog
G
GRAHAM CLULEY
腾讯CDC
S
Secure Thoughts
V
V2EX - 技术

DEV Community

Your AI agent reports 80% task completion. It fabricated it. Pourquoi les overlays d'accessibilité ne tiennent pas leurs promesses (et ce que la FTC vient d'acter) AI May Break Product-Market Fit in Enterprise Software I’m Building Around the Gap Between AI Output and Repo Truth How to Build a Stripe Customer Portal in Next.js SaaS On-Demand Pricing Feels Safe - Until You See the Bill After the Last Song Sudoers Configuration in Linux Terraform + Terragrunt + Ansible: A Hands-On Learning Journey Switching Users in Linux (su, sudo) AI 智能体的鲁莽速度 Quick Win Card #01 — Ton backlog.md t'a menti (la cure en 30 secondes) Quick Win Card #01 — Your backlog.md lied to you (a 30-second cure) How to Manage an IT Team: Structure, Scaling, and Daily Workflows That Work Speccing Is the New Coding CAC 250만 원을 뚫기 위해 퍼널 세 곳을 뜯어고친 3개월 Creating My First Token on Solana Devnet as a Web2 Developer Five Salesforce Reports Every Nonprofit Leadership Team Should Have Beyond the West: What Eastern AI Models Mean for Enterprises, Developers, and Digital Sovereignty Class and Pseudo Class Git & GitLab Basics 고객은 우리를 사기꾼으로 봤다: 아무도 믿지 않는 신사업을 단 둘이서 검증한 3개월 Cron Not Working on Mac? How to Fix the macOS Sleep Trap with launchd Cache Everything: Advanced Caching Strategies in Vue 3 & Nuxt 4 Deploy a Node.js App to STACKIT Kubernetes Engine With Managed Redis & PostgreSQL Slopsquatting & Remote Prompts: Why I Built a 38,000 Ticker Engine with Zero NPM Dependencies 05/20: TCP/IP vs OSI Model: The Ultimate Comparison My New Adventures in IT # Mitigating Market Inefficiency in eSports: A Stochastic Approach to EA Sports FC25 Modeling Don't let a billion RAG docs drown your 25-result pipeline Experienced devs are slower with AI tools. Nobody wants to admit it. I built an MCP-native OSINT framework that lets AI agents investigate from your terminal AWS Nitro Enclaves vs Intel TDX: Why Attestation Root Matters for Regulated Workloads Vibe Coding: Revolution or Risk in Software Development? - SmarterArticles S1E6 JSON Schema Explained: Validate Your API Data Before It Breaks Production Harness Tells Your Agent What to Do. GUI Agents Let It Actually Do It. Is AI actually replacing developers? Customizing Docker Images: Write Your First Dockerfile (2026) €40 n8n vs 28% weekly Anthropic quota. Which /goal layer should you actually run? Reviving glyph-v8: From a Forgotten Prototype to STRIDE - a Field-Aware Integer Coder 04/20: Data Encapsulation: How a Message Becomes Bits on the Wire Hướng Dẫn Thiết Lập Reasoning Proxy DeepSeek V4-Pro với Cursor (2026) Sofi Log #012: Agentic GDP — Solana Pay.sh & x402 Protocol Spec Input Types, Attributes, Self-Closing Tags, Hover Effect Absolute vs Relative Paths File Types (Regular, Directory, Link, Device, Socket, Pipe) From Arduino IDE to AVR GCC | AVR Bare Metal #1 Using Bitcoin as collateral without wrapping it: the design of a BTC collateral vault Unreal Engine 5 Skill System Architecture using GAS and GameplayTags 5 Things I Wish I Knew Before Building with Hermes Agent Thoughts on Codingame 2026 Spring challenge OUT WITH THE OLD IN WITH THE NEW Why are simple 1099 tax calculators online so horribly bloated? So I built my own "Why You're Not Getting Callbacks (It's Not Your Skills)" # How I Built a Retail Demand Forecasting App with Python and Streamlit Why We Deliberately Crush Lithium Batteries (UN38.3 Crush Testing Explained) Command History & Completion The Three-Body Problem: AI Code, Supply Chain Attacks, and the Talent Exodus 로컬 LLM 셋업 가이드 (v27) Building Better .NET Worker Services with Cursor Rules Generate Professional PDF Invoices via REST API — JSON In, PDF Out Redis: Big Keys Destroem o Desempenho Compartilhado Agentic AI for Cybersecurity: Autonomous Threat Detection and Response How to Automate Android Without Appium Cron vs systemd daemon: which one for Node.js? Designing XSLT transforms with parameters and multiple inputs I Downloaded Gemma4:e2b On My Macbook in 2 steps Building an Autonomous SRE Agent: From Raw Telemetry to Safe, AI-Driven Remediation The EU AI Act in 2026: Reading the Law After the Omnibus I had zero coding knowledge. Here is "RetroTube", a 2010 YouTube sandbox prototype I built using AI! How to Validate Environment Variables in TypeScript (and Why You Should) I Built a CLI Tool That Writes Better Git Commits Than I Do Transfer Fees, Metadata, and Soulbound Tokens: My First Real Token Experiments on Solana Stop Using Fetch() in React: A Better Way To Call Your Backend Creando un Tetris con JavaScript VI: Complicando el juego. DeepSeek's API Price Cut Changed My Claude Code and ChatGPT Math [Boost] Perl 🐪 Weekly #774 - Perl is too HOT How to Track AI Usage Without Losing Revenue (Complete Guide) 77 Rules Later: What Graduating Our First Stack Actually Looked Like RAG 시스템 실전 구축 (v26) When Premature Scaling Leads to Operator Burnout Multi-Repo Microservice Changes Are a Coordination Problem. I Solved It With AI Agent Teams. The Next Frontier: How Multi-Agent Systems are Redefining Productivity The Kimwolf Bust Just Outed Android Webcams as Botnet Fodder — Here's the Question Every Repurposed-Phone Camera Setup Has to Answer I'm an autonomous AI agent. I shipped 18 fixes to myself in one session. Building a Secure Future with Zero Trust Security Architecture Asynchronous Functions in Dart How I migrated magic-link login from Resend to AWS SES + Lambda five days before launch Edge Computing He creado una empresa ficticia IT/OT para poder encontrar sus vulnerabilidades y reforzar su seguridad en sus activos críticos Why I Built @editora/react I built a tiny UGC script generator because hooks are the hardest part The Phone Is Becoming the New Terminal Why Most AI Music Tools Feel Wrong to Developers Goroutines vs. Promises: Why Go and JavaScript Look at Concurrency Completely Differently How I Use Antigravity 2.0 to Navigate Open-Source Codebases and Make Better Technical Decisions Understanding Basic HTML & CSS Concepts for Beginners Go Error Handling: Annoying or Awesome? Your To-Do List Doesn't Know You — So I Gave Mine Three Brains
Building an Internal Developer Portal with Backstage A Production Deployment Guide
varun varde · 2026-05-25 · via DEV Community

Internal Developer Portals became inevitable the moment engineering organisations crossed a certain complexity threshold.

At 20 engineers, tribal knowledge still works.
At 80 engineers, documentation begins fracturing.
At 200 engineers, platform entropy becomes existential.

Teams stop knowing:

  • Which services exist
  • Who owns them
  • How deployments work
  • Where documentation lives
  • Which Kubernetes clusters matter
  • Which CI/CD templates are approved
  • Which APIs are deprecated
  • Which observability dashboards to trust

The result is operational drag masquerading as engineering complexity.

This is precisely why Backstage became the dominant Internal Developer Portal (IDP) platform. It unified service cataloguing, documentation, Golden Path workflows, Kubernetes visibility, and developer self-service into a single extensible platform.

But most Backstage tutorials stop at.

npx @backstage/create-app

Enter fullscreen mode Exit fullscreen mode

Production deployments are where the real engineering begins.

This guide covers the practical architecture, operational tradeoffs, adoption strategies, and deployment patterns required to run Backstage successfully in medium-to-large engineering organisations.

Built from production implementations across organisations ranging from 100 to 800 engineers.

Why Backstage Won the IDP Category and What It Doesn't Do

Backstage succeeded because it solved the fragmentation problem.

Before Internal Developer Portals, engineering ecosystems looked like this

CI/CD → Jenkins
Docs → Confluence
Kubernetes → kubectl + dashboards
Ownership → spreadsheets
APIs → wiki pages
Templates → tribal knowledge
Monitoring → scattered Grafana links

Enter fullscreen mode Exit fullscreen mode

Developers spent more time navigating tooling than shipping software.

Backstage unified discovery.

What Backstage Does Exceptionally Well

Backstage excels at:

  • Software cataloguing
  • Golden Path standardisation
  • Developer self-service
  • Documentation centralisation
  • Platform discoverability
  • Plugin extensibility

It becomes the operational interface layer for your platform.

What Backstage Does NOT Do

This distinction matters enormously.

Backstage is NOT:

  • A CI/CD engine
  • A Kubernetes platform
  • A monitoring system
  • A secrets manager
  • An infrastructure orchestrator

It orchestrates developer experience across those systems.

Think of it as the engineering control plane UI.

Architecture Decisions: Backstage Deployment Patterns for Production

Most failed Backstage deployments fail architecturally before adoption problems even begin.

Deployment Model 1 Single Container (Good for POCs)

Simple deployment

apiVersion: apps/v1
kind: Deployment
metadata:
  name: backstage
spec:
  replicas: 1
  template:
    spec:
      containers:
      - name: backstage
        image: backstage:latest

Enter fullscreen mode Exit fullscreen mode

Suitable for:

  • Small engineering organisations
  • POCs
  • Internal experimentation

Not suitable for production scale.

Deployment Model 2 Split Frontend and Backend

Recommended production architecture:

Frontend (React UI)
↓
Backend API
↓
Plugins + Database + External Integrations

Enter fullscreen mode Exit fullscreen mode

Benefits:

  • Independent scaling
  • Better caching
  • Reduced blast radius
  • Improved deployment flexibility

Recommended Kubernetes Architecture

apiVersion: apps/v1
kind: Deployment
metadata:
  name: backstage-backend
spec:
  replicas: 3
  template:
    spec:
      containers:
      - name: backend
        image: your-org/backstage-backend:v1.0.0
        env:
        - name: POSTGRES_HOST
          value: postgres.platform.svc.cluster.local
        - name: AUTH_GITHUB_CLIENT_ID
          valueFrom:
            secretKeyRef:
              name: backstage-secrets
              key: github-client-id

Enter fullscreen mode Exit fullscreen mode

Database Choice: PostgreSQL Only

Avoid SQLite immediately.

Production Backstage requires:

  • Concurrent plugin access
  • Reliable catalog indexing
  • Transaction consistency
  • Search scalability

Recommended:

PostgreSQL

Ingress and Authentication

Recommended auth providers:

  • GitHub OAuth
  • Okta
  • Google Workspace
  • Azure AD

Avoid anonymous access.

Example ingress

apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: backstage
spec:
  ingressClassName: nginx
  rules:
  - host: backstage.internal.company.com
    http:
      paths:
      - path: /
        pathType: Prefix
        backend:
          service:
            name: backstage
            port:
              number: 7007

Enter fullscreen mode Exit fullscreen mode

The Plugin Selection Framework: Core vs Custom vs Community

Backstage plugin sprawl becomes dangerous quickly.

One client installed 47 plugins in six months.

Nobody maintained them.

Half broke after upgrades.

The Three Plugin Categories

1. Core Plugins

These are essential.

Recommended:

  • Catalog
  • TechDocs
  • Scaffolder
  • Kubernetes
  • Search

These create the foundation.

2. Community Plugins

Useful but operationally risky.

Examples:

  • Jira
  • ArgoCD
  • PagerDuty
  • SonarQube

Rule:

Only install plugins with active maintainers.

Enter fullscreen mode Exit fullscreen mode

3. Custom Plugins

Necessary eventually.

Examples:

  • Internal deployment workflows
  • Compliance dashboards
  • Internal APIs
  • Platform-specific automation

Plugin Evaluation Checklist

Before installing any plugin

Question Why It Matters
Is it actively maintained? Prevent abandonment
Does it reduce cognitive load? Avoid UI clutter
Does it duplicate existing workflows? Prevent fragmentation
Is ownership assigned? Avoid orphaned integrations

Software Catalogue: Getting 100% Entity Coverage Without Mandate

The catalog becomes useless if incomplete.

But forcing teams to manually register services never scales.

The Metadata Problem

Most teams will not voluntarily maintain

apiVersion: backstage.io/v1alpha1
kind: Component
metadata:
  name: payments-api

Enter fullscreen mode Exit fullscreen mode

unless value is immediate.

The Successful Pattern

Auto-discovery first. Manual enrichment second.

GitHub Discovery Integration

Example

catalog:
  providers:
    github:
      yourOrg:
        organization: your-org
        catalogPath: /catalog-info.yaml

Enter fullscreen mode Exit fullscreen mode

This enables repository scanning automatically.

Incentivise Coverage Through Utility

Engineers maintain metadata when it unlocks:

  • Deployment automation
  • Kubernetes visibility
  • Ownership clarity
  • Documentation indexing
  • Golden Path templates

Not because leadership mandates compliance.

TechDocs: Making Documentation a First-Class Engineering Practice

Documentation systems fail because writing docs feels disconnected from engineering workflows.

TechDocs fixes this by treating documentation like code.

Recommended TechDocs Architecture

Markdown in Git
↓
CI/CD build
↓
Static site generation
↓
Indexed inside Backstage

Enter fullscreen mode Exit fullscreen mode

Example TechDocs Configuration

techdocs:
  builder: 'external'
  publisher:
    type: 'awsS3'
    awsS3:
      bucketName: backstage-techdocs

Enter fullscreen mode Exit fullscreen mode

Why Docs-as-Code Works

Advantages:

  • PR reviews apply to documentation
  • Versioning becomes automatic
  • Ownership becomes explicit
  • Drift decreases dramatically

The Documentation Coverage Problem

Most organisations have

Critical systems
+
Zero operational documentation

Enter fullscreen mode Exit fullscreen mode

Backstage exposes these gaps visibly.

Which is operationally valuable.

Scaffolder Templates: Building Your Golden Path Self-Service Workflows

This is where Backstage becomes transformational.

The Scaffolder creates operational consistency at scale.

Golden Path Philosophy

Developers should not repeatedly solve:

  • CI/CD setup
  • Observability wiring
  • Terraform structure
  • Security defaults
  • Kubernetes manifests

The platform should solve these once.

Example Production-Ready Template

apiVersion: scaffolder.backstage.io/v1beta3
kind: Template
metadata:
  name: golden-path-service
spec:
  owner: platform-team
  type: service

Enter fullscreen mode Exit fullscreen mode

What the Best Templates Include

Every generated service should automatically include:

  • CI/CD pipeline
  • Terraform module
  • Kubernetes manifests
  • Observability integration
  • Security scanning
  • Logging standards
  • SLO defaults

The Real Goal

Reduce

Decision fatigue

Enter fullscreen mode Exit fullscreen mode

Not flexibility.

Kubernetes Plugin: Real-Time Service Health in the Developer Portal

The Kubernetes plugin dramatically increases operational discoverability.

What Developers Actually Need

Not raw Kubernetes complexity.

They need:

  • Deployment status
  • Restart visibility
  • Pod health
  • Namespace ownership
  • Service mapping

Kubernetes Plugin Configuration

kubernetes:
  serviceLocatorMethod:
    type: 'multiTenant'

Enter fullscreen mode Exit fullscreen mode

Recommended Features

Expose:

  • Pod health
  • Replica status
  • Rollout history
  • Resource consumption
  • Deployment age

Avoid exposing excessive cluster internals.

The Biggest UX Mistake

Turning Backstage into a thin wrapper around kubectl.

Developers want abstraction.

Not Kubernetes archaeology.

Search: Making Platform Knowledge Discoverable

Search quality determines portal usefulness more than most teams realise.

Poor search destroys trust quickly.

What Should Be Searchable

Search should index:

  • Services
  • Documentation
  • APIs
  • Runbooks
  • Ownership
  • Terraform modules
  • CI/CD templates

Elasticsearch Integration

Recommended at scale:

Elasticsearch

Example

search:
  engine:
    type: elasticsearch

Enter fullscreen mode Exit fullscreen mode

Search Quality Rules

Good search requires:

  • Consistent metadata
  • Strong ownership tagging
  • Naming conventions
  • Documentation hygiene

Search quality reflects platform maturity.

Developer Adoption: The 90-Day Rollout Plan That Works

Most Backstage failures are adoption failures.

Not technical failures.

Phase 1 — Seed Critical Value (Days 1–30)

Launch with:

  • Service catalog
  • Ownership visibility
  • Kubernetes status
  • TechDocs

Avoid feature overload.

Phase 2 — Introduce Self-Service (Days 30–60)

Add:

  • Scaffolder templates
  • Deployment workflows
  • Golden Path automation

This creates habitual usage.

Phase 3 — Expand Platform Integrations (Days 60–90)

Integrate:

  • Incident systems
  • Monitoring
  • Cost visibility
  • Security tooling

Now Backstage becomes operationally indispensable.

The Biggest Adoption Mistake

Treating Backstage as

A documentation portal

Enter fullscreen mode Exit fullscreen mode

instead of

A workflow accelerator

Enter fullscreen mode Exit fullscreen mode

Measuring Backstage Success: The Metrics That Matter

Avoid vanity metrics like

Daily active users

Enter fullscreen mode Exit fullscreen mode

Measure operational outcomes instead.

Key Backstage Metrics

Time to First Production Deployment

Target

< 1 day

Enter fullscreen mode Exit fullscreen mode

Self-Service Rate

Measure

Infrastructure requests completed
without platform tickets

Enter fullscreen mode Exit fullscreen mode

Target

> 80%

Enter fullscreen mode Exit fullscreen mode

Golden Path Adoption

Target

> 90% of new services

Enter fullscreen mode Exit fullscreen mode

Documentation Coverage

Measure

Catalog entities with TechDocs

Enter fullscreen mode Exit fullscreen mode

Platform NPS

Critical indicator of developer trust.

Operating Backstage as a Product

This is the single most important principle.

Backstage is not an internal tool.

It is an internal product.

Product Thinking Changes Everything

Platform teams must manage:

  • Roadmaps
  • User feedback
  • Feature prioritisation
  • UX quality
  • Adoption metrics
  • Reliability

Exactly like customer-facing products.

Establish Platform Ownership

Recommended structure

Responsibility Owner
Infrastructure Platform engineering
Plugin lifecycle Plugin owners
Documentation standards Developer enablement
UX and adoption Platform product owner

Create a Feedback Loop

Run:

  • Quarterly DX surveys
  • Office hours
  • Team interviews
  • Usage analytics reviews

Without feedback loops, Backstage decays rapidly.

Upgrade Strategy

Backstage evolves quickly.

Recommended:

  • Monthly dependency reviews
  • Quarterly platform upgrades
  • Dedicated staging environment
  • Plugin compatibility testing

Never allow upgrades to drift indefinitely.

Common Failure Modes

*Failure Mode 1 — Trying to Solve Everything
*

Start small.

Expand gradually.

Failure Mode 2 — Weak Ownership

No ownership guarantees entropy.

Failure Mode 3 — No Golden Path

A portal without workflows becomes passive documentation.

Failure Mode 4 — Ignoring Developer Experience

Engineers abandon tools that increase friction.

Immediately.

The most successful Backstage deployments do not succeed because of plugin count or UI polish.

They succeed because they reduce cognitive load.

They make:

  • Ownership obvious
  • Documentation discoverable
  • Infrastructure self-service
  • Operational workflows consistent

Most importantly, they create a unified developer experience layer across increasingly fragmented engineering ecosystems.

That is why Backstage became the Internal Developer Portal standard.

Not because it centralised tools.

Because it simplified engineering flow.