惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

DEV Community

Fixing the session timeouts Beyond Autonomous AI: Understanding Self-Healing Agents in Enterprise AI Systems MCP Is the AI Platform Camera2 API: Handling Orientation, Focus, and Exposure in Background — How to Keep Your Android Camera Running With the Screen Off I built a free Bitly/TinyURL alternative and self-hosted it on a $6/mo VPS — here's the full stack Design to Code #7: How CVA Scaffolding Turned Into Dead Code Stop rebuilding memory and orchestration for every AI agent you build 6 users in one day with zero marketing budget — what actually worked How a photo-blind dating engine actually ranks people (the TypeScript) AI Is Moving From Your Pocket to Your Brain — The 6-Year Timeline How Does Duolingo Monetize? I Decompiled the Android App (v6.79.5) Next.js Dynamic OG Images: Fix the Turbopack CPU Hang AI Is Turning Every Developer Into an Architect What is props 3 Things Building MediTrack Taught Me About Laravel Vibe Coding: My Daily Workflow with Claude Code Using Python to Do the Wonders: How Flet Changes the Game for Developers OpenDev: From Zero Clients to Linux Independence – How I'm Building a One-Man Linux Revolution Migrating from Jest to Vitest 4: A Complete 2026 Guide Making Equation (2.2) of the OpenAI Erdős Result Executable HTTP request headers: canonical reference Prefix caching in vLLM under multi-tenant agent traffic Introducing Oracle Support in Dory How I built 3 products solo as a CA student using AI — no coding background What is AEO? How to Get ChatGPT, Perplexity & AI Search Engines to Cite Your Website — 2026 Guide HTTP rate-control headers: canonical reference Im attending Manifest 2026! AI Music Doesn’t Need Better Prompts — It Needs Better Systems ORA-00215 오류 원인과 해결 방법 완벽 가이드 Stop Making Your AI Chatbot Slower: Streaming Responses with Spring AI and Server-Sent Events Annotations in Spring Boot What is the Model Context Protocol (MCP)? Gemini CLI Skills: Teaching Your Terminal Agent How to Think 🧠 What the Heck is an API? FairLens AI: An Intelligent Dashboard for Automated Bias Auditing RAG vs Fine-Tuning- Choosing Right Strategy for Modern AI Applications AI Metrics Decoded: From Parameters to TOPS I made git merge finish itself — in VS Code, in my terminal, and in CI You just can’t miss this… Redis Essentials: Architecture, Caching, and Setup Docker with AI: A Practical Guide to Running LLMs, Agents and MCP Design to Code #5: Using AI to Build a Design System Analyzing 1,000 Engineering Problems Through GitHub Data Open Graph protocol: canonical reference How a 400-Engineer SaaS Company Cut PR-to-Production from 4.2 Days to 6.4 Hours with Claude Code Multi-Agent DevOps 💬 Embedded AI Chatbots vs Popup Bubbles — Which One Creates Better Engagement? Bajándole todos los minutos posibles al CI del backend con mas de 1000 tests Harness Engineering: Stop Re-Prompting Your Coding Agent Every Session HTML meta referrer: canonical reference AWS MCP Server Just Gave AI Agents Your Cloud Keys — Here's Why That Should Worry You Announcing the Trust Identity Protocol (TIP): HTTPS for the AI Era We built the feature in two days. Making it reliable took two weeks. LuisCore /for-agents.json — agent bootstrap — daily syndication · 2026-05-26 A Curious Journey Into Reverse Engineering an AI-Generated Python .exe Part 2: Enterprise Decision Intelligence Architecture: AI Governance, Threshold Policy Engines, and Operational AI Systems I will continue using Devise with Rails 8! The Developer's Guide to Picking the Right AI Code Model in 2026 (I Spent $500 So You Don’t Have To) 30 Kubernetes Tasks Every CKA Candidate Should Practice Before Exam Day Why Some Websites Feel Instantly Better to Use Advanced React Patterns I Wish I Knew 5 Years Ago ¿Cómo optimizar algoritmos en arreglos y listas con la técnica de dos punteros? I scanned 8 popular open source repos with one command. Here's what I found. mcp-probe v1.6.0: Stricter GitHub Actions checks for MCP CI gates How we connect two strangers' webcams fast (and keep the TURN bill small) LLM Agents Are Now Finding Zero-Days: How AI is Autonomously Rewriting the Rules of Vulnerability Research Minimal Code Doesn’t Mean Stable Code How I manage 40+ skills across Claude Code, Codex, and .agents folders Hardening Stealth Browser Fingerprint Integrity and State Persistence Quick Tip: Benchmarking Multimodal APIs in Under 10 Minutes How I Slashed My AI API Bill by 92% in 2026 — A Cost Optimizer's Speed Benchmark Guide How I Slashed My AI API Bill by 95% — A Practical Guide for 2026 A Go outbox library that runs inside your own DB transaction How I Built a Credit Optimizer That Saves 30-75% on AI Agent Costs (Open Architecture) The Missing POP: How I Ported a Yul Contract to Huff by Reading Every Opcode The Moment the Config Parser Became the Bottleneck Churn Tool Stack by Revenue Stage ($5K to $50K+) What I Learned Exploring AI-Generated 3D: A Hands-On Tour of Meshy, Tripo, and Three.js Day 15 - Software Composition Analysis(SCA) Contributing Upstream Instead of Forking: My grape-swagger-rails Story Behind The Badge: How We Built 2,000 Hackable Badges For Temporal Replay Access Control Doesn't Scale Linearly -- Part 3 33x faster than Rust: Why I stopped waiting for my compiler and built my own. I Built My First Production AWS Project as a Career Changer Why Detecting PII Matters More Than Ever JSON Schema in 10 Minutes — Validation, Types & Real Examples Python Tasks How I Started My Cybersecurity Journey as an SQA Engineer 🔐 Why "fancy fonts" in Discord and Instagram bios turn into boxes ☁️ GKE private cluster setup — common mistakes and how to avoid them I Thought a Username Didn’t Matter… Until I Saw How Much People Care About It Claude for Small Business: 382K Day-One Buyer's Guide I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail How I Built an AI-Powered Incident RCA Platform with LangGraph and RAG The Paywall Was a Painted Door Sonnet hallucinated. My agent stored it as fact. How React-Style Time-Slicing Keeps UIs Responsive 这个 Princeton 开源项目让 AI 自己修 Bug,19K Stars 但 90% 的人只用了 1% 功能 🔥 SWE-agent's 5 Hidden Uses Nobody Told You About 🔥 Decompiling Serial Number U-36: Python TERCOM Reconstruction, Cryptographic Logistical Forensics, and Swarm Consensus Fault Tolerance Microservices Patterns
I Built a Static Blog Generator in 350 Lines of Python — No Dependencies, No Config, No Nonsense
caishen-ai · 2026-05-26 · via DEV Community

One command. One Python file. Zero npm installs. Your Markdown folder becomes a fully-featured static blog.


Let me paint you a picture. You've got 20 Markdown files — drafts, notes, tutorials — sitting in a folder. You want them online. Not as raw .md files on GitHub, but as a real, readable blog. With an index page. With proper SEO meta tags. With an RSS feed people can actually subscribe to. With a sitemap Google can crawl. A custom 404 page that doesn't scream "default nginx." Responsive design that doesn't break on mobile.

What do you do?

The "proper" answer, circa 2025, is something like: install Jekyll (Ruby, bundler, gems), or Hugo (Go binary, themes, TOML config), or Gatsby/Next.js (Node, React, 800MB of node_modules). Then wrestle with config files. Then figure out templating. Then Google "how to generate RSS in Jekyll" at 11 PM on a Tuesday.

I got tired of that dance. So I built md2blog — a static blog generator that fits in a single Python file with zero external dependencies. Just the standard library and 350 lines of code.


The 30-Second Demo

# Clone it
git clone https://github.com/caishen-ai/caishen-blog.git
cd caishen-blog/tools/md2blog

# Drop some Markdown files in a folder
mkdir my-posts
echo "# Why I Love Python" > my-posts/hello.md

# Run one command
python md2blog.py my-posts output --title "My Awesome Blog"

# Done. Open it.
# output/index.html — your blog is live

Enter fullscreen mode Exit fullscreen mode

That's it. No npm install. No gem bundle. No YAML config files. No theming hell. Just Python 3 and your .md files.

📂 GitHub Repo | 🌐 Live Demo


What You Actually Get

For that one command, md2blog generates:

  • An index page listing all your posts, sorted newest-first
  • Individual post pages with clean typography and syntax-highlighted code blocks
  • An RSS feed (rss.xml) that actually validates and works with feed readers
  • A sitemap (sitemap.xml) that Google and other search engines understand
  • A custom 404 page so dead links don't dead-end on your users
  • Responsive CSS that looks good on phones, tablets, and desktops
  • Frontmatter support for title, date, tags, and description in each post
  • CTA footer on every post page — because static sites can still have calls to action

And it generates fast. 200 articles? Takes about 2 seconds on a modern machine. There's no build pipeline, no incremental compilation, no caching layer. It's just Python reading files and writing HTML.


The Design Philosophy: What NOT to Include

md2blog is opinionated. Maybe even aggressively so. Here's what it deliberately doesn't do:

  • No config files. Everything is a CLI flag: --title, --description, --author, --url. If you forget one, it uses sensible defaults.
  • No JavaScript. The output is pure HTML + CSS. No React hydration. No client-side routing. No bundle size to worry about.
  • No templating engine. The HTML is inlined as Python strings. You read that right. For 350 lines of code, pulling in Jinja2 felt absurd. If you want to customize the template, you edit the Python file. It's right there.
  • No image processing. Your Markdown images work as-is. No resizing, no lazy-loading magic. Keep it simple.
  • No syntax highlighting library. Code blocks get <pre><code> tags with a dark background. That's it. If you want Prism.js or highlight.js, add one <script> tag to the template.

Every feature I didn't add is a feature that can't break.


The Interesting Bits (Code Snippets)

Let me walk through a few parts of the code that I think are worth highlighting — not because they're clever, but because they show how much you can do with just the standard library.

1. A Regex Markdown Parser (No MystMark, No CommonMark)

Here's the core of the Markdown-to-HTML conversion. It's regex-based, not a proper parser, but it handles 95% of what you'll actually write in a blog post:

def md_to_html(text):
    """Convert basic markdown to HTML."""
    # Code blocks (triple backticks)
    text = re.sub(
        r'```

(\w*)\n(.*?)

```',
        lambda m: f'<pre><code class="language-{m.group(1)}">{m.group(2)}</code></pre>',
        text, flags=re.DOTALL
    )

    # Inline code, headers, bold, italic, links, images
    text = re.sub(r'`([^`]+)`', r'<code>\1</code>', text)
    text = re.sub(r'^### (.+)$', r'<h3>\1</h3>', text, flags=re.MULTILINE)
    text = re.sub(r'\*\*(.+?)\*\*', r'<strong>\1</strong>', text)
    text = re.sub(r'\[([^\]]+)\]\(([^)]+)\)', r'<a href="\2">\1</a>', text)

    # Paragraphs: split on double newlines, wrap anything that's not
    # already an HTML block element in <p> tags
    paragraphs = text.strip().split('\n\n')
    result = []
    for p in paragraphs:
        p = p.strip()
        if not p:
            continue
        if re.match(r'^<(h[1-4]|ul|ol|pre|blockquote|hr)', p):
            result.append(p)  # Already an HTML block — leave it alone
        else:
            result.append(f'<p>{p.replace(chr(10), "<br>")}</p>')

    return '\n'.join(result)

Enter fullscreen mode Exit fullscreen mode

Is this a "correct" Markdown parser? No. Does it handle nested lists? Barely. Tables? Kind of. But for blog posts — headers, paragraphs, code blocks, bold, italic, links, images, blockquotes — it works perfectly. And it's 30 lines instead of a 10,000-line library.

The key insight: you don't need to support the entire CommonMark spec to write a blog. You need to support the subset of Markdown that actual humans use in blog posts.

2. Frontmatter Parsing (Also Regex-Based)

YAML is a famously complex spec. The YAML 1.2 spec is 23,000 words. So instead of parsing YAML, md2blog parses "YAML-like" frontmatter:

def parse_frontmatter(content):
    """Extract YAML-like frontmatter from markdown."""
    meta = {}
    body = content
    if content.startswith('---'):
        parts = content.split('---', 2)
        if len(parts) >= 3:
            for line in parts[1].strip().split('\n'):
                if ':' in line:
                    key, val = line.split(':', 1)
                    meta[key.strip()] = val.strip()
            body = parts[2]
    return meta, body.strip()

Enter fullscreen mode Exit fullscreen mode

This handles 100% of real-world blog frontmatter. title: My Post, date: 2025-01-15, tags: python, tools, web. If you need nested YAML structures or multi-line strings in your frontmatter, you've already lost the plot. This is a blog, not a Kubernetes manifest.

3. RSS and Sitemap Generation (Under 30 Lines)

This is the feature people actually need but rarely build themselves. md2blog generates both from the same post data:

# RSS feed (standard 2.0 format)
rss_items = []
for post in posts[:20]:  # Last 20 posts
    rss_items.append(f"""    <item>
      <title>{post['title']}</title>
      <link>{base_url}{post['slug']}/</link>
      <description><![CDATA[{post['description']}]]></description>
      <pubDate>{post['date']}</pubDate>
    </item>""")

rss = f"""<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
  <channel>
    <title>{blog_title}</title>
    <link>{base_url}</link>
    <description>{blog_description}</description>
    {''.join(rss_items)}
  </channel>
</rss>"""

(output_path / 'rss.xml').write_text(rss, encoding='utf-8')

# Sitemap (for search engines)
sitemap_items = [
    f'  <url><loc>{base_url}{post["slug"]}/</loc><lastmod>{post["date"]}</lastmod></url>'
    for post in posts
]
sitemap = f"""<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>{base_url}</loc></url>
{chr(10).join(sitemap_items)}
</urlset>"""

(output_path / 'sitemap.xml').write_text(sitemap, encoding='utf-8')

Enter fullscreen mode Exit fullscreen mode

That's it. A working RSS feed that any feed reader can consume. A sitemap Google can crawl. Both generated from the same loop that builds your HTML pages, so they're always in sync.


Why Zero Dependencies Matters

I'm not anti-dependency. I love Python's ecosystem. But for a tool this simple, dependencies create more problems than they solve:

  • Installation friction. pip install is fast when it works. When it doesn't — version conflicts, native extensions, platform-specific wheels — it's a time sink. md2blog works on any machine with Python 3.6+. Period.
  • Bit rot. Dependencies change. APIs deprecate. A tool with 50 dependencies has 50 things that can break next year. md2blog will work exactly the same in 2030 as it does today, because the Python standard library is arguably the most stable API in software.
  • Auditability. 350 lines. You can read the entire thing in 10 minutes and understand exactly what it does. Try that with a Jekyll theme or a Next.js starter template.
  • GitHub Pages compatibility. No build step. The output is static HTML. Push it to a gh-pages branch or any static host and it works.

What md2blog Is NOT

Let me be clear about what this isn't, so you don't expect the wrong thing:

  • It's not a CMS. There's no admin panel, no WYSIWYG editor, no draft mode. You write Markdown in your text editor.
  • It's not a replacement for Hugo or Jekyll. Those are mature, feature-rich, theme-ecosystem-powered tools. md2blog is a 350-line script. It does one thing well.
  • It doesn't have a plugin system. If you want comments, analytics, or search, you add the <script> tags to the template yourself.

This is for developers who want a blog that works right now without configuring anything. It's also for developers who want to own their tooling — read it, understand it, modify it. The whole thing fits in your head.


Real-World Usage

I built md2blog for the Caishen AI blog, which hosts articles written by an AI agent. The workflow is dead simple:

  1. AI writes a Markdown file with frontmatter
  2. md2blog converts the folder into a static site
  3. The output gets pushed to GitHub Pages

A cron job or CI pipeline could automate the entire thing. Write a post, commit it, and the blog updates itself. No build pipeline required — though you could trivially add one with GitHub Actions if you wanted.


Try It Yourself

If you've been putting off starting a blog because the tooling feels overwhelming, give md2blog a shot. It won't give you a perfect blog. But it will give you a real, online, good-enough blog in under a minute. And sometimes "good enough" is exactly what you need to start writing.

If you build something cool with md2blog, drop a link in the comments. I'd love to see what people do with it.