How I Built an AI News Brief with Next.js, Supabase, Vercel, and GPT-4o-mini
AIDeepSignal · 2026-05-25 · via DEV Community

Over the past few months, I have been building a small AI news brief called DeepSignal.

The idea started from a simple personal frustration:

I was reading X, Hacker News, arXiv, OpenAI and Anthropic blogs, product launch pages, newsletters, and company updates every day, but still felt like I was either missing important AI news or wasting time on low-signal updates.

So I built a small system that does three things:

  1. Collects AI-related updates from multiple sources
  2. Scores each story with a transparent 0–100 signal score
  3. Publishes a daily and weekly brief

The product is not technically complex, but the workflow taught me a lot about building AI-assisted content products, SEO for dynamic sites, and the difference between summarizing information and filtering information.

This is a breakdown of the stack, architecture, and lessons learned.


The stack

The current stack is intentionally simple:

Frontend: Next.js 15
Database: Supabase
Hosting: Vercel
AI processing: GPT-4o-mini
Content model: Articles, sources, tags, guides, weekly briefs
SEO: sitemap, canonical URLs, RSS, structured pages

I wanted to keep the system cheap and easy to maintain because this is a solo project.

The rough monthly cost is still low. Vercel handles deployment and hosting, Supabase handles the database, and GPT-4o-mini is used for scoring and classification rather than heavy generation.

The main goal was not to build a complicated AI pipeline.

The goal was to build a reliable workflow that could turn noisy inputs into useful outputs.


The basic architecture

The system has a simple flow:

Sources
   ↓
Fetch / import
   ↓
Normalize article data
   ↓
AI relevance check
   ↓
Signal scoring
   ↓
Tagging and categorization
   ↓
Publish article pages
   ↓
Generate daily / weekly briefs
   ↓
Expose guides, RSS, sitemap

At a high level, each story becomes a structured object:

type Article = {
  id: string;
  title: string;
  url: string;
  source: string;
  summary: string;
  publishedAt: string;
  aiRelevanceScore: number;
  signalScore: number;
  tags: string[];
  category: string;
  canonicalUrl: string;
  isIndexable: boolean;
};

The most important field is not the summary.

It is isIndexable.

That one field ended up being more important than I expected.


Why filtering matters more than summarizing

At first, I thought the main problem was summarization.

Take a long article, summarize it, and users save time.

But after building the first version, I realized summarization alone does not solve the real problem.

A summary tells you:

What does this article say?

But users usually need to know:

Should I care?
Why does this matter?
Is this actually about AI?
Is this a durable signal or just a temporary headline?
Is this more important than the other 50 updates today?

That changed the product direction.

Instead of only generating summaries, the system needed to decide what should be included, ranked, grouped, and excluded.

For an AI news product, filtering is not a minor feature.

Filtering is the product.


The signal score

Each story gets a 0–100 signal score.

The score is not meant to be perfect. It is a transparent ranking system that helps explain why a story may matter.

A story can score higher based on signals like:

- source quality
- AI relevance
- novelty
- technical depth
- business impact
- research importance
- company importance
- cross-source confirmation
- relevance to builders, researchers, or operators

A simplified scoring idea looks like this:

type ScoreInput = {
  sourceWeight: number;
  aiRelevance: number;
  novelty: number;
  technicalDepth: number;
  marketImpact: number;
  researchValue: number;
  companyImportance: number;
};

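// Weighted sum of the input signals. The weights add up to 1, so if every
// input is on a 0-100 scale, the result is on a 0-100 scale as well.
function calculateSignalScore(input: ScoreInput): number {
  const score =
    input.sourceWeight * 0.15 +
    input.aiRelevance * 0.25 +
    input.novelty * 0.15 +
    input.technicalDepth * 0.15 +
    input.marketImpact * 0.1 +
    input.researchValue * 0.1 +
    input.companyImportance * 0.1;

  // Clamp to 0-100 in case an input is out of range, then round to an integer.
  return Math.round(Math.min(100, Math.max(0, score)));
}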

The exact formula can change, but the principle matters:

I wanted users to feel that the ranking had a visible logic, not just a black-box AI label.

That was one of the biggest lessons:

A simple transparent scoring system can be more trustworthy than a more complex but invisible AI ranking.
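
One way to make that logic visible is to return each factor's contribution alongside the total. The sketch below is an illustration rather than the production code: it reuses the weights from the simplified formula, assumes every input is on a 0-100 scale, and the names `WEIGHTS`, `FactorScores`, and `explainSignalScore` are hypothetical.

```typescript
// Weights copied from the simplified formula; they sum to 1.
const WEIGHTS = {
  sourceWeight: 0.15,
  aiRelevance: 0.25,
  novelty: 0.15,
  technicalDepth: 0.15,
  marketImpact: 0.1,
  researchValue: 0.1,
  companyImportance: 0.1,
} as const;

type Factor = keyof typeof WEIGHTS;
type FactorScores = Record<Factor, number>; // each value assumed to be 0-100
type Contribution = { factor: Factor; points: number };

// Returns the total score plus each factor's contribution, largest first,
// so a page can show why a story ranked where it did.
function explainSignalScore(input: FactorScores): {
  score: number;
  breakdown: Contribution[];
} {
  const breakdown = (Object.keys(WEIGHTS) as Factor[])
    .map((factor) => ({ factor, points: input[factor] * WEIGHTS[factor] }))
    .sort((a, b) => b.points - a.points);
  const total = breakdown.reduce((sum, c) => sum + c.points, 0);
  return { score: Math.round(Math.min(100, Math.max(0, total))), breakdown };
}

// Made-up inputs for illustration only.
const example = explainSignalScore({
  sourceWeight: 80,
  aiRelevance: 100,
  novelty: 60,
  technicalDepth: 40,
  marketImpact: 50,
  researchValue: 20,
  companyImportance: 70,
});
console.log(example.score); // 66
console.log(example.breakdown[0]?.factor); // "aiRelevance"
```

With these inputs, AI relevance contributes 25 of the 66 points, which is the kind of detail a score explanation can surface directly.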


Using GPT-4o-mini

I use GPT-4o-mini mostly for classification, scoring support, and short summaries.

The AI tasks are intentionally narrow:

- Is this article actually AI-related?
- What category does it belong to?
- What are the key takeaways?
- Is the story relevant to models, agents, research, hardware, infrastructure, regulation, or adoption?
- What tags should it receive?
- What score explanation should be shown?

I try not to use AI as a generic content generator.

Instead, I use it as a structured processing layer.

A simplified prompt pattern looks like this:

You are classifying an AI industry news article.

Return JSON only.

Evaluate:
1. AI relevance from 0 to 100
2. Signal strength from 0 to 100
3. Primary category
4. 3 to 5 tags
5. One-sentence reason why this story matters
6. Whether this story should be indexable for search

Article:
Title: ...
Source: ...
Excerpt: ...
URL: ...

The important part is forcing structured output.

For this kind of workflow, predictable JSON is more useful than beautifully written prose.
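
Asking for JSON does not guarantee valid JSON, so the reply is worth validating before anything is written to the database. A minimal sketch, assuming the model returns the six fields requested in the prompt; the `Classification` field names and the `parseClassification` helper are hypothetical, and the model call itself is left out.

```typescript
// Hypothetical shape of the JSON reply; the field names are assumptions.
type Classification = {
  aiRelevance: number; // 0-100
  signalStrength: number; // 0-100
  category: string;
  tags: string[]; // 3 to 5 entries
  reason: string;
  indexable: boolean;
};

// True only for finite numbers inside the 0-100 range.
function isScore(value: unknown): value is number {
  return (
    typeof value === "number" &&
    Number.isFinite(value) &&
    value >= 0 &&
    value <= 100
  );
}

// Returns null when the reply is not valid JSON or does not match the
// expected shape, so the caller can retry or skip the article.
function parseClassification(raw: string): Classification | null {
  let data: unknown;
  try {
    data = JSON.parse(raw);
  } catch {
    return null;
  }
  if (typeof data !== "object" || data === null) return null;
  const { aiRelevance, signalStrength, category, tags, reason, indexable } =
    data as Record<string, unknown>;
  if (!isScore(aiRelevance) || !isScore(signalStrength)) return null;
  if (typeof category !== "string" || typeof reason !== "string") return null;
  if (typeof indexable !== "boolean") return null;
  if (!Array.isArray(tags) || tags.length < 3 || tags.length > 5) return null;
  if (!tags.every((tag) => typeof tag === "string")) return null;
  return {
    aiRelevance,
    signalStrength,
    category,
    tags: tags as string[],
    reason,
    indexable,
  };
}

// Made-up replies for illustration only.
const goodReply =
  '{"aiRelevance":92,"signalStrength":71,"category":"agents",' +
  '"tags":["agents","tooling","open-source"],"reason":"Example reason.","indexable":true}';
console.log(parseClassification(goodReply)?.category); // "agents"
console.log(parseClassification("Sure! Here is the JSON you asked for")); // null
```

Rejecting malformed replies outright keeps prose-wrapped or half-filled answers from leaking into article pages.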


Supabase data model

The database is simple.

Core tables:

articles
sources
tags
article_tags
daily_briefs
weekly_briefs
guides
guide_articles

The articles table stores the normalized content.

The sources table stores source metadata and source quality.

The tags table keeps topic structure clean.

The guides table is for evergreen topic pages, such as:

AI agents
AI coding tools
AI research papers
OpenAI updates
Anthropic Claude updates
NVIDIA AI chips
AI hardware

This guide layer became important later for SEO.

A chronological feed is useful for freshness, but guide pages are better for long-term search and topic authority.


Next.js page structure

The site uses a few main page types:

/
Homepage

/articles/[slug]
Individual article pages

/guides
Guide index

/guides/[slug]
Evergreen topic pages

/weekly
Weekly AI brief

/tags/[slug]
Core topic pages

/sources/[slug]
Selected source pages

Not every page deserves to be indexed.

That became one of the most important SEO decisions.


SEO lesson: not every page should be in the sitemap

Early on, I made the mistake of thinking more indexed pages would be better.

They were not.

When a site has too many low-quality, thin, duplicate, or off-topic pages, search engines can get confused about what the site is actually about.

For an AI news site, this matters a lot because source feeds can easily include AI-adjacent but irrelevant content.

So I added stricter sitemap rules.

The sitemap should include:

- homepage
- about page
- guide index
- high-quality guide pages
- weekly brief
- selected high-quality article pages
- selected core tag pages

The sitemap should not include:

- saved pages
- subscribe pages
- internal API routes
- search result pages
- parameter URLs
- low-quality tag pages
- non-AI articles
- thin source pages
- duplicate daily feed pages

The rule I use now is simple:

Only put a URL in the sitemap if it is:

- canonical
- indexable
- useful as a search landing page
- relevant to the core AI topic
- not thin or duplicated

This helped clean up the site’s search profile.
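
The sitemap rule above can be expressed as a single predicate. This is a sketch under assumptions, not the site's actual code: the `SitemapCandidate` fields, the excluded path prefixes, and both thresholds are hypothetical, and word count is only a rough stand-in for "thin".

```typescript
// Hypothetical page record; field names and thresholds are assumptions.
type SitemapCandidate = {
  url: string;
  canonicalUrl: string;
  isIndexable: boolean;
  aiRelevanceScore: number; // 0-100
  wordCount: number; // rough proxy for thin pages
};

const MIN_AI_RELEVANCE = 60; // assumed cutoff
const MIN_WORD_COUNT = 150; // assumed cutoff
const EXCLUDED_PREFIXES = ["/api", "/saved", "/subscribe", "/search"];

function belongsInSitemap(page: SitemapCandidate): boolean {
  const { pathname, search } = new URL(page.url);
  const isExcludedPath = EXCLUDED_PREFIXES.some(
    (prefix) => pathname === prefix || pathname.startsWith(prefix + "/"),
  );
  return (
    page.url === page.canonicalUrl && // canonical
    page.isIndexable && // indexable
    search === "" && // no parameter URLs
    !isExcludedPath && // no internal or utility routes
    page.aiRelevanceScore >= MIN_AI_RELEVANCE && // on topic
    page.wordCount >= MIN_WORD_COUNT // not thin
  );
}

const weekly: SitemapCandidate = {
  url: "https://ai-deep-signal.com/weekly",
  canonicalUrl: "https://ai-deep-signal.com/weekly",
  isIndexable: true,
  aiRelevanceScore: 90,
  wordCount: 400,
};
console.log(belongsInSitemap(weekly)); // true
console.log(
  belongsInSitemap({
    ...weekly,
    url: "https://ai-deep-signal.com/weekly?utm_source=x",
  }),
); // false
```

Whether a page is "useful as a search landing page" is still a judgment call; a predicate like this only removes the mechanical exclusions.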


Canonical URLs and UTM links

For promotion, I use UTM links like:

https://ai-deep-signal.com/weekly?utm_source=x&utm_medium=social&utm_campaign=weekly

or:

https://ai-deep-signal.com/?utm_source=reddit&utm_medium=social&utm_campaign=launch

But the canonical URL must always point to the clean version:

https://ai-deep-signal.com/weekly
https://ai-deep-signal.com/

That avoids turning campaign URLs into duplicate SEO pages.

For a dynamic site, this is easy to overlook.

Tracking URLs are for analytics.

Canonical URLs are for search engines.

They should not be mixed.
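
A small helper can derive the clean URL from whatever URL was requested. This sketch uses the standard `URL` and `URLSearchParams` APIs; the function name `toCanonicalUrl` is hypothetical, and treating every `utm_`-prefixed parameter as tracking-only is an assumption.

```typescript
// Removes utm_* tracking parameters and keeps everything else intact.
function toCanonicalUrl(rawUrl: string): string {
  const url = new URL(rawUrl);
  // Collect the keys first, then delete, to avoid mutating while iterating.
  const trackingKeys: string[] = [];
  url.searchParams.forEach((_value, key) => {
    if (key.startsWith("utm_")) trackingKeys.push(key);
  });
  for (const key of trackingKeys) url.searchParams.delete(key);
  return url.toString();
}

console.log(
  toCanonicalUrl(
    "https://ai-deep-signal.com/weekly?utm_source=x&utm_medium=social&utm_campaign=weekly",
  ),
); // "https://ai-deep-signal.com/weekly"
console.log(
  toCanonicalUrl(
    "https://ai-deep-signal.com/?utm_source=reddit&utm_medium=social&utm_campaign=launch",
  ),
); // "https://ai-deep-signal.com/"
```

Other query parameters are left in place on purpose, so whether they belong in a canonical URL stays a separate decision.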


Why I added guides and weekly briefs

The first version of the site was mostly a feed.

That worked, but it had a problem:

Feeds are good for browsing.

Guides are better for understanding.

So I added topic-based guides and a weekly brief.

The weekly page is for people who want a quick summary of what mattered this week.

The guide pages are for evergreen themes that should grow over time.

For example:

/guides/what-are-ai-agents
/guides/best-ai-coding-agents
/guides/ai-research-papers-this-week
/guides/nvidia-ai-chip-news
/guides/openai-news

This gives the site a more stable structure:

Homepage
  ↓
Guides
  ↓
Topic pages
  ↓
Related articles

That structure is much better than only having a reverse-chronological feed.
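
For the weekly brief, one plausible selection rule is "the highest-scoring stories from the last seven days". The sketch below assumes that rule; `BriefItem`, `pickWeeklyBrief`, and the default limit of 10 are illustrative names and values, not a description of how the brief is actually assembled.

```typescript
type BriefItem = {
  title: string;
  publishedAt: string; // assumed to be an ISO 8601 timestamp
  signalScore: number; // 0-100
};

const WEEK_MS = 7 * 24 * 60 * 60 * 1000;

// Keeps stories published in the 7 days before `now`, highest score first.
// Items with an unparseable date are dropped because NaN fails both checks.
function pickWeeklyBrief(
  items: BriefItem[],
  now: Date,
  limit = 10,
): BriefItem[] {
  const end = now.getTime();
  const start = end - WEEK_MS;
  return items
    .filter((item) => {
      const published = Date.parse(item.publishedAt);
      return published >= start && published <= end;
    })
    .sort((a, b) => b.signalScore - a.signalScore)
    .slice(0, limit);
}

// Made-up stories for illustration only.
const stories: BriefItem[] = [
  { title: "A", publishedAt: "2026-05-24T09:00:00Z", signalScore: 70 },
  { title: "B", publishedAt: "2026-05-20T09:00:00Z", signalScore: 90 },
  { title: "C", publishedAt: "2026-05-01T09:00:00Z", signalScore: 99 },
];
const brief = pickWeeklyBrief(stories, new Date("2026-05-25T00:00:00Z"), 2);
console.log(brief.map((item) => item.title)); // ["B", "A"]
```

Story C has the highest score but falls outside the window, which is the difference between a weekly brief and an all-time ranking.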


Deployment on Vercel

Vercel is a good fit for this kind of project because most of the site is content-oriented.

The project benefits from:

- fast deployments
- preview deployments
- automatic HTTPS
- good Next.js support
- serverless functions for lightweight API work
- ISR / caching options

But I avoid using Vercel for heavy background work.

If the project grows, I would move heavier jobs to a separate worker or queue system.

For now, Vercel + Supabase is enough.


What I would improve next

There are still many things I would improve.

Better deduplication

AI news often appears in multiple places. The same story can show up as a company blog post, a tweet thread, a newsletter item, and a Hacker News discussion.

Better clustering would make the brief cleaner.
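
As a starting point, stories could be grouped by how many title words they share. The sketch below uses Jaccard similarity over title tokens with greedy clustering; that technique, the 0.5 threshold, and the names `Story`, `tokenize`, `jaccard`, and `clusterStories` are assumptions for illustration, not something the current system does. It only handles ASCII titles.

```typescript
type Story = { title: string; url: string };

// Lowercases the title and keeps alphanumeric words longer than 2 characters.
function tokenize(title: string): Set<string> {
  return new Set(
    title
      .toLowerCase()
      .split(/[^a-z0-9]+/)
      .filter((word) => word.length > 2),
  );
}

// Jaccard similarity: shared tokens divided by total distinct tokens.
function jaccard(a: Set<string>, b: Set<string>): number {
  if (a.size === 0 && b.size === 0) return 0;
  let shared = 0;
  a.forEach((word) => {
    if (b.has(word)) shared += 1;
  });
  return shared / (a.size + b.size - shared);
}

// Greedy clustering: each story joins the first cluster whose seed title is
// similar enough, otherwise it starts a new cluster.
function clusterStories(stories: Story[], threshold = 0.5): Story[][] {
  const clusters: { tokens: Set<string>; members: Story[] }[] = [];
  for (const story of stories) {
    const tokens = tokenize(story.title);
    const match = clusters.find((c) => jaccard(c.tokens, tokens) >= threshold);
    if (match) match.members.push(story);
    else clusters.push({ tokens, members: [story] });
  }
  return clusters.map((c) => c.members);
}

// Made-up headlines for illustration only.
const grouped = clusterStories([
  { title: "Acme Labs releases new agent toolkit", url: "https://example.com/a" },
  { title: "Acme Labs releases new agent toolkit for developers", url: "https://example.com/b" },
  { title: "Chipmaker announces next accelerator roadmap", url: "https://example.com/c" },
]);
console.log(grouped.map((cluster) => cluster.length)); // [2, 1]
```

Title overlap misses rewrites that use different wording, so a real version would likely need URL matching or embeddings on top.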

Better source weighting

Not all sources should have equal authority. A research paper, company announcement, social post, and rewritten news article should be weighted differently.
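
One simple form this could take is a base weight per source type plus a small bonus for cross-source confirmation. Every number below is a placeholder, the ordering of the source types is an assumption, and `SourceKind`, `SOURCE_WEIGHTS`, and `sourceWeightFor` are hypothetical names.

```typescript
type SourceKind =
  | "research_paper"
  | "company_announcement"
  | "news_rewrite"
  | "social_post";

// Placeholder base weights on a 0-100 scale; tune against real data.
const SOURCE_WEIGHTS: Record<SourceKind, number> = {
  research_paper: 90,
  company_announcement: 80,
  news_rewrite: 50,
  social_post: 40,
};

// Adds 5 points per confirming source (capped at 3 sources), never above 100.
function sourceWeightFor(kind: SourceKind, confirmations = 0): number {
  const bonus = Math.min(Math.max(confirmations, 0), 3) * 5;
  return Math.min(100, SOURCE_WEIGHTS[kind] + bonus);
}

console.log(sourceWeightFor("social_post")); // 40
console.log(sourceWeightFor("social_post", 2)); // 50
console.log(sourceWeightFor("research_paper", 5)); // 100
```

The result could feed the `sourceWeight` input of the signal score, which keeps source authority as one visible factor rather than a hidden override.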

Better guide pages

The guide pages should become more like living topic trackers, not just lists of related articles.

Each guide should eventually include:

- topic explanation
- latest updates
- important companies
- relevant research
- key risks
- related stories
- last updated date

Better scoring explanations

A score is only useful if users understand it.

I want each article to explain not just the score, but the reason behind the score.


What I learned

A few lessons stood out.

1. Filtering is harder than summarizing

Summarization is relatively easy now. Deciding what deserves attention is much harder.

2. SEO quality matters more than SEO volume

More pages are not always better. Cleaner, more relevant pages are better.

3. Topic pages are more durable than feeds

Feeds create freshness. Guides create long-term value.

4. Transparent AI systems feel more trustworthy

Users do not need a perfect score, but they do need to understand why a score exists.

5. The workflow around the model is the real product

The AI model is only one part. The source selection, scoring rules, publishing flow, SEO structure, and user experience matter just as much.


Final thoughts

This project started as a small personal tool because I was tired of reading too many AI sources every morning.

But it turned into a useful lesson:

AI products do not always need to generate more content.

Sometimes the better product is the one that helps people ignore more content.

That is what I am trying to build with DeepSignal: a cleaner way to follow AI news, research, agents, models, and infrastructure without the daily noise.

The site is here:

https://ai-deep-signal.com/?utm_source=devto&utm_medium=article&utm_campaign=build_log

The weekly brief is here:

https://ai-deep-signal.com/weekly?utm_source=devto&utm_medium=article&utm_campaign=build_log

I would love feedback from other developers:

Would you trust a transparent signal score for news ranking?
Or would you rather see a purely editorial brief without scoring?
