惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
Scott Helme
Scott Helme
爱范儿
爱范儿
WordPress大学
WordPress大学
博客园 - 三生石上(FineUI控件)
阮一峰的网络日志
阮一峰的网络日志
博客园 - Franky
V
V2EX
腾讯CDC
博客园_首页
博客园 - 司徒正美
酷 壳 – CoolShell
酷 壳 – CoolShell
T
Tailwind CSS Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
小众软件
小众软件
J
Java Code Geeks
大猫的无限游戏
大猫的无限游戏
月光博客
月光博客
Microsoft Azure Blog
Microsoft Azure Blog
B
Blog
雷峰网
雷峰网
Stack Overflow Blog
Stack Overflow Blog
IT之家
IT之家
罗磊的独立博客
Recorded Future
Recorded Future
博客园 - 聂微东
O
OpenAI News
S
Secure Thoughts
Hacker News: Ask HN
Hacker News: Ask HN
S
Schneier on Security
Hacker News - Newest:
Hacker News - Newest: "LLM"
Y
Y Combinator Blog
C
Cyber Attacks, Cyber Crime and Cyber Security
Project Zero
Project Zero
宝玉的分享
宝玉的分享
K
Kaspersky official blog
N
Netflix TechBlog - Medium
T
The Exploit Database - CXSecurity.com
Google Online Security Blog
Google Online Security Blog
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Webroot Blog
Webroot Blog
云风的 BLOG
云风的 BLOG
Simon Willison's Weblog
Simon Willison's Weblog
C
Check Point Blog
D
Darknet – Hacking Tools, Hacker News & Cyber Security
L
LINUX DO - 热门话题
美团技术团队
L
Lohrmann on Cybersecurity

Stonecharioteer on Tech

I Traced My Traffic Through a Home Tailscale Exit Node What Was I Reading Last? In Three Not-So-Easy Pieces Dogfooding Is Hard Code blocks in your books, finally GoForGo v0.9.0 Merrilin - We built an app to read books I use a Macbook now Data Structures & Algorithms - Preparing for Interviews Using a local DNS namespace for local service discovery Direction KOllector - Publishing KOReader Highlights gbt: branches touched in the last 24 hours A Soiree into Symbols in Ruby Some Smalltalk about Ruby Loops Ruby Blocks Returning from Ruby Blocks, Procs and Lambdas My Linux Laptop Finally Works: How Claude Helped Me Fix Years of Annoyances TIL: Watchexec - Modern File Watching for Development Workflows A Less Busy Mind GoForGo - Learn Go through live examples Migrating My Old Blog to Hugo with Claude The Qtile Window Manager: A Python-Powered Tiling Experience Read the RFCs that Built the Internet Py-x-Protobuf - Or How I Learned to Stop Worrying and Love Protocol Buffers Python Reverse a List New Beginnings Leaving ChainSafe Systems Screen Lock for Cinnamon Desktop using Zenity and Terminal Commands Crews Not Teams A System for Getting Better at LeetCode So Far So Rust Retrying HTTP Requests with Rust A Primer on Control Charts Learning Rust Explicit is Better than Implicit: Rust for Pythonistas Using Custom Delimiters in Jinja Templates TIL: Creating Fixed Length Iterables in Python Documentation Without Assumption Vagrant Python - A Reflection in 2022 Learning Golang No, A Virtual Machine Is Not Enough: Why Developers Need Native Linux Empathy in Tech For Those Who Came in Late A Weekend With PostgreSQL TIL: Gooey and Python Fire for Quick GUIs and CLIs TIL: 2ality - Dr. Axel Rauschmayer's JavaScript Blog TIL: MassDNS - High-Performance Bulk DNS Lookups TIL: Matomo Analytics, Google Tech Writing, Memory Programming, and NES TV Signals TIL: MontyDB - MongoDB Implemented in Python Returning to the Craft of Programming TIL: CPUFetch, OneFetch, and Learn CSS TIL: DNS Performance Testing and Pi-hole with Unbound TIL: Eli Bendersky's Blog, Awesome By Example, NoCoDB, and Martin Kleppmann TIL: CRDTs, Extreme HTTP Performance, and BYTEPATH Game TIL: AutoInvent, ASGI, Python Packaging, RAPIDS GPU Computing, and FlaskCon TIL: MangaDesk - Terminal Client for MangaDex TIL: McFly - Smart Shell History Search TIL: Siege Load Testing and Awesome FastAPI Resources TIL: Ventoy Bootable USB and Justniffer Network Analysis TIL: CLI Code Review, Git Split Diffs, and Internal Combustion Engine TIL: Benford's Law, Web Security Headers, Event Sourcing, and Mozilla Security Guidelines How to Write Documentation - The README.md File The Importance of Documentation TIL: NNgroup UX Research, SponsorBlock, and Labella Python Library TIL: The Little Book of Rust Macros and Rust Performance Book TIL: Git-Bug Distributed Issue Tracker and Omni Kubernetes Monitoring TIL: Zellij - Modern Terminal Multiplexer TIL: How Discord Handles 2.5 Million Concurrent Voice Users TIL: Volumio - The Audiophile Music Player TIL: Areopagitica - Milton's Defense of Free Speech TIL: Fast Node Manager, Zoxide Smart CD, Technical Writing, PyO3, and Qubes OS TIL: Slurm Workload Manager for HPC Clusters TIL: Data Visualization Guide and Oso Authorization Academy TIL: CORS Deep Dive, Piku Tiny PaaS, Rust Strings, and Deno Standard Library TIL: Raspberry Pi OS Development, Vim Beginner Guide, Password Management, and QueryBook TIL: uBlock Origin Performance Optimization on Firefox TIL: Breaking PostgreSQL at Scale and LeetCode Problem Patterns TIL: Awesome Tmux Resources for Terminal Multiplexing TIL: Grit - A Multitree-Based Personal Task Manager TIL: Lens 4.2 Kubernetes IDE, Shell Scripting Guide, and Dark HTTP Server Do The Job You Hate So You Won't Hate The Job You Love TIL: Innernet VPN Solution and NoteCalc Calculator App TIL: Argo CD for GitOps and Lens Kubernetes IDE TIL: Modern Rust CLI Tools - System Monitoring, HTTP Requests, and DNS TIL: tz - A Time Zone Helper Tool TIL: Distributed Systems Education, Fallacies, and Self-Hosted Internet Archiving TIL: Real-Time Voice Cloning Technology TIL: ChartMuseum for Helm, AMD's Corporate Journey, and Kubernetes Pod Scaling TIL: Docker and Kubernetes Tools - Whaler, Descheduler, and Dive TIL: Post-Mortem Collection, Terminal Plotting, and Technical Twitter TIL: Dark Mode Toggle Web Component by Google Chrome Labs TIL: Python eval(), exec(), and compile() Functions TIL: fuser Command for Process and File Investigation TIL: i Hate Regex - The Ultimate Regex Cheat Sheet TIL: Dolt - Git for Data and Database Version Control TIL: x86 Assembly Programming and SafeEyes Break Reminder TIL: Comprehensive Distributed Systems Reading List TIL: Cosmopolitan C Library, Distributed Systems Book, High Performance Browser Networking, and Rust Roguelike Tutorial TIL: ABlog for Sphinx - Documentation as a Blog Platform
TIL: Camelot PDF Tables, PostgreSQL Row Level Security, Zerodha Varsity, and Write Yourself a Git
2021-03-13 · via Stonecharioteer on Tech

Camelot: PDF Table Extraction for Humans

Python library for extracting tables from PDF files with high accuracy:

Key Features:

  • Multiple Parsing Methods: Stream and Lattice parsers for different table types
  • High Accuracy: Better results than traditional PDF parsing tools
  • Pandas Integration: Direct conversion to pandas DataFrames
  • Visual Debugging: Plot detected table areas for validation
  • Customizable: Fine-tune detection parameters

Usage Examples:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
import camelot

# Extract tables from PDF
tables = camelot.read_pdf('document.pdf')
print(f"Found {tables.n} tables")

# Convert to pandas DataFrame
df = tables[0].df

# Export to various formats
tables.export('output.csv', f='csv')
tables.export('output.xlsx', f='excel')

When to Use:

  • Financial reports with tabular data
  • Research papers with data tables
  • Government documents with statistics
  • Any structured data trapped in PDFs

PostgreSQL Row Level Security for Multi-Tenancy

Multi-tenant data isolation with PostgreSQL Row Level Security

Advanced PostgreSQL feature for implementing secure multi-tenant applications:

What is Row Level Security (RLS):

  • Fine-Grained Access: Control access at the row level within tables
  • Policy-Based: Define security policies for different user types
  • Transparent: Application code doesn’t need complex filtering logic
  • Performance: Database-level enforcement with optimized execution

Implementation Pattern:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
-- Enable RLS on table
ALTER TABLE customer_data ENABLE ROW LEVEL SECURITY;

-- Create policy for tenant isolation
CREATE POLICY tenant_isolation ON customer_data
    FOR ALL TO tenant_user
    USING (tenant_id = current_setting('app.current_tenant')::integer);

-- Set tenant context in application
SET app.current_tenant = '123';

Benefits:

  • Data Isolation: Strong tenant boundaries
  • Security: Prevents data leaks between tenants
  • Simplicity: Reduces application complexity
  • Compliance: Meets regulatory requirements

Use Cases:

  • SaaS applications with multiple customers
  • Healthcare systems with patient data isolation
  • Financial applications with account separation
  • Any system requiring strict data boundaries

Zerodha Varsity - Free Trading Education

Varsity by Zerodha – Markets, Trading, and Investing Simplified

Comprehensive free educational platform for learning about financial markets:

What It Covers:

Trading Fundamentals:

  • Technical Analysis: Chart patterns, indicators, candlesticks
  • Fundamental Analysis: Financial statement analysis, valuation
  • Options Trading: Strategies, Greeks, risk management
  • Futures Trading: Contract specifications, margin requirements

Investment Concepts:

  • Portfolio Management: Asset allocation, diversification
  • Risk Management: Position sizing, stop losses
  • Market Psychology: Behavioral finance concepts
  • Regulatory Framework: SEBI guidelines and compliance

Learning Approach:

  • Module-Based: Structured progression from basics to advanced
  • Practical Examples: Real market scenarios and case studies
  • Interactive Content: Charts, calculators, and tools
  • Free Access: No cost for high-quality financial education

Value Proposition:

  • Democratizes financial education
  • Reduces dependence on expensive courses
  • Provides practical, actionable knowledge
  • Builds informed investor community

Write yourself a Git!

Educational project to understand Git internals by implementing it from scratch:

Learning Objectives:

  • Version Control Concepts: How distributed version control works
  • Git Internals: Objects, references, and storage mechanisms
  • Data Structures: Trees, graphs, and hash-based storage
  • File System Operations: Blob storage and retrieval

Key Components to Implement:

Core Git Objects:

  • Blobs: File content storage
  • Trees: Directory structure representation
  • Commits: Snapshots with metadata
  • Tags: Named references to commits

Repository Operations:

  • Initialize: Create new repository structure
  • Add: Stage files for commit
  • Commit: Create snapshot with message
  • Checkout: Switch between branches/commits

Implementation Benefits:

  • Deep Understanding: Learn Git’s internal architecture
  • Debugging Skills: Better troubleshoot Git issues
  • System Design: Understand distributed system principles
  • Programming Practice: File I/O, data structures, algorithms

Technologies Used:

  • Python for implementation simplicity
  • File system operations for storage
  • Hashing algorithms for content addressing
  • Command-line interface design

Each resource provides deep technical knowledge in its domain - from practical data extraction to advanced database security, financial literacy, and version control system design.