惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

GbyAI
GbyAI
T
Tenable Blog
Webroot Blog
Webroot Blog
L
Lohrmann on Cybersecurity
S
Securelist
S
Schneier on Security
NISL@THU
NISL@THU
Know Your Adversary
Know Your Adversary
C
Cybersecurity and Infrastructure Security Agency CISA
T
The Exploit Database - CXSecurity.com
L
LINUX DO - 热门话题
C
CXSECURITY Database RSS Feed - CXSecurity.com
O
OpenAI News
I
Intezer
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
TaoSecurity Blog
TaoSecurity Blog
S
Secure Thoughts
Application and Cybersecurity Blog
Application and Cybersecurity Blog
P
Privacy International News Feed
H
Hacker News: Front Page
N
Netflix TechBlog - Medium
M
MIT News - Artificial intelligence
博客园 - Franky
PCI Perspectives
PCI Perspectives
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Microsoft Azure Blog
Microsoft Azure Blog
MongoDB | Blog
MongoDB | Blog
L
LangChain Blog
P
Proofpoint News Feed
S
Security Affairs
WordPress大学
WordPress大学
The Last Watchdog
The Last Watchdog
S
SegmentFault 最新的问题
小众软件
小众软件
F
Full Disclosure
博客园 - 叶小钗
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
T
The Blog of Author Tim Ferriss
Simon Willison's Weblog
Simon Willison's Weblog
P
Palo Alto Networks Blog
Security Latest
Security Latest
P
Proofpoint News Feed
月光博客
月光博客
T
Tailwind CSS Blog
Scott Helme
Scott Helme
Hacker News - Newest:
Hacker News - Newest: "LLM"
Google Online Security Blog
Google Online Security Blog
T
Threat Research - Cisco Blogs
Help Net Security
Help Net Security
Project Zero
Project Zero

Stonecharioteer on Tech

I Traced My Traffic Through a Home Tailscale Exit Node What Was I Reading Last? In Three Not-So-Easy Pieces Dogfooding Is Hard Code blocks in your books, finally GoForGo v0.9.0 Merrilin - We built an app to read books I use a Macbook now Data Structures & Algorithms - Preparing for Interviews Using a local DNS namespace for local service discovery Direction KOllector - Publishing KOReader Highlights gbt: branches touched in the last 24 hours A Soiree into Symbols in Ruby Some Smalltalk about Ruby Loops Ruby Blocks Returning from Ruby Blocks, Procs and Lambdas My Linux Laptop Finally Works: How Claude Helped Me Fix Years of Annoyances TIL: Watchexec - Modern File Watching for Development Workflows A Less Busy Mind GoForGo - Learn Go through live examples Migrating My Old Blog to Hugo with Claude The Qtile Window Manager: A Python-Powered Tiling Experience Read the RFCs that Built the Internet Py-x-Protobuf - Or How I Learned to Stop Worrying and Love Protocol Buffers Python Reverse a List New Beginnings Leaving ChainSafe Systems Screen Lock for Cinnamon Desktop using Zenity and Terminal Commands Crews Not Teams A System for Getting Better at LeetCode So Far So Rust Retrying HTTP Requests with Rust A Primer on Control Charts Learning Rust Explicit is Better than Implicit: Rust for Pythonistas Using Custom Delimiters in Jinja Templates TIL: Creating Fixed Length Iterables in Python Documentation Without Assumption Vagrant Python - A Reflection in 2022 Learning Golang No, A Virtual Machine Is Not Enough: Why Developers Need Native Linux Empathy in Tech For Those Who Came in Late A Weekend With PostgreSQL TIL: Gooey and Python Fire for Quick GUIs and CLIs TIL: 2ality - Dr. Axel Rauschmayer's JavaScript Blog TIL: MassDNS - High-Performance Bulk DNS Lookups TIL: Matomo Analytics, Google Tech Writing, Memory Programming, and NES TV Signals TIL: MontyDB - MongoDB Implemented in Python Returning to the Craft of Programming TIL: CPUFetch, OneFetch, and Learn CSS TIL: DNS Performance Testing and Pi-hole with Unbound TIL: Eli Bendersky's Blog, Awesome By Example, NoCoDB, and Martin Kleppmann TIL: CRDTs, Extreme HTTP Performance, and BYTEPATH Game TIL: AutoInvent, ASGI, Python Packaging, RAPIDS GPU Computing, and FlaskCon TIL: MangaDesk - Terminal Client for MangaDex TIL: McFly - Smart Shell History Search TIL: Siege Load Testing and Awesome FastAPI Resources TIL: Ventoy Bootable USB and Justniffer Network Analysis TIL: CLI Code Review, Git Split Diffs, and Internal Combustion Engine TIL: Benford's Law, Web Security Headers, Event Sourcing, and Mozilla Security Guidelines How to Write Documentation - The README.md File The Importance of Documentation TIL: NNgroup UX Research, SponsorBlock, and Labella Python Library TIL: The Little Book of Rust Macros and Rust Performance Book TIL: Git-Bug Distributed Issue Tracker and Omni Kubernetes Monitoring TIL: Zellij - Modern Terminal Multiplexer TIL: How Discord Handles 2.5 Million Concurrent Voice Users TIL: Volumio - The Audiophile Music Player TIL: Areopagitica - Milton's Defense of Free Speech TIL: Fast Node Manager, Zoxide Smart CD, Technical Writing, PyO3, and Qubes OS TIL: Slurm Workload Manager for HPC Clusters TIL: Data Visualization Guide and Oso Authorization Academy TIL: CORS Deep Dive, Piku Tiny PaaS, Rust Strings, and Deno Standard Library TIL: Raspberry Pi OS Development, Vim Beginner Guide, Password Management, and QueryBook TIL: uBlock Origin Performance Optimization on Firefox TIL: Breaking PostgreSQL at Scale and LeetCode Problem Patterns TIL: Awesome Tmux Resources for Terminal Multiplexing TIL: Grit - A Multitree-Based Personal Task Manager TIL: Lens 4.2 Kubernetes IDE, Shell Scripting Guide, and Dark HTTP Server Do The Job You Hate So You Won't Hate The Job You Love TIL: Innernet VPN Solution and NoteCalc Calculator App TIL: Argo CD for GitOps and Lens Kubernetes IDE TIL: Modern Rust CLI Tools - System Monitoring, HTTP Requests, and DNS TIL: tz - A Time Zone Helper Tool TIL: Distributed Systems Education, Fallacies, and Self-Hosted Internet Archiving TIL: Real-Time Voice Cloning Technology TIL: ChartMuseum for Helm, AMD's Corporate Journey, and Kubernetes Pod Scaling TIL: Docker and Kubernetes Tools - Whaler, Descheduler, and Dive TIL: Post-Mortem Collection, Terminal Plotting, and Technical Twitter TIL: Dark Mode Toggle Web Component by Google Chrome Labs TIL: Python eval(), exec(), and compile() Functions TIL: fuser Command for Process and File Investigation TIL: i Hate Regex - The Ultimate Regex Cheat Sheet TIL: Dolt - Git for Data and Database Version Control TIL: x86 Assembly Programming and SafeEyes Break Reminder TIL: Comprehensive Distributed Systems Reading List TIL: Cosmopolitan C Library, Distributed Systems Book, High Performance Browser Networking, and Rust Roguelike Tutorial TIL: ABlog for Sphinx - Documentation as a Blog Platform
TIL: Camelot PDF Tables, PostgreSQL Row Level Security, Zerodha Varsity, and Write Yourself a Git
2021-03-13 · via Stonecharioteer on Tech

Camelot: PDF Table Extraction for Humans

Python library for extracting tables from PDF files with high accuracy:

Key Features:

  • Multiple Parsing Methods: Stream and Lattice parsers for different table types
  • High Accuracy: Better results than traditional PDF parsing tools
  • Pandas Integration: Direct conversion to pandas DataFrames
  • Visual Debugging: Plot detected table areas for validation
  • Customizable: Fine-tune detection parameters

Usage Examples:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
import camelot

# Extract tables from PDF
tables = camelot.read_pdf('document.pdf')
print(f"Found {tables.n} tables")

# Convert to pandas DataFrame
df = tables[0].df

# Export to various formats
tables.export('output.csv', f='csv')
tables.export('output.xlsx', f='excel')

When to Use:

  • Financial reports with tabular data
  • Research papers with data tables
  • Government documents with statistics
  • Any structured data trapped in PDFs

PostgreSQL Row Level Security for Multi-Tenancy

Multi-tenant data isolation with PostgreSQL Row Level Security

Advanced PostgreSQL feature for implementing secure multi-tenant applications:

What is Row Level Security (RLS):

  • Fine-Grained Access: Control access at the row level within tables
  • Policy-Based: Define security policies for different user types
  • Transparent: Application code doesn’t need complex filtering logic
  • Performance: Database-level enforcement with optimized execution

Implementation Pattern:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
-- Enable RLS on table
ALTER TABLE customer_data ENABLE ROW LEVEL SECURITY;

-- Create policy for tenant isolation
CREATE POLICY tenant_isolation ON customer_data
    FOR ALL TO tenant_user
    USING (tenant_id = current_setting('app.current_tenant')::integer);

-- Set tenant context in application
SET app.current_tenant = '123';

Benefits:

  • Data Isolation: Strong tenant boundaries
  • Security: Prevents data leaks between tenants
  • Simplicity: Reduces application complexity
  • Compliance: Meets regulatory requirements

Use Cases:

  • SaaS applications with multiple customers
  • Healthcare systems with patient data isolation
  • Financial applications with account separation
  • Any system requiring strict data boundaries

Zerodha Varsity - Free Trading Education

Varsity by Zerodha – Markets, Trading, and Investing Simplified

Comprehensive free educational platform for learning about financial markets:

What It Covers:

Trading Fundamentals:

  • Technical Analysis: Chart patterns, indicators, candlesticks
  • Fundamental Analysis: Financial statement analysis, valuation
  • Options Trading: Strategies, Greeks, risk management
  • Futures Trading: Contract specifications, margin requirements

Investment Concepts:

  • Portfolio Management: Asset allocation, diversification
  • Risk Management: Position sizing, stop losses
  • Market Psychology: Behavioral finance concepts
  • Regulatory Framework: SEBI guidelines and compliance

Learning Approach:

  • Module-Based: Structured progression from basics to advanced
  • Practical Examples: Real market scenarios and case studies
  • Interactive Content: Charts, calculators, and tools
  • Free Access: No cost for high-quality financial education

Value Proposition:

  • Democratizes financial education
  • Reduces dependence on expensive courses
  • Provides practical, actionable knowledge
  • Builds informed investor community

Write yourself a Git!

Educational project to understand Git internals by implementing it from scratch:

Learning Objectives:

  • Version Control Concepts: How distributed version control works
  • Git Internals: Objects, references, and storage mechanisms
  • Data Structures: Trees, graphs, and hash-based storage
  • File System Operations: Blob storage and retrieval

Key Components to Implement:

Core Git Objects:

  • Blobs: File content storage
  • Trees: Directory structure representation
  • Commits: Snapshots with metadata
  • Tags: Named references to commits

Repository Operations:

  • Initialize: Create new repository structure
  • Add: Stage files for commit
  • Commit: Create snapshot with message
  • Checkout: Switch between branches/commits

Implementation Benefits:

  • Deep Understanding: Learn Git’s internal architecture
  • Debugging Skills: Better troubleshoot Git issues
  • System Design: Understand distributed system principles
  • Programming Practice: File I/O, data structures, algorithms

Technologies Used:

  • Python for implementation simplicity
  • File system operations for storage
  • Hashing algorithms for content addressing
  • Command-line interface design

Each resource provides deep technical knowledge in its domain - from practical data extraction to advanced database security, financial literacy, and version control system design.