惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

www.infosecurity-magazine.com
www.infosecurity-magazine.com
Security Archives - TechRepublic
Security Archives - TechRepublic
TaoSecurity Blog
TaoSecurity Blog
Cloudbric
Cloudbric
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
N
News and Events Feed by Topic
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
S
Securelist
The Cloudflare Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
D
DataBreaches.Net
S
Schneier on Security
L
LangChain Blog
Jina AI
Jina AI
M
MIT News - Artificial intelligence
Recent Announcements
Recent Announcements
T
Tenable Blog
B
Blog RSS Feed
V
Visual Studio Blog
Simon Willison's Weblog
Simon Willison's Weblog
G
Google Developers Blog
T
The Exploit Database - CXSecurity.com
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
WordPress大学
WordPress大学
W
WeLiveSecurity
I
InfoQ
The Hacker News
The Hacker News
雷峰网
雷峰网
月光博客
月光博客
P
Privacy & Cybersecurity Law Blog
O
OpenAI News
Hacker News: Ask HN
Hacker News: Ask HN
T
Threat Research - Cisco Blogs
GbyAI
GbyAI
The Last Watchdog
The Last Watchdog
P
Privacy International News Feed
Cyberwarzone
Cyberwarzone
S
SegmentFault 最新的问题
L
Lohrmann on Cybersecurity
人人都是产品经理
人人都是产品经理
V
V2EX
V
Vulnerabilities – Threatpost
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Cybersecurity and Infrastructure Security Agency CISA
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
T
Troy Hunt's Blog
Application and Cybersecurity Blog
Application and Cybersecurity Blog
阮一峰的网络日志
阮一峰的网络日志
SecWiki News
SecWiki News
Microsoft Azure Blog
Microsoft Azure Blog

cs.DB updates on arXiv.org

Block-Sphere Vector Quantization GroupAffect-4: A Multimodal Dataset of Four-Person Collaborative Interaction CogScale: Scalable Benchmark for Sequence Processing TextAlign: Preference Alignment for Text Rendering with Hierarchical Rewards LogRouter: Adaptive Two-Level LLM Routing for Log Question Answering in Big Data Systems Agentic Cost-Aware Query Planning with Knowledge Distillation for Big Data Analytics Covariance Structure and Coordinate Heterogeneity Govern Binary Quantization of Contrastive Embeddings IVF-TQ: Calibration-Free Streaming Vector Search via a Codebook-Free Residual Layer Automatic Unsupervised Ensemble Outlier Model Selection--Extended Version A Generative AI Framework for Intelligent Utility Billing CO 2 Analytics and Sustainable Resource Optimisation Towards Foundation Models for Relational Databases with Language Models and Graph Neural Networks Gaussian Relational Graph Transformer Croissant Baker: Metadata Generation for Discoverable, Governable, and Reusable ML Datasets Reducing Hallucination in Vision-Language Models via Stage-wise Preference Optimization under Distribution Shift A Horn extension of DL-Lite with NL data complexity 3D Primitives are a Spatial Language for VLMs Enabling AI-Native Mobility in 6G: A Real-World Dataset for Handover, Beam Management, and Timing Advance A CAP-like Trilemma for Large Language Models: Correctness, Non-bias, and Utility under Semantic Underdetermination EpiCastBench: Datasets and Benchmarks for Multivariate Epidemic Forecasting FERMI: Exploiting Relations for Membership Inference Against Tabular Diffusion Models Toward Multi-Database Query Reasoning for Text2Cypher Autonomous FAIR Digital Objects: From Passive Assertions to Active Knowledge HOME-KGQA: A Benchmark Dataset for Multimodal Knowledge Graph Question Answering on Household Daily Activities Detect, Localize, and Explain: Interactive Hierarchical Log Anomaly Analytics with LLM Augmentation Open Ontologies: Tool-Augmented Ontology Engineering with Stable Matching Alignment Machine Learning-Based Pre-Test Risk Stratification for PCR-Confirmed Chlamydia Using Patient-Reported Data and Urine Biomarkers Reconciling Consistency-Based Diagnosis with Actual-Causality-Based Explanations PrepBench: How Far Are We from Natural-Language-Driven Data Preparation? Anatomy of a Query: W5H Dimensions and FAR Patterns for Text-to-SQL Evaluation Building informative materials datasets beyond targeted objectives Cross-Model Consistency of Feature Importance in Electrospinning: Separating Robust from Model-Dependent Features LUCAS-MEGA: A Large-Scale Multimodal Dataset for Representation Learning in Soil-Environment Systems Inconsistent Databases and Argumentation Frameworks with Collective Attacks Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies FINER-SQL: Boosting Small Language Models for Text-to-SQL Efficient Temporal Datalog Materialisation for Composite Event Recognition EGREFINE: An Execution-Grounded Optimization Framework for Text-to-SQL Schema Refinement Reliable Answers for Recurring Questions: Boosting Text-to-SQL Accuracy with Template Constrained Decoding FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting ObjectGraph: From Document Injection to Knowledge Traversal -- A Native File Format for the Agentic Era A Toolkit for Detecting Spurious Correlations in Speech Datasets SiriusHelper: An LLM Agent-Based Operations Assistant for Big Data Platforms Evergreen: Efficient Claim Verification for Semantic Aggregates CacheRAG: A Semantic Caching System for Retrieval-Augmented Generation in Knowledge Graph Question Answering Health System Scale Semantic Search Across Unstructured Clinical Notes Mining Negative Sequential Patterns to Improve Viral Genomic Feature Representation and Classification Prior-Aligned Data Cleaning for Tabular Foundation Models Spark Policy Toolkit: Semantic Contracts and Scalable Execution for Policy Learning in Spark Versioned Late Materialization for Ultra-Long Sequence Training in Recommendation Systems at Scale EPM-RL: Reinforcement Learning for On-Premise Product Mapping in E-Commerce How Hard is it to Decide if a Fact is Relevant to a Query? Towards Universal Tabular Embeddings: A Benchmark Across Data Tasks Using ASP(Q) to Handle Inconsistent Prioritized Data A Demonstration of SQLyzr: A Platform for Fine-Grained Text-to-SQL Evaluation and Analysis Self-Aware Vector Embeddings for Retrieval-Augmented Generation: A Neuroscience-Inspired Framework for Temporal, Confidence-Weighted, and Relational Knowledge VTouch++: A Multimodal Dataset with Vision-Based Tactile Enhancement for Bimanual Manipulation Pre-Execution Query Slot-Time Prediction in Cloud Data Warehouses: A Feature-Scoped Machine Learning Approach Revisiting RaBitQ and TurboQuant: A Symmetric Comparison of Methods, Theory, and Experiments DW-Bench: Benchmarking LLMs on Data Warehouse Graph Topology Reasoning PersonalHomeBench: Evaluating Agents in Personalized Smart Homes NeuroLip: An Event-driven Spatiotemporal Learning Framework for Cross-Scene Lip-Motion-based Visual Speaker Recognition Blue Data Intelligence Layer: Streaming Data and Agents for Multi-source Multi-modal Data-Centric Applications RELOAD: A Robust and Efficient Learned Query Optimizer for Database Systems Credo: Declarative Control of LLM Pipelines via Beliefs and Policies Leveraging LLM-GNN Integration for Open-World Question Answering over Knowledge Graphs IndicDB -- Benchmarking Multilingual Text-to-SQL Capabilities in Indian Languages Multi-modal panoramic 3D outdoor datasets for place categorization Gypscie: A Cross-Platform AI Artifact Management System ODUTQA-MDC: A Task for Open-Domain Underspecified Tabular QA with Multi-turn Dialogue-based Clarification Graph Query Generation with Constraint-guided Large Language Agents CubeGraph: Efficient Retrieval-Augmented Generation for Spatial and Temporal Data LLM+Graph@VLDB'2025 Workshop Summary Memory in the LLM Era: Modular Architectures and Strategies in a Unified Framework Stream2LLM: Overlap Context Streaming and Prefill for Reduced Time-to-First-Token (TTFT) HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness Exploring Urban Land Use Patterns by Pattern Mining and Unsupervised Learning 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models From Natural Language to PromQL: A Catalog-Driven Framework with Dynamic Temporal Resolution for Cloud-Native Observability A Domain-Specific Language for LLM-Driven Trigger Generation in Multimodal Data Collection SpotIt+: Verification-based Text-to-SQL Evaluation with Database Constraints Relational In-Context Learning via Synthetic Pre-training with Structural Prior A Pythonic Functional Approach for Semantic Data Harmonisation in the ILIAD Project TableNet A Large-Scale Table Dataset with LLM-Powered Autonomous DPSQL+: A Differentially Private SQL Library with a Minimum Frequency Rule Sonar-TS: Search-Then-Verify Natural Language Querying for Time Series Databases KRONE: Scalable LLM-Augmented Log Anomaly Detection via Hierarchical Abstraction Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models OVT-MLCS: An Online Visual Tool for MLCS Mining from Long or Big Sequences Sufficient Explanations in Databases and their Connections to Database Repairs Gradient-Based Join Ordering Presenting DiaData for Research on Type 1 Diabetes Factual Inconsistencies in Multilingual Wikipedia Tables MINT: Multi-Vector Search Index Tuning Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL In-depth Analysis of Graph-based RAG in a Unified Framework Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation Goal-Driven Query Answering over First- and Second-Order Dependencies with Equality BEAVER: An Enterprise Benchmark for Text-to-SQL Auto-FP: An Experimental Study of Automated Feature Preprocessing for Tabular Data Querying Inconsistent Prioritized Data with ORBITS: Algorithms, Implementation, and Experiments
Indexing Temporal Relations for Range-Duration Queries
Matteo Ceccarello, Anton Dignös, Johann Gamper, Christina Khnais · 2022-06-15 · via cs.DB updates on arXiv.org

Temporal information plays a crucial role in many database applications, however support for queries on such data is limited. We present an index structure, termed RD-index, to support range-duration queries over interval timestamped relations, which constrain both the range of the tuples' positions on the timeline and their duration. RD-index is a grid structure in the two-dimensional space, representing the position on the timeline and the duration of timestamps, respectively. Instead of using a regular grid, we consider the data distribution for the construction of the grid in order to ensure that each grid cell contains approximately the same number of intervals. RD-index features provable bounds on the running time of all the operations, allow for a simple implementation, and supports very predictable query performance. We benchmark our solution on a variety of datasets and query workloads, investigating both the query rate and the behavior of the individual queries. The results show that RD-index performs better than the baselines on range-duration queries, for which it is explicitly designed. Furthermore, it outperforms specialized indexes also on workloads containing queries constraining either only the duration or the range.