Tests for categorical data beyond Pearson: A distance covariance and energy distance approach
Fernando Castro-Prado, Wenceslao González-Manteiga, Javier Costa · 2024-03-19 · via stat updates on arXiv.org

Categorical variables are of utmost importance in biomedical research. When two of them are considered, one often wants to test whether or not they are statistically dependent. We show weaknesses of classical methods, such as Pearson's chi-squared test and the G-test, and we propose testing strategies based on distances that lack those drawbacks. We first develop this theory for classical two-dimensional contingency tables, within the context of distance covariance, an association measure that characterizes general statistical independence of two variables. We then apply the same fundamental ideas to one-dimensional tables, namely to testing goodness of fit to a discrete distribution, for which we resort to an analogous statistic called energy distance. We prove that our methodology has desirable theoretical properties, and we show that we can calibrate the null distribution of our test statistics without resampling. We illustrate all this in simulations, as well as with some real data examples, demonstrating the adequate performance of our approach for biostatistical practice.
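
To make the idea of distance covariance on a contingency table concrete, here is a minimal sketch in Python. It is not taken from the paper. It assumes the discrete metric (distance 0 between equal categories, 1 otherwise), which the abstract does not specify, and it computes the standard empirical (V-statistic) squared distance covariance from double-centered distance matrices. The function names are hypothetical. Under this assumed metric, the statistic reduces algebraically to the sum of squared differences between observed joint proportions and products of marginal proportions, which the second function evaluates directly. The sketch covers only the statistic, not the calibration of its null distribution.

```python
def expand_table(table):
    """Turn a contingency table of counts into a list of (row, col) observations."""
    return [
        (i, j)
        for i, row in enumerate(table)
        for j, count in enumerate(row)
        for _ in range(count)
    ]


def centered_discrete_distances(labels):
    """Double-centered matrix of discrete-metric distances (assumed metric)."""
    n = len(labels)
    d = [[0.0 if a == b else 1.0 for b in labels] for a in labels]
    row_means = [sum(r) / n for r in d]
    grand_mean = sum(row_means) / n
    # d is symmetric, so the column means equal the row means.
    return [
        [d[k][l] - row_means[k] - row_means[l] + grand_mean for l in range(n)]
        for k in range(n)
    ]


def dcov_sq(table):
    """Empirical squared distance covariance (V-statistic) of a count table."""
    sample = expand_table(table)
    n = len(sample)
    a = centered_discrete_distances([x for x, _ in sample])
    b = centered_discrete_distances([y for _, y in sample])
    return sum(a[k][l] * b[k][l] for k in range(n) for l in range(n)) / n**2


def dcov_sq_closed_form(table):
    """Same quantity under the discrete metric: sum of (p_ij - p_i. * p_.j)^2."""
    n = sum(map(sum, table))
    row_tot = [sum(r) for r in table]
    col_tot = [sum(c) for c in zip(*table)]
    return sum(
        (table[i][j] / n - row_tot[i] * col_tot[j] / n**2) ** 2
        for i in range(len(row_tot))
        for j in range(len(col_tot))
    )


if __name__ == "__main__":
    counts = [[12, 5, 3], [4, 9, 7]]  # made-up counts, for illustration only
    print(dcov_sq(counts), dcov_sq_closed_form(counts))
```

The pairwise version costs quadratic time and memory in the sample size, whereas the closed form needs only the table itself. This is one reason a table-level expression is convenient for categorical data.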