惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

cs.CL updates on arXiv.org

Temporal Concept Drift in Legal Judgment Prediction: Neural Baselines Across Three Epochs of Ukrainian Court Decisions World-State Transformations for Neuro-symbolic Interactive Storytelling ROC Analysis for Evaluating Translation Quality Estimation Systems Repeated Sequences Reveal Gaps between Large Language Models and Natural Language Unveil: Unified Visual-Textual Integration and Distillation for Multi-modal Document Retrieval Discovering Lexical Gaps Using Embeddings from Multilingual LLMs Measuring the Depth of LLM Unlearning via Activation Patching Generating Legal Commentaries from Case Databases via Retrieval, Clustering, and Generation AstroMind: A High-Fidelity Benchmark for Spacecraft Behavior Reasoning Based on Large Language Models EchoDistill:Alignment Noisy-to-Clean Self-Distillation for Robust Audio LLMs Faithful or Fabricated? A Causal Framework for Rationalization Bias in LLM Judges End-to-End Intracortical Speech Decoding from Neural Activity Multi-Persona Debate System for Automated Scientific Hypothesis Generation An Interactive Paradigm for Deep Research DTO: a Differentiable Training Objective for Effective Counterfactual Story Rewriting Found in Conversation: LLMs Teach Themselves to Close the Multi-Turn Gap Document Classification Pattern Recognition via Information Fusion: A Systematic Review of Multimodal and Multiview Representation Approaches Distinguishing Right from Wrong in Debates: Attribution Analysis of Chinese Harmful Memes A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks The Tokenizer Tax Across 25 European Languages: Domain Invariance, Cross-Lingual Few-Shot Effects, and the Ukrainian Penalty Lngram: N-gram Conditional Memory in Latent Space Raon-Speech Technical Report CUNY at CLPsych 2026: A Pipeline Approach to Classification and Summarization of Mental Health Changes Improving the Completeness and Comparability of Segment Disclosures: A Large Language Model Approach Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs Phonetic Modeling of Dialectal Variation in Vietnamese Speech DRInQ: Evaluating Conversational Implicature with Controlled Context Variation Towards a Universal Causal Reasoner StepGap: A Hybrid NLI-LLM Checker for Step-Level Evidence-Gap Detectionin Multi-Hop Question Answering TS-Skill: A Benchmark for Evaluating Analytical Skills in Time-Series Question Answering Who judges the judges? Governance from metrics: a runtime framework for continuous LLM compliance monitoring WhenLoss: Diagnosing Write and Retrieval Bottlenecks in Long-Context Memory Systems SLAP: Stratified Loss-based Pruning for On-Policy Data-Efficient Instruction Tuning Know You Before You Speak: User-State Modeling for LLM Personalization in Multi-Turn Conversation The Path Matters: Learning a Token-Commitment Policy for Diffusion Language Models CSP-Atlas: Concept-Specific Neural Circuits in a Sparse Python Transformer Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation HiMed: Incentivizing Hindi Reasoning in Medical LLMs When Reasoning Hurts: Source-Aware Evaluation of Frontier LLMs for Clinical SOAP Note Generation Beyond the Target: From Imitation to Collaboration in Speculative Decoding Quantifying the Impact of Translation Errors on Multilingual LLM Evaluation Toxicity in Twitch Chats: An LLM-Based Analysis Across Gaming Communities Extracting Training Data from Diffusion Language Models via Infilling Direct Preference Optimization for English-Mandarin Code-Switching Speech Recognition in Audio LLMs ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions Decompose-and-Refine: Structured Legal Question Answering with Parametric Retrieval SEAL: Synergistic Co-Evolution of Agents and Learning Environments How Much Structure Do LLMs Need? Evaluating LLMs for Bibliometric Cluster Description QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks Word Class Representations Spontaneously Emerge from Successor Representations Trained on Natural Language Structure-Aware RAG: Structured Retrieval Augmented Generation from Noisy Data for Conversational Agents Guarded Repair for Harm-Aware Post-hoc Replacement of LLM Mathematical Reasoning TriVAL: A Tri-Validation Framework for Faithful Automatic Optimization Modeling AERIC: Anticipatory Hidden-State Monitoring for Implicit Harmful Dialogue Grammatically-Guided Sparse Attention for Efficient and Interpretable Transformers Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation Translators as Invisible Teachers of AI: Copyright, Translation Memory, and the Political Economy of Linguistic Data CP-Agent: A Calibrated Risk-Controlled Agent for Feedback-Driven Competitive Programming Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum Differences in Typological Alignment in Language Models' Treatment of Differential Argument Marking Improving Sampling for Masked Diffusion Models via Information Gain OnePred: Next-Query Prediction via Recursive Intent Memory in Multi-Turn Conversations Robust LLM Watermarking with Minimal Semantic Distortion for IP Protection How Human-Like Are Large Language Models? A Register-Aware Linguistic Evaluation Framework SemEval-2026 Task 6: CLARITY -- Unmasking Political Question Evasions FastKernels: Benchmarking GPU Kernel Generation in Production ETCHR: Editing To Clarify and Harness Reasoning PrefBench: Evaluating Zero-Shot LLM Agents in Hidden-Preference Personalized Pricing Negotiations OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents SkillOpt: Executive Strategy for Self-Evolving Agent Skills BURMESE-SAN: Burmese NLP Benchmark for Evaluating Large Language Models Evaluating Customized vs. Generalist Transformer-based Models for Legal Contract Classification The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models When Do LLMs Reason? A Dynamical Systems View via Entropy Phase Transitions Multilingual Knowledge Transfer under Data Constraints via Lexical Interventions Decomposing Queries into Tool Calls for Long-Video Keyframe Retrieval Multi-Gate Residuals CultivAgents: Cultivating Relationship-Centered Multi-Agent Systems for Personalized Gardening Benchmarking Gaslighting Attacks Against Speech Large Language Models Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches Transcoders Trace Visual Grounding and Hallucinations in Vision-Language Models InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion Metadata Predictability Is Not Evidence Dependence: An Intervention-Based Audit for Weak-Label Benchmarks DELICATE: Diachronic Entity LInking using Classes And Temporal Evidence Strategic Coercion Within Alliances: The Greenland Sovereignty Game as an AI Stress Test ChartFI: Benchmarking Faithfulness and Insightfulness of Chart Descriptions from Multimodal Large Language Models RADAR: Relative Angular Divergence Across Representations TEAM: Temporal-Spatial Consistency Guided Expert Activation for MoE Diffusion Language Model Acceleration Strong Teacher Not Needed? On Distillation in LLM Pretraining AI-Friendly LaTeX: Using LaTeX Code as a Knowledge Source for Retrieval-Augmented Generation DiLaDiff: Distilled Latent-Augmented Diffusion for Language Modeling CoSPlay: Cooperative Self-Play at Test-Time with Self-Generated Code and Unit Test Speak-to-Structure: Evaluating LLMs in Open-domain Natural Language-Driven Molecule Generation GEMQ: Global Expert-Level Mixed-Precision Quantization for MoE LLMs Decomposing and Measuring Evaluation Awareness Sparser Block-Sparse Attention via Token Permutation Efficient and Transferable Agentic Knowledge Graph RAG via Reinforcement Learning TurkicNLP: An NLP Toolkit for Turkic Languages ModeSwitch-LLM: A Lightweight Phase-Aware Controller for Cross-Mode LLM Inference on a Single GPU What Does the Server See? Understanding Privacy Leakage from Large Language Models in Split Inference
Side-by-side Comparison Amplifies Dialect Bias in Language Models
Kritee Konda · 2026-05-26 · via cs.CL updates on arXiv.org

View PDF HTML (experimental)

Abstract:Language models (LMs) can exhibit systematic biases against speakers based on variations in their dialects, even in the absence of a dialect label, a behavior known as covert dialect bias. In this work, we quantify covert dialect bias in online discourse by evaluating how LMs associate stereotypical traits (derived from social psychology research on racial bias) with intent-equivalent tweets in Standard American English (SAE) and African-American Vernacular English (AAVE). While prior work shows that LMs associate more negative stereotypes with AAVE when evaluating tweets in isolation, we are surprised to find that this bias is significantly exacerbated when SAE / AAVE tweet pairs are compared side by side, a setting that more closely reflects high-impact decision making contexts in which models are used to rank candidates. The bias only worsens when dialect labels are explicitly specified. This is striking, given the extensive efforts from commercial developers to mitigate bias in their LMs. Encouragingly, we show that counterfactual fairness finetuning can mitigate covert dialect bias for some stereotypical traits, reducing average disparities when evaluating tweets in isolation, however, these improvements do not consistently hold across traits when evaluating SAE / AAVE tweets side by side. Our findings show that existing evaluation settings for covert dialect bias may underestimate its severity, specifically in contrastive settings. Additionally, overt dialect bias remains pronounced even after safety aligned finetuning, indicating that it remains an unresolved problem, and motivates the need for more robust evaluation and mitigation frameworks.
Comments: In proceeding at ACM Conference on Fairness, Accountability, and Transparency 2026
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as: arXiv:2605.24384 [cs.CL]
  (or arXiv:2605.24384v1 [cs.CL] for this version)
  https://doi.org/10.48550/arXiv.2605.24384

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Kritee Kondapally [view email]
[v1] Sat, 23 May 2026 03:51:44 UTC (6,682 KB)