A Multitask Transformer for Offensive Language Detection and Target Identification in HateBR
2026-04-13 · via Paper Index on ACL Anthology

Abstract

Hate speech detection is often treated as a binary task, ignoring the hierarchical nature of toxicity, such as severity levels and specific target groups. This work presents a Multitask Learning (MTL) approach for the HateBR dataset, utilizing a shared BERTimbau encoder to simultaneously predict binary offensiveness, ordinal severity, and hate speech targets. Our experiments demonstrate that the MTL architecture outperforms Single-Task baselines on the primary offensive detection task, increasing the Matthews Correlation Coefficient from 0.80 to 0.82. Beyond predictive performance, we show that joint training implicitly enforces hierarchical sanity: the unified model yields a 0% target-inconsistency rate (i.e., no cases where a comment is predicted Non-offensive while still assigned a hate target). However, we observe negative transfer in the fine-grained multilabel target task (Micro-F1 drops from 0.59 to 0.42), highlighting a trade-off between logical consistency and target attribution under extreme imbalance.
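The two headline measurements in the abstract, the Matthews Correlation Coefficient on the binary offensive task and the target-inconsistency rate, can be illustrated with a small standard-library sketch. This is not the authors' evaluation code. The function names, the 0/1 label encoding, and the choice to divide the inconsistency count by the total number of comments are all assumptions made for illustration; the abstract does not state the denominator.

```python
from math import sqrt


def matthews_corrcoef(y_true, y_pred):
    """Binary MCC for 0/1 labels; returns 0.0 when the denominator is zero."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def target_inconsistency_rate(offensive_pred, target_pred):
    """Share of comments predicted non-offensive (0) that still carry a target.

    offensive_pred: list of 0/1 predictions, one per comment.
    target_pred: list of multilabel 0/1 vectors, one per comment.
    Assumption: the rate is taken over all comments.
    """
    if not offensive_pred:
        return 0.0
    inconsistent = sum(
        1
        for off, targets in zip(offensive_pred, target_pred)
        if off == 0 and any(targets)
    )
    return inconsistent / len(offensive_pred)


# Toy data, invented for illustration only (not from HateBR).
toy_true = [1, 1, 0, 0]
toy_pred = [1, 0, 0, 0]
toy_targets = [[0, 1, 0], [0, 0, 0], [0, 0, 0], [0, 0, 0]]

mcc = matthews_corrcoef(toy_true, toy_pred)
rate = target_inconsistency_rate(toy_pred, toy_targets)
```

A rate of 0.0 from `target_inconsistency_rate` corresponds to the 0% figure reported for the unified model: no comment is labeled non-offensive while also being assigned a hate target.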

Anthology ID: 2026.propor-1.109
Volume: Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Month: April
Year: 2026
Address: Salvador, Brazil
Editors: Marlo Souza, Iria de-Dios-Flores, Diana Santos, Larissa Freitas, Jackson Wilke da Cruz Souza, Eugénio Ribeiro
Venue: PROPOR
Publisher: Association for Computational Linguistics
Pages: 1049–1054
URL: https://aclanthology.org/2026.propor-1.109/
Cite (ACL): Guilherme Silva, Pedro Silva, Matheus Peixoto, Gladston Moreira, and Eduardo Luz. 2026. A Multitask Transformer for Offensive Language Detection and Target Identification in HateBR. In Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1, pages 1049–1054, Salvador, Brazil. Association for Computational Linguistics.
Cite (Informal): A Multitask Transformer for Offensive Language Detection and Target Identification in HateBR (Silva et al., PROPOR 2026)
PDF: https://aclanthology.org/2026.propor-1.109.pdf