惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Hackread – Cybersecurity News, Data Breaches, AI and More
S
Schneier on Security
罗磊的独立博客
Recorded Future
Recorded Future
Hacker News - Newest:
Hacker News - Newest: "LLM"
G
Google Developers Blog
博客园_首页
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
T
The Blog of Author Tim Ferriss
Know Your Adversary
Know Your Adversary
L
Lohrmann on Cybersecurity
C
Cybersecurity and Infrastructure Security Agency CISA
博客园 - 三生石上(FineUI控件)
M
MIT News - Artificial intelligence
B
Blog
T
Tor Project blog
D
Docker
Engineering at Meta
Engineering at Meta
Apple Machine Learning Research
Apple Machine Learning Research
Spread Privacy
Spread Privacy
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
Scott Helme
Scott Helme
MyScale Blog
MyScale Blog
量子位
T
The Exploit Database - CXSecurity.com
小众软件
小众软件
aimingoo的专栏
aimingoo的专栏
IT之家
IT之家
AWS News Blog
AWS News Blog
Google Online Security Blog
Google Online Security Blog
NISL@THU
NISL@THU
D
DataBreaches.Net
Help Net Security
Help Net Security
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Cloudbric
Cloudbric
美团技术团队
W
WeLiveSecurity
H
Hacker News: Front Page
宝玉的分享
宝玉的分享
The Cloudflare Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
爱范儿
爱范儿
N
News and Events Feed by Topic
V
Visual Studio Blog
C
CERT Recently Published Vulnerability Notes
T
Tailwind CSS Blog
MongoDB | Blog
MongoDB | Blog
F
Fortinet All Blogs
B
Blog RSS Feed
S
Security Affairs

IEEE Spectrum

How a Forgotten Wire Turned a Cheap Chip Into a Brainlike Neuron Why Does a Bank Need a Chief Scientist? What it Means to Be a Mathematician When AI Does the Math AI Learns the "Dark Art" of RF Chip Design Can AI Learn to Read the Room? Commemorating 70 Years of Artificial Intelligence IEEE Rolls Out Large Language Models Virtual Training Course Can Sound-Driven Synapses Make AI Both Faster and Greener? How AI Attribution Could Finally Pay Musicians for Training Data Inside GM’s AI Push to Speed Up the Design of Cars and Moon Rovers Are Emotion Reading Robots Still Missing What Matters Most? The Google DeepMind Spinoff Chasing Hidden Drug Targets Save 14 Percent of Energy Used in LLM Training With This Trick AI Can Help Track the World’s Shrinking Glaciers Nvidia’s AI Hardware Comes to Windows in RTX Spark PCs Why Quantum Computers Need a ‘Healthy Chunk’ Of Classical Power How Young Engineers Can Turn AI Into Career Leverage Why Aren’t We Measuring How AI Affects Humans? Majestic’s 128TB AI Server Aims to Smash the LLM Memory Wall Finding Success in Industry as a Chip Designer Why South Africa’s AI Policy Leverage Is Slipping Away Unused AI and Thermal Cameras Help Ships Steer Clear Of Gray Whales Why Reclaiming ‘Social Engineering’ Could Protect Your Autonomy AI with Model-Based Design: Virtual Sensor Modeling - Wiley Science and Engineering Content Hub Millimeter Waves Turn Tiny Insects Into Trackable Data Māori AI Voice Puts Language Ownership Back In Community Hands Open-Source AI Could Make It Easier to Build Smart Robots The Future of Physical AI Isn’t Smarter Robots, It’s Smarter Interfaces Agentic AI for Robot Teams How Melbourne’s AI and Data Center Flywheel Is Accelerating Research Innovation Hidden Voice Glitches Could Hijack Audio AI Tools AI Rings Turn Sign Language Into Text In Real Time Graphene Leaf Tattoos Turn Plants Into Living Moisture Meters Accelerating Chipmaking Innovation for the Energy-Efficient AI Era Can AI Chatbots Reason Like Doctors? General AI Outruns Specialized Tools at Transcribing Handwriting Neutralizing the Gigascale Problem: How to Solve the Physical Power Paradox of Extreme AI Training Loads Tiny Data Centers at Substations Aim to Keep AI Power Usage In Check Orbital Bets On a Mesh Of GPU Satellites for AI Inference Can AI Really Build Better AI? AI Chatbot Safety Guardrails for Mental Health Ten Key Enablers for 6G Wireless Communications - Wiley Science and Engineering Content Hub
AI Model ConlangCrafter Dreams up Entire New Languages
https://www.facebook.com/48576411181 · 2026-06-27 · via IEEE Spectrum

There are over 7,000 natural languages today, but that doesn’t stop people from occasionally making up completely new ones. These constructed languages, or conlangs, include Dothraki, Klingon, and various Elvish languages. Now, an AI model called ConlangCrafter is also capable of generating new languages—and it is particularly good at it.

In a paper published 27 June in the Proceedings of the Association of Computer Linguists, researchers analyzed ConlangCrafter’s language generation abilities, reporting that it can develop a diverse array of novel languages that consistently abide by their rules.

How ConlangCrafter Creates New Languages

In previous work, Gašper Beguš, an associate professor of linguistics at the University of California, Berkeley, showed how large language models (LLMs) can analyze languages to the same extent as most humans. In his most recent endeavour, he set out to push the language boundaries of AI models even further.

“Creating an entire language is not an easy task at all,” Beguš says, noting that some people have dedicated their careers to creating conlangs for movies, books, and video games.

But Beguš sees additional value in making AI models capable of creating truly novel languages beyond what humans could imagine. “[Models] are able to imagine or come up with things that we might not, and we can learn so much from that,” he says.

For example, ConlangCrafter can create new languages with unconventional communication systems, such as a language for a cephalopod species that uses colors and gestures instead of sounds. Of course, while this “color language” generated by ConlangCrafter isn’t truly what an octopus uses for communication, Beguš envisions these imaginary languages as a means for studying non-human centric languages in greater detail.

Beguš and the rest of the team, including Morris Alper, a postdoctoral researcher at Carnegie Mellon University and Moran Yankua, a Ph.D. student at Tel Aviv University , designed ConlangCrafter so that it can apply a wide range of linguistic rules in terms of how sounds are organized in a language (phonology), the relationship between word and sentence structure (morphosyntax), and vocabulary.

A random number generator regularly introduces variation so that every language comes out different. A built-in editing loop then reviews the result for contradictions and fixes them. Users can choose whatever mix of rules they want, or ask ConlangCrafter to make up its own rules.

“[Models] are able to imagine or come up with things that we might not, and we can learn so much from that.” Gašper Beguš, University of California, Berkeley

“You can choose whatever flavor of language you want,” says Beguš. “You can create a mixed language between Japanese and Esperanto, for example.”

“The goal is for the languages to be creative, so they should all be different from each other,” says Alper, who specializes in multimodal machine learning and computational linguistics. “You also want them to be consistent, because a language is like a system of rules, and those rules shouldn’t contradict each other.”

To evaluate diversity, the team measured how much the generated languages differed from one another across key linguistic features such as the basic word order used in sentences. To evaluate consistency, they checked whether translations into each invented language correctly followed that language’s own rules.

They compared languages generated by ConlangCrafter to languages created by general-purpose LLMs, such as Gemini-2.5-Pro. “Our full system can be about twice as diverse and almost 70 percent more consistent than simply prompting an LLM to invent a new language,” says Alper.

ConlangCrafter in Natural Language Processing

David Mortensen, an assistant research professor at the Language Technologies Institute at Carnegie Mellon University who was not involved in the work, says that ConlangCrafter could help natural language processing researchers better evaluate the ways in which the structure of a language affects the performance of a model.

“There is a substantial body of research that suggests that linguistic structure–both at training time and test time–does affect model performance,” he says. “Hypotheses in this area have been very hard to evaluate, however.” He adds that a tool such as ConlangCrafter could help facilitate experiments on the effects of factors such as language typology and lexicon in a scientifically sound and reliable way.

ConlangCrafter is available for free online. Its creators note that the system is currently limited in more complex linguistic dimensions such as semantics, contextual and conversational use of language, and the visual aspects of writing.

Beguš envisions expanding upon this research to study the Sapir-Whorf hypothesis, which suggests that the way we speak influences the way we think and perceive the world. For example, this could involve running simulations of different worlds, each with its own language, exploring its impact on societies. “That’ll be a nice next step,” he says.