惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Martin Fowler
Martin Fowler
SecWiki News
SecWiki News
Y
Y Combinator Blog
博客园 - 叶小钗
Stack Overflow Blog
Stack Overflow Blog
Recent Announcements
Recent Announcements
P
Proofpoint News Feed
aimingoo的专栏
aimingoo的专栏
T
The Blog of Author Tim Ferriss
宝玉的分享
宝玉的分享
T
Tailwind CSS Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
美团技术团队
D
DataBreaches.Net
人人都是产品经理
人人都是产品经理
Last Week in AI
Last Week in AI
Microsoft Azure Blog
Microsoft Azure Blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
The Cloudflare Blog
博客园 - 司徒正美
The Register - Security
The Register - Security
Engineering at Meta
Engineering at Meta
B
Blog
大猫的无限游戏
大猫的无限游戏
M
MIT News - Artificial intelligence
Vercel News
Vercel News
T
Threat Research - Cisco Blogs
T
The Exploit Database - CXSecurity.com
Latest news
Latest news
腾讯CDC
Blog — PlanetScale
Blog — PlanetScale
H
Hacker News: Front Page
Google DeepMind News
Google DeepMind News
Help Net Security
Help Net Security
Recorded Future
Recorded Future
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
T
Tenable Blog
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
F
Full Disclosure
P
Palo Alto Networks Blog
H
Heimdal Security Blog
O
OpenAI News
Hacker News - Newest:
Hacker News - Newest: "LLM"
C
Cisco Blogs
罗磊的独立博客
L
LINUX DO - 热门话题
Google DeepMind News
Google DeepMind News
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Cloudbric
Cloudbric

OfficeChai

These Are The 10 Cheapest AI Models In The World [June 2026] 18 Best AI Tools For English Speaking (With Examples) [2026] AI Impact? Vacancy Rates For US Office Properties Are Now Highest Since The 2008 Crisis KPMG Pulls Report Praising AI After It Was Found To Have Fake AI-Generated Citations India's Sarvam Raises $234 Million At $1.5 Billion Valuation After SpaceX Stock Pops 20%, Musk Has Made More Money In The Last 24 Hours Than Warren Buffett Made In His Entire Career OfficeChai Nobody Is Using AI Better Than Meta: NVIDIA CEO Jensen Huang 21 Best AI Tools For Animation (With Examples) [2026] 22 Best AI Tools For Architecture (With Examples) [2026] Datacenter Construction Spending Has Eclipsed Public Transportation Spending In The US China Scraps 12,000 Degree Courses, Mainly In Arts And Humanities, To Prepare For AI Age OfficeChai There Is No Job Loss With AI: David Friedberg Loop Between Human Capital And "Token Capital" Will Be The New IP For Firms, Says Satya Nadella How to Reduce Dependency on Key Employees 8 Google Index Checker Use Cases Beyond New Blog Posts Memory Squeeze? Smartphone Purchases Are Down Globally 21 Best AI Tools For Accounting (With Examples) [2026] AI For Voice Generation: 22 Best Options (With Examples) [2026] These Are The Most Popular Image Generation Models On OpenRouter [June 2026] Search Traffic For Websites Is Down 25% Over The Last Year Because Of AI: a16z Data Agentic Coding Has Led To A 50% Increase In Number Of Apps, But Most Are Finding Very Few Users: SimilarWeb Data OpenRouter Launches Fusion API, Which Uses A Combination Of Models To Achieve Fable-Like Performance At Half The Price Dario Amodei Refused To De-Deploy Or Fix Vulnerabilities In Fable Before US Export Controls, Says David Sacks 23 Best AI Tools For Notes Making (With Examples) [2026] 16 Best AI Tools For Astrology (With Examples) [2026] How Jensen Huang Once Had To Ask SEGA's CEO To Pay NVIDIA For A Technology That Didn't Work ChatGPT Already Has 11% Of The Search Market: OpenAI CFO Sarah Friar SpaceX Has Now Launched More Satellites Than Rest Of Humanity Combined Across History Globalization Is Dead, Time For India To Wake Up Says Sridhar Vembu After US Bans Anthropic Mythos And Fable Models For Foreign Users Elon Musk Becomes World's First Trillionaire After Record SpaceX IPO Anthropic Suspends Access To Mythos And Fable Models Following US Govt Directive Against Foreign Users 27 Best AI Tools For Market Research (With Examples) [2026] Why Jeff Bezos Makes Important Decisions Early in The Morning Education And Healthcare IT Have Been The Hardest Areas To Invest In: Peter Thiel Giving AI Long-Term Goals Could Lead To The Emergence Of Self-Preservation: Geoffrey Hinton Your Startup Doesn't Have a Hardware Problem. It Has an Accountability Problem Cyber Incidents Rarely Start With a Hacker: The Weak Links Businesses Overlook What Makes an App Worth Returning to Every Day? 21 Best AI Tools For Lead Generation (With Examples) [2026] How NBA Player Shaquille O'Neal Became An Early Investor In Ring AI For Kids Learning: 22 Best Options (With Examples) [2026] These Are The Most Popular AI Model Companies On OpenRouter [June 2026] Advanced Fintech and NeoBank Software Development Solutions: Building the Digital Banks of Tomorrow TRON Payments: Integrating AML Checks Into Business Workflows 18 Best AI Tools For Resume (With Examples) [2026] 16 Best AI Tools For UI Design (With Examples) [2026] These Are Top 10 Countries Generating The Most Internet Traffic How to Choose the Best Magento Agency for Your Store These Are The Best AI Models For Creative Writing [June 2026] AI For Managers: 28 Best Tools (With Examples) [2026] 17 AI Tools For Trading (With Examples) [2026] AI Has Led To An Explosion Of New Apps, But Nearly None Have Managed To Garner Significant Usage Cloudflare CEO Matthew Prince Says Vinod Khosla Asked Him To Fire His Co-founders For Him To Invest In His Company Australia’s AirTrunk To Invest $30 Billion To Develop Datacenters In India Anthropic Says That Their Employees Are Using AI To Write 8x More Code Compared To 18 Months Ago Anthropic Is Extremely Expensive, Many Are Urgently Looking For Alternatives: Microsoft AI CEO Mustafa Suleyman Sergey Tokarev on creating DIY “Beehives” and a free guidebook AI Crypto Price Prediction: How Accurate Are Machine Learning Models? Why Anthropic Could Find It Hard To Maintain Its $965 Billion Valuation Startup CEO Says They're Saving "Millions Of Dollars" By Replacing Anthropic Models With DeepSeek Ola Cabs' Valuation Falls 99% From Peak, Now Valued At Just $70 Million By Vanguard After TCS Case, Former Wipro Employee Alleges Attempt At Religious Conversion By Coworkers Bot Traffic Has Surpassed Human Traffic On The Internet For The First Time In History, Clouflare Says ChatGPT's Free Users Do 7 Queries Per Day, Those On $20 Plan Do 3x More: CFO Sarah Friar How Keith Rabois Had Been "Highly Skeptical" In 2023 That Anthropic Would Be Worth More Than $5 Billion In 10 Years How to Install AdGuard Home with Docker Step by Step We're Running Out Of Training Data, But Not Too Worried Because There Are Alternate Approaches: Google's Jeff Dean JioHotstar Is Hiring For 75 AI Roles Amid AI Content Push NVIDIA's Nemotron 3 Becomes Most Intelligent Open Weights Model From The US Hackers Allegedly Fooled Meta's AI To Take Over Accounts By Simply Asking It To Change User Emails Manchester Super Giants' AI Promotional Video Gets Panned As "Slop" For Glaring Cricketing Errors AI Reducing Jobs Is "Complete Nonsense": NVIDIA CEO Jensen Huang MiniMax Releases MiniMax M3, Is Competitive With Frontier Models On Many Benchmarks IIT Delhi-Incubated BotLab Dynamics Lights Up Skies With Lord Shiva Themed Drone Show During IPL Final NVIDIA Introduces RTX Spark, A New Chip Optimized For AI Agents For Windows Laptops And PCs NVIDIA Introduces Vera, A New CPU Chip For AI Agents That Is 80% Faster Than x86 CPUs OpenAI's Codex Reaches 5 Million Users, Resets Rate Limits For Users Key Factors That Influence Personal Loan Approval in India AI Is Allowing Me To Experiment And Try Crazier Things: Mathematician Terrance Tao Efficiency Of Human Learning Is Still A Thousand Times Better Than LLM Learning, Need Algorithmic Advances To Improve It: Jeff Dean San Francisco Home's Zillow Listing Says It'll Accept OpenAI Or Anthropic Stock As Payment Open-Source Models Currently Lag Proprietary Models By Just 4 Months: Epoch AI Self-Improvement Possible In AI Models Within A Year, Say Google's Top AI Leaders Digital Minds: Preparing for a Moral Challenge Before It Arrives Nearly 30% Of US-Based Y-Combinator Founders Are Of Indian Origin: SF Chronicle Data "A New Era Of PC": NVIDIA, Microsoft Windows Tease New Collaboration At Least 146,000 AI Hallucinated Citations In Papers Published In 2025, Finds Paper AI Doesn't Undergo Experiences, Has No Moral Conscience: Pope Leo XIV Claude Opus 4.8 Tops Artificial Analysis Intelligence Index, Edges Out GPT 5.5 With Score Of 61.4 Anthropic Says Its Annual Revenue Run-rate Has Now Touched $47 Billion Anthropic Raises $65 Billion At $965 Billion Valuation, Is Now Worth More Than OpenAI Claude Opus 4.8 Is Better Than Opus 4.7 But Not As Good As Mythos Preview, Says Anthropic Claude Opus 4.8 Beats GPT 5.5 On GDPval-AA Benchmark For Real World Tasks Anthropic Releases Claude Opus 4.8, Beats Opus 4.7, GPT-5.5 On Many Benchmarks GTM for Tech Startups Explained How to Use an AI Picture Generator to Create Professional Images Anthropic Is Now Generating 35% More Revenue Than OpenAI: The Information SK Hynix, Micron Join $1 Trillion Club Following AI-Led Memory Shortages
Law Professors Prefer AI Answers Over Those Of Their Peers, Finds Study
OfficeChai Team · 2026-06-17 · via OfficeChai

AI is fast eclipsing the abilities of the top people in some of the highest-paid professions.

A new study by researchers from Stanford and other leading U.S. law schools has delivered striking evidence of this shift in one of the most demanding professional domains: legal education. Titled “Law Professors Prefer AI Over Peer Answers,” the paper finds that when law professors were asked to blindly choose between short-answer responses written by their colleagues and those generated by large language models (LLMs), they overwhelmingly preferred the AI versions.

The study, published May 27, 2026, involved sixteen contracts law professors from fourteen U.S. law schools who all teach from the same casebook. Participants first created 40 representative office-hours-style questions across categories like case recall, doctrine, hypotheticals, and policy. They then wrote their own answers and judged 2,918 anonymized pairwise comparisons between human and LLM responses.

Clear Preference for AI

Professors rated responses from Google’s Gemini 2.5 Pro at a 75.92% win rate against human instructors, while NotebookLM (a retrieval-augmented version grounded in the casebook) achieved 74.75%. The models performed on par with the strongest human participants, and in some analyses, even outperformed every instructor. Every single judge in the study preferred LLM answers over peer responses on average, with a median LLM-preference rate of 75.81%.

Notably, the advantage held across all question types—including complex hypotheticals and policy questions that require nuanced judgment rather than rote recall. AI responses were also flagged as pedagogically harmful far less often (3.53% pooled rate) compared to professor-written answers (12.06% average).

The researchers went further by engineering textual features such as length, clarity, structure, and pedagogical support to test whether surface-level polish explained the results. It didn’t. LLMs consistently outperformed predictions based on these features alone, suggesting the advantage stems from substantive reasoning quality.

Shared Professional Standards

To determine whether this reflected genuine alignment with expert standards or mere stylistic appeal, the team analyzed inter-judge agreement on overlapping trials. Agreement exceeded what would be expected from purely idiosyncratic preferences, indicating that LLMs were capturing latent professional norms that the professors themselves endorse.

Using an “LLM-as-judge” framework validated against human evaluators, the researchers extended the ranking to newer models. Claude Opus 4.7 topped the list, followed by other frontier systems. All outperformed human instructors. Reasoning-focused variants, such as Gemini 2.5 Flash with thinking budget, significantly outperformed non-reasoning counterparts.

Implications for Legal Education and Beyond

The findings challenge assumptions about AI’s limitations in high-judgment fields. While many prior evaluations focused on objective accuracy, this study tested AI against the subjective but shared standards of expert practitioners—the very essence of legal training.

For law schools facing instructor capacity constraints, the results point to a practical opportunity: always-available AI tutors that can deliver high-quality short answers aligned with professional expectations. The authors suggest implementations with clear guardrails, citations to source material, and escalation paths to human faculty.

The paper also highlights a curious detail: stock Gemini 2.5 Pro often outperformed RAG-grounded variants (including a commercial AI tutor built on the same base model), raising questions about context dilution in long-document retrieval.

As AI capabilities continue to advance rapidly, this research underscores a broader trend. In domains where success depends on reasoned judgment rather than single ground truths, frontier models are not just matching experts—they are frequently preferred by them. For businesses, technologists, and educators, the message is clear: the integration of AI into professional knowledge work is accelerating, even in fields long considered resistant to automation.