惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Fox-IT International blog
Recent Announcements
Recent Announcements
D
Docker
IT之家
IT之家
B
Blog
Jina AI
Jina AI
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 【当耐特】
Google DeepMind News
Google DeepMind News
F
Fortinet All Blogs
量子位
C
Check Point Blog
Microsoft Azure Blog
Microsoft Azure Blog
罗磊的独立博客
博客园 - 司徒正美
李成银的技术随笔
美团技术团队
Blog — PlanetScale
Blog — PlanetScale
雷峰网
雷峰网
The GitHub Blog
The GitHub Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
J
Java Code Geeks
T
The Blog of Author Tim Ferriss
酷 壳 – CoolShell
酷 壳 – CoolShell
MongoDB | Blog
MongoDB | Blog
P
Proofpoint News Feed
L
LangChain Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Y
Y Combinator Blog
大猫的无限游戏
大猫的无限游戏
有赞技术团队
有赞技术团队
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
V
Visual Studio Blog
T
Tailwind CSS Blog
H
Help Net Security
Engineering at Meta
Engineering at Meta
小众软件
小众软件
B
Blog RSS Feed
Stack Overflow Blog
Stack Overflow Blog
月光博客
月光博客
M
Microsoft Research Blog - Microsoft Research
宝玉的分享
宝玉的分享
人人都是产品经理
人人都是产品经理
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
GbyAI
GbyAI
H
Hackread – Cybersecurity News, Data Breaches, AI and More
Last Week in AI
Last Week in AI
Martin Fowler
Martin Fowler
Stack Overflow Blog
Stack Overflow Blog

Snorkel AI

Building AI-Native Systems for Federal Infrastructure: A Conversation with Rezaur Rahman Code World Models and AutoHarness for LLM Agents Why coding agents need better data, evals, and environments Why coding agents need better data, evals, and environments Understanding Olmix: A Framework for Data Mixing Throughout Language Model Development Understanding Olmix: A Framework for Data Mixing Throughout Language Model Development Benchmarks should shape the frontier, not just measure it Benchmarks should shape the frontier, not just measure it Benchtalks #1: Alex Shaw (Terminal-Bench, Harbor) – Building the Benchmark Factory Building FinQA: An Open RL Environment for Financial Reasoning Agents How Tool Discipline Let a 4B Model Outsmart a 235B Giant on Financial Tasks Coding agents don’t need to be perfect, they need to recover Closing the Evaluation Gap in Agentic AI SlopCodeBench: Measuring Code Erosion as Agents Iterate Introducing the Snorkel Agentic Coding Benchmark 2026: The year of environments Part V: Future direction and emerging trends The self-critique paradox: Why AI verification fails where it’s needed most A chat with the Terminal-Bench team Intelligence per watt: A new metric for AI’s future Terminal-Bench 2.0: Raising the bar for AI agent evaluation Snorkeling in RL environments Introducing SnorkelSpatial Scaling trust: rubrics in Snorkel’s quality process Evaluating multi-agent systems in enterprise tool use Evaluating coding agent capabilities with Terminal-Bench: Snorkel’s role in building the next generation benchmark Parsing isn’t neutral: why evaluation choices matter The science of rubric design The right tool for the job: An A-Z of rubrics Data quality and rubrics: how to build trust in your models Building the benchmark: inside our agentic insurance underwriting dataset Evaluating AI agents for insurance underwriting LLM observability: key practices, tools, and challenges Data-centric development of an enterprise AI agent with Snorkel Building the data development platform for specialized AI LLM-as-a-judge for enterprises: evaluate model alignment at scale Why GenAI evaluation requires SME-in-the-loop for validation and trust Research spotlight: is long chain-of-thought structure all that matters when it comes to LLM reasoning distillation? Why enterprise GenAI evaluation requires fine-grained metrics to be insightful What is specialized GenAI evaluation, and why is it so critical to enterprise AI? LLM alignment techniques: 4 post-training approaches Research spotlight: Is intent analysis the key to unlocking more accurate LLM question answering? Why enterprises should embrace LLM distillation Retrieval-augmented generation (RAG) failure modes and how to fix them What is large language model (LLM) alignment? Databricks + Snorkel Flow: integrated, streamlined AI development How LLM evaluation drives better models in Snorkel Flow Unlock proprietary data with Snorkel Flow and Amazon SageMaker LLM evaluation in enterprise applications: a new era in ML Snorkel AI joins the AWS ISV Accelerate Program and launches Snorkel Flow Availability in AWS Marketplace AI data development: a guide for data science projects SnorkelCon 2024: Inaugural Snorkel AI user conference gathers leaders from 30+ Fortune 500 companies Snorkel Flow 2024.R3: Supercharge your AI development with enhanced data-centric workflows Explore the new GenAI Evaluation Suite: Snorkel 2024.R3 New NLP features in Snorkel Flow 2024.R3 Enterprise data compliance and security review: Snorkel Flow 2024.R3 How a global financial services company built a specialized AI copilot accurate enough for production Task Me Anything: innovating multimodal model benchmarks Alfred: Data labeling with foundation models and weak supervision RAG: LLM performance boost with retrieval-augmented generation Call center AI for customer experience management: a case study New GenAI features, data annotation: Snorkel Flow 2024.R2 How data slices transform enterprise LLM evaluation Meta’s Llama 3.1 405B is the new Mr. Miyagi, now what? Meta’s new Llama 3.1 models are here! Are you ready for it? Data-centric AI with Snorkel and MinIO Weak supervision for non-categorical applications + superalignment Snorkel AI signs strategic collaboration agreement with AWS to help enterprises cross the demo-to-production chasm AI alignment made simple: innovative solutions for businesses How does the Snorkel Flow label model work? Vision language models: how LLMs boost image classification Long context models in the enterprise: benchmarks and beyond How to build production-grade RAG retrieval with Snorkel Flow How Bonito helps fine-tune specialized LLMs faster than ever Walking safely before building flying saucer seatbelts: introducing Enterprise Alignment Role-based access controls in Snorkel Flow secure enterprise data Accelerating AI development in manufacturing with Snorkel Flow and AWS SageMaker How ROBOSHOT boosts zero-shot foundation model performance Discover what’s new in Snorkel Flow: Flexible data and LLM connectivity, secure data controls, and more! Faster than ever document intelligence with new Snorkel Flow FM-first workflow The art of data development for Enterprise LLMs Crossing the demo-to-production chasm with Snorkel Custom How Snorkel topped the AlpacaEval leaderboard (and why we’re not there anymore) CRFM’s HELM and enterprise LLM evaluation beyond accuracy How we achieved 89% accuracy on contract question answering Five sessions not to miss at Google Cloud Next 24 Content filtering breakthrough: Snorkel client reaches 96% recall in 3 days Here’s how Snorkel Flow + Google AI built an enterprise-ready model in a day Snorkel teams with Microsoft to showcase new AI research at NVIDIA GTC How Skill-it! enables faster, better LLM training Fine-tuned representation models boost LLM systems. Here’s how Enterprise GenAI to surge in 2024: survey results Large language model training: how three training phases shape LLMs LoRA: Low-Rank Adaptation for LLMs LLM distillation demystified: a complete guide Enterprises must shift their focus from models to data in AI development Insurance’s GenAI revolution: a business perspective Scaling human preferences in AI: Snorkel’s programmatic approach Building better enterprise AI: incorporating expert feedback in system development “Fall in love with your data”—Snorkel AI’s Enterprise LLM Summit
Anthropic Claude + AWS: revolutionizing pharma data analytics with Snorkel AI
2025-06-05 · via Snorkel AI

A leading pharmaceutical company has committed to double its revenue by 2030 and aims to fuel that growth, in part, with AI-powered data insights.

Seeking to build an AI system that could extract, analyze, and present insights from vast, complex datasets, the company partnered with Snorkel AI, Amazon Web Services (AWS), and Anthropic. The company sought the trustworthy results of Anthropic’s Claude models, the security and cost controls of Amazon Bedrock, and the ability to rapidly, expertly, and reliably curate training data provided by the Snorkel AI Data Platform—which integrates natively with AWS.

Combining these tools, the firm created an agentic AI co-pilot capable of better navigating its data, unlocking critical business insights, and driving decision-makers’ ability to identify opportunities and challenges across its operations.

Key Outcomes:

  • AI-ready data: Snorkel’s programmatic approach to data curation—labeling, sampling, filtering, and augmenting data—helps AI teams efficiently capture expert knowledge to build high quality datasets and iteratively build production-quality AI.
  • Accelerated AI development: Snorkel’s integration with Amazon SageMaker and Amazon Bedrock enabled rapid fine-tuning and deployment.
  • Enhanced data understanding: The resulting application, built on Anthropic’s Claude Sonnet model, empowered key decision-makers with up-to-date insights without requiring them to know how to code.

This collaboration demonstrates how integrated AI solutions can effectively address data complexity challenges and set a new standard for AI adoption in the pharmaceutical industry. It created tangible improvements in operational efficiency and business performance, supporting long-term growth objectives.

AWS and Amazon Bedrock

Amazon Bedrock offers a fully managed service that provides seamless access to leading foundation models, including Anthropic’s Claude series. This integration facilitates the development and deployment of generative AI (GenAI) applications without extensive setup or specialized infrastructure.

Advantages of AWS:

  • Scalability and performance: Bedrock’s robust infrastructure ensures that enterprises can scale their AI applications efficiently.
  • Cost-effective AI solutions: AWS’s managed services allow enterprises to optimize costs associated with deploying and maintaining AI applications.
  • Efficient fine-tuning: Snorkel integrates with SageMaker and Bedrock to orchestrate streamlined model fine-tuning to optimize AI performance.

The pharmaceutical giant chose Amazon Bedrock for several reasons. Bedrock provides a comprehensive, secure, and efficient platform for enterprises. It integrates seamlessly with Snorkel’s ai data development platform and allows companies to access and deploy Anthropic’s Claude models, which are aligned with their organizational goals of innovation, safety, and operational excellence.

A pharma company used Anthropic Claude + AWS + Snorkel AI to revolutionize pharma data analytics

Anthropic Claude integration

Anthropic’s Claude models, accessible via Amazon Bedrock, offer:

  • Advanced reasoning: Claude handles complex problem-solving tasks by integrating diverse data points for coherent conclusions.
  • Multimodal analysis: It interprets and analyzes visual data alongside text.
  • Code generation: Claude facilitates code creation using natural language descriptions, which helps generate queries for the pharmaceutical giant’s data management systems.
  • Multilingual support: Enables effective communication across global teams by supporting multiple languages.

Anthropic’s Constitutional AI approach underpins the Claude models with a principled framework aligned with human values. This reduces the risk of harmful or biased outputs—enhancing trust, reliability, and transparency.

Anthropic Claude + AWS: revolutionizing pharma data analytics with Snorkel AI

Claude Models overview

Anthropic offers three Claude models tailored to specific use cases:

  1. Claude Opus 4: Anthropic’s largest hybrid reasoning model, Opus excels in complex tasks requiring high accuracy and advanced language comprehension.
  2. Claude Sonnet 4: Balances capability and performance, making it suitable for general business applications such as coding assistance and enterprise deployments.
  3. Claude Haiku 3.5: Haiku is optimized for speed, cost-effectiveness, and agentic tool use, making it ideal for applications like customer support and content moderation.

Snorkel AI Data Platform capabilities

Snorkel’s AI data development platform accelerates the process of converting raw records into high-quality training data sets by 10-100x by combining:

  1. Programmatic data curation: Experts contribute logic that data scientists encode into labeling functions. The platform applies these labeling functions to the entire dataset, using Snorkel’s proprietary weak supervision algorithm to apply the most likely label when they conflict. This minimizes the manual effort required to build training data while improving label consistency and auditability.
  2. Guided error analysis: Snorkel’s guided error analysis helps users identify shortcomings within the training data for targeted improvement, facilitating iterative refinement. 
  3. Integration with enterprise infrastructure: Snorkel integrates seamlessly with enterprise cloud infrastructure, including AWS, ensuring scalability and security.
  4. On-board annotation suite: Snorkel’s integrated annotation suite enables SMEs to manually create additional labels where needed.

Snorkel’s tools and features empower enterprise data science teams to iteratively improve models until they reach production benchmarks—meeting the challenges of the pharmaceutical industry and many others.

Putting it all together

The pharmaceutical giant aimed to build an advanced AI system that could effectively query, visualize, and explain data accessible through its existing database tools and APIs. However, the team faced significant challenges, including a slow user acceptance testing (UAT) process. This hindered the collection of organic training data and slowed progress.

To overcome these challenges, Snorkel researchers collaborated with the pharmaceutical company to develop a process using Anthropic’s Claude models to programmatically generate, filter, curate, and evaluate synthetic UAT data. Additionally, Snorkel’s researchers helped distill a smaller guardrail model that could be cost-effectively deployed on Amazon Bedrock, ensuring robust pre- and post-production reporting and flagging potential errors in AI outputs.

Better together: Snorkel AI, Amazon Bedrock, and Anthropic Claude

This partnership represents a paradigm shift in how AI collaborations can drive business transformation. 

By combining Anthropic’s cutting-edge language models, Amazon Bedrock’s enterprise-grade deployment capabilities, and Snorkel AI’s powerful AI data development platform, the pharmaceutical company created an AI system that empowers decision-makers with rapid insights. 

Key collaborative benefits:

The partnership between Snorkel AI, AWS, and Anthropic yielded significant benefits for the pharmaceutical company, transforming its AI capabilities and operational efficiency.

  • Time savings: The company accelerated AI development by automating data labeling through Snorkel’s tools and leveraging synthetic data from Anthropic’s Claude models. This reduced manual annotation time and enabled faster deployment without compromising quality or compliance.
  • Accuracy improvements: Snorkel’s labeling functions and weak supervision enhanced model accuracy by ensuring high-quality training data.
  • Cost reductions: The partnership optimized costs by leveraging Amazon Bedrock’s scalable infrastructure.

This collaboration demonstrated how integrated AI solutions can effectively address AI challenges in the pharmaceutical industry, improving operational efficiency, model accuracy, and cost management.

Learn more about Snorkel + AWS

Snorkel AI and AWS unlock the power of AI by empowering some of the world’s leading companies to transform their data and knowledge into real-world business value. Snorkel’s platform is also available through the AWS Marketplace. If you would like to learn more about what Snorkel can do for your organization, book a demo today.

Shan Kandaswamy (AWS)

Shan is a Senior Partner Solutions Architect specializing in Generative AI at AWS, dedicated to solving complex customer challenges. He advocates for innovative AI solutions, distributed architecture, and serverless technologies, helping customers harness the power of Generative AI in their cloud journey.

Matt Casey

Matt Casey leads content production at Snorkel AI. In prior roles, Matt built machine learning models and data pipelines as a data scientist. As a journalist, he produced written and audio content for outlets including The Boston Globe and NPR affiliates.