Claude Science: Inside Anthropic’s AI Workbench for Scientists

A biology postdoc spends less time discovering things than most people assume. A bigger share of the week goes to reformatting a CSV so a genome browser will read it, remembering which of six databases has the right protein annotation, or waiting on a cluster job that failed three hours in because of a missing dependency. Anthropic’s answer to that problem, launched in beta on June 30, 2026, is Claude Science — a workbench that tries to fold literature search, data wrangling, compute orchestration, and figure generation into a single conversation with an AI agent.

This guide covers what Claude Science actually does, who it’s built for, how it fits alongside Claude Code and Claude Cowork, what it costs, and where it falls short — so you can decide whether it belongs in your lab’s workflow.

What Claude Science Is

Claude Science is not a new AI model. It runs on Anthropic’s existing Claude Opus 4.8, the same model available to any paying subscriber, with no specialized fine-tuning and no gated access model underneath it. What Anthropic built is the scaffolding around that model: a coordinating agent that can hand off work to specialist sub-agents, a reviewer agent that checks citations and numerical claims, connections to more than 60 scientific databases, and native rendering for the file types scientists actually work with — 3D protein structures, genome browser tracks, and chemical structures.

Anthropic frames the product as the scientific equivalent of Claude Code: a domain-specific agent harness layered on top of a general-purpose model, aimed at a job that involves a lot of repetitive tool-switching and validation work. Where Claude Code targets software engineers, Claude Science targets researchers working in genomics, single-cell biology, proteomics, structural biology, and cheminformatics, with biology and biomedical research as the initial focus.

Why Anthropic Built It Now

This isn’t Anthropic’s first move into life sciences. The company introduced Claude for Life Sciences the previous October — a set of connectors and skills that gave Claude access to scientific databases and tooling. Claude Science is a substantial expansion of that groundwork, arriving alongside a separate and more unusual announcement: Anthropic said it will run its own preclinical drug-discovery programs targeting neglected and rare diseases, partly as a way to stress-test the product on real research rather than only shipping it to outside labs.

The company’s reasoning tracks its own playbook. Anthropic’s leadership has argued that building strong tools for a domain requires doing real work in that domain — the same logic behind Claude Code being built by a company that also employs software engineers who use it daily.

How Claude Science Works

At the center of Claude Science is a hierarchical multi-agent setup. A coordinating agent takes a request in plain language, decides which specialist agents or connectors it needs, and manages the sequence of steps. A separate reviewer agent runs alongside this process, checking whether citations are traceable and whether calculations hold up, and correcting errors as they surface rather than leaving them for a human to catch during peer review.

Domain coverage on day one. The workbench ships pre-configured for genomics, single-cell analysis, proteomics, structural biology, and cheminformatics, with native connections to resources researchers already rely on — UniProt, the Protein Data Bank, Ensembl, Reactome, ClinVar, ChEMBL, and GEO among them, alongside journals and preprint servers. Rather than a scientist manually querying each database and reconciling schemas, the coordinating agent distributes that work across specialist sub-agents and synthesizes the result.

Model integration through BioNeMo. Claude Science uses NVIDIA’s BioNeMo Agent Toolkit to connect to life-sciences models scientists already trust, including Evo 2, Boltz-2, and OpenFold3, rather than asking labs to abandon tools they’ve already validated.

Native rendering of scientific artifacts. Because so much of scientific communication is visual, Claude Science displays proteins, molecular structures, and genome tracks directly, without a separate viewer. A researcher can ask the agent, in ordinary language, to adjust a figure — remove gridlines, switch an axis to a log scale — and the agent edits the underlying code itself.

Reproducibility by default. Every figure comes bundled with the exact code and computational environment that produced it, a plain-language explanation of the method, and the full message history behind it. That combination is meant to address a real and growing problem: AI-assisted writing has been linked to a rise in fabricated citations and unverifiable statistics in published papers, and a workbench that can’t show its work adds to that risk rather than reducing it.

Compute handling. Large scientific jobs — folding a protein, running a genomics pipeline — usually mean configuring a compute job, waiting, checking for failures, and pulling results back manually. Claude Science drafts a plan, asks before consuming new compute resources, and lets a researcher review or cancel any step before submission. It can submit jobs to a lab’s own high-performance computing cluster or to cloud compute, including credits provided through Modal for selected projects.

Who Claude Science Is For

Beginners and students get a single conversational entry point instead of having to learn PubMed’s search syntax, Jupyter, R, and a cluster terminal all before running a first real analysis.

Working researchers and postdocs get compression on tasks that used to take weeks. At the Allen Institute, a computational neuroscientist used Claude Science to build an automated literature-review pipeline; reviews that once took up to two years to compile can now exceed 100 pages of synthesized material in a fraction of that time, with the reviewer agent verifying citations along the way. At UCSF’s Brain Tumor Center, an epidemiologist reported that comprehensive germline analysis of glioma, previously a lengthy manual process, can now be completed dramatically faster.

Principal investigators and lab heads get a tool that manages compute costs and job orchestration on behalf of the team, plus a discounted Team plan aimed specifically at active academic and nonprofit labs, with eligibility verified through the lab’s principal investigator.

Pharma and biotech teams get a workbench purpose-built for drug discovery workflows — target identification, molecular design, and clinical data analysis — the same category of work Anthropic is testing on its own rare-disease programs.

Advanced practitioners working on hard, judgment-heavy problems — the kind where feedback loops are slow and biology is messy rather than clean — should treat Claude Science as a serious accelerant, not a replacement for expert judgment. A Harvard physicist working with Anthropic’s tools put the model’s performance on scientific tasks at roughly the level of a second-year graduate student, a comparison Anthropic itself has referenced as a calibration point rather than a marketing claim.

Claude Science vs. Alternatives

	Claude Science	Google DeepMind tools (e.g., AlphaFold lineage)	OpenAI’s biology-focused model	Building your own pipeline on the API
What it is	Agent workbench on top of a general model	Specialized scientific models	Domain fine-tuned model, limited access	Custom-built harness
Access	Beta, all paid Claude plans (Pro, Max, Team, Enterprise)	Varies by tool, often research-only	Restricted to vetted US enterprise customers	Open to any API user, but requires engineering time
Database integration	60+ pre-connected databases out of the box	Narrower, tool-specific	Not a general workbench	Manual, built by your team
Reproducibility tracking	Built in — code, environment, and history per output	Varies	Not disclosed publicly	Depends on what you build
Best fit	Labs wanting an integrated, low-setup research environment	Deep structural biology work in DeepMind’s specific domains	Enterprises already inside that vendor’s ecosystem	Teams with specific needs a general workbench won’t cover

The honest caveat here: independent benchmarking of AI models on real research tasks remains difficult, and even the best-performing specialized models on rigorous test suites have cleared only a minority of realistic research tasks. Claude Science’s advantage is less about raw model capability and more about removing the integration tax that keeps scientists from using AI on the tasks that matter.

Setting Up Claude Science: A Practical Checklist

Confirm your plan tier — Claude Science is in beta on macOS and Linux for Pro, Max, Team, and Enterprise plans
If you’re on a Team or Enterprise plan, have an admin enable Claude Science in organization settings; it isn’t on by default
Check whether your lab qualifies for the discounted academic/nonprofit Team plan, verified through your principal investigator
If you want compute credits, apply to the AI for Science program before the July 15, 2026 deadline — up to $30,000 in credits plus up to $2,000 in Modal compute for selected projects
Identify which of the 60+ integrated databases matter most for your work, and confirm your existing pipelines (BioNeMo models, cluster configuration) are compatible
Start with a low-stakes task — a literature synthesis or a figure regeneration — before routing critical-path analysis through the agent
Establish a review habit: treat the reviewer agent’s citation and calculation checks as a first pass, not a replacement for your own verification before publication

Common Mistakes and Recommendations

Treating it as a black box. The entire design point of Claude Science is traceability — every output links back to the code and environment that made it. Skipping that trail defeats the purpose and reintroduces the reproducibility risk the tool is meant to reduce.

Assuming domain coverage is universal. The initial rollout leans heavily toward biology and biomedical research. Researchers in physics, materials science, or social science should expect a thinner set of pre-built connectors and plan accordingly.

Skipping human verification on judgment calls. Independent estimates comparing model performance to a second-year graduate student are a useful sanity check, not a guarantee. Novel hypotheses, ambiguous data, and high-stakes clinical conclusions still need expert review.

Ignoring compute costs. Because Claude Science can submit jobs to cloud compute or a cluster, an unsupervised agent with a broad mandate can rack up real costs. The built-in step where the agent asks before reaching new resources is there for a reason — don’t disable that habit even when it feels slow.

Where This Is Heading

Anthropic’s own framing suggests Claude Science is a foundation rather than a finished product: broader domain coverage beyond biology, deeper integration with lab hardware and cluster environments, and continued investment in the reviewer-agent concept as a check against AI-introduced errors in the scientific record. The competitive picture is also moving quickly, with other major AI labs pursuing their own domain-specific science tools and benchmarks, so the comparison table above is a snapshot rather than a permanent ranking. Expect capability and access details to keep shifting through the rest of 2026.

Key Takeaways

Claude Science is an agent-based workbench, not a new model — it runs on Claude Opus 4.8 with domain-specific tooling layered on top
It’s aimed first at biology, genomics, proteomics, structural biology, and cheminformatics, with 60+ integrated databases
Reproducibility is a core design feature: every output ships with its code, environment, and message history
It’s in beta on macOS and Linux, available to Pro, Max, Team, and Enterprise subscribers, with admin enablement required on Team/Enterprise
Grants of up to $30,000 in credits are available to qualifying research projects through July 15, 2026
It compresses time-consuming tasks like literature synthesis and genomic analysis, but still requires expert oversight on judgment-heavy science

FAQ

Is Claude Science a separate AI model from Claude? No. It runs on Claude Opus 4.8, the same model available to paying subscribers, with an agent framework and database connections built around it rather than a specialized model underneath.

Which plans include Claude Science? It’s in beta for Pro, Max, Team, and Enterprise plans on macOS and Linux. Team and Enterprise admins need to enable it for their organization before members can use it.

Does Claude Science replace Claude Code or Claude Cowork? No. Anthropic has said it’s designed to complement those tools, not replace them — scientists still write code, and Claude Science adds compute orchestration, database access, and scientific visualization on top of that workflow.

What scientific fields does it currently support best? Genomics, single-cell analysis, proteomics, structural biology, and cheminformatics are supported out of the box, reflecting the initial focus on biology and biomedical research.

Can academic labs get a discount? Yes. Anthropic offers a discounted Team plan for active scientific labs at academic institutions and nonprofit research organizations, with eligibility tied to the lab’s principal investigator.

How does Claude Science handle reproducibility? Every figure or result comes with the exact code and computational environment used to produce it, a plain-language explanation of the method, and the full conversation history, so results can be traced and re-verified later.

Is there funding available for research projects using Claude Science? Anthropic’s AI for Science program is backing up to 50 projects with as much as $30,000 in credits each, with an additional $2,000 in Modal compute for select projects. Applications close July 15, 2026, with awards announced by July 31.

How reliable is Claude Science for high-stakes conclusions? Treat it as a strong research assistant, not a final authority. Independent assessments have compared its performance on scientific tasks to roughly a second-year graduate student — useful, but not a substitute for expert review before publication or clinical use.

Does Claude Science work with existing lab compute infrastructure? Yes. It can submit jobs to a lab’s own high-performance computing cluster or to cloud compute on demand, and it asks for approval before consuming new compute resources.

How does Claude Science compare to specialized biology models from other AI labs? Competing approaches include highly specialized fine-tuned models with restricted access versus Claude Science’s strategy of pairing a general-purpose model with a broad integration layer. Neither approach has a settled performance edge yet, and benchmarking in this space is still evolving.

Visit:

Claude Science: Anthropic’s AI Workbench for Researchers (49 chars)