HomeBlogContact
Link FiveLink SixLink Seven
OverviewHow it worksTool LibraryOn-Prem & SecurityRequest a demo ↗
Ricochet Scientist Assistant Agent · For Staff Scientists

One agent.
Every tool you trust.

Assemble your dream research agent à la carte — from the best open-source scientific tools, wrapped as skills, connected over MCP, and run inside your own walls. We're the orchestration layer and the science expertise. The ecosystem does the rest.

Request a demo ↗Browse the tool library
SCROLL50+ VETTED OSS TOOLS
scientist.assistant
DR. REYES · STAFF SCIENTIST
Across the last 5 years of literature, which solid-electrolyte interphase additives push Li-metal anode coulombic efficiency above 99%? Summarize the mechanisms and cite every source.
SCIENTIST ASSISTANT
On it — running a grounded literature review with in-text citations.
⚙paperqa2lit-ragRUNNING
✓retrieved 1,284 candidate papers
✓parsed & embedded 312 full texts
▸synthesizing grounded answer
WORKING · 03:31transport: MCP
Two decades delivering across
National Labsbuilt first for
On-Premor air-gapped
MCPopen standard
Open Sourcevetted & wrapped

We don't reinvent the wheel. We connect them.

See how it works

The open-source world has already built extraordinary scientific tools — for literature, chemistry, materials, biology, simulation, and more. Each has its own community driving support, innovation, and development.

The Scientist Assistant Agent gives you one simple, governed way to make them all work together. Each tool stays current with its own community — so your agent grows as the field grows, and never goes obsolete.

50+
Vetted open-source tools, ready to wire in as skills.
1
Agent and web UI on top of the whole web of tools.
MCP
Open standard connecting every skill — no proprietary lock-in.
0
Wheels reinvented. The ecosystem does what it does best.
PaperQA2DoclingRDKitpymatgenBiopythonSciPyscanpySnakemake
How it works

Declare. Connect.
Orchestrate. Grow.

A base framework that lets you declare the tools and agents you want in your environment — each wrapped in a uniform API so the Scientist Assistant Agent can call it as a skill. Build the agent you need; expand it whenever the field moves.

01

Declare your stack

Pick the tools your science needs from the vetted library. Your environment is unique to you — nothing you don't want, nothing you can't audit.

À LA CARTE
02

We wrap & connect

Each tool is wrapped in a uniform API and exposed to the agent as a skill — all connected over MCP, the open standard for tool use.

MCP TRANSPORT
03

One agent orchestrates

The Scientist Assistant Agent routes each request to the right tool, chains them together, and returns a grounded answer through a single web UI.

ORCHESTRATION LAYER
04

It grows with the field

Each tool evolves with its own community. Add new skills as the space evolves — your agent adapts instead of going obsolete.

NEVER OBSOLETE
The Tool Library · vetted open-source packages, by workflow

Your à la carte
skill catalog.

The open-source ecosystem a staff scientist can wire into their agent — grouped by research workflow. Every package is real, actively maintained, and backed by its own community. We vet, wrap, and connect each one; anything not yet on the shelf, we integrate.

01

Literature Search, Retrieval & Synthesis

Find, retrieve, and synthesize the literature — with citations you can trust.

6 TOOLS

PaperQA2

RAG

Answer questions over your paper corpus with grounded, in-text citations.

Future-House/paper-qa

Ai2 Scholar QA

SYNTHESIS

Multi-paper literature reviews with evidence and comparison tables.

allenai/ai2-scholarqa-lib

OpenScholar

SEARCH

Grounded scientific Q&A retrieved over 45M+ open-access papers.

AkariAsai/OpenScholar

PyAlex

METADATA

Search and aggregate scholarly metadata across 270M+ works via OpenAlex.

J535D165/pyalex

pyzotero

REFERENCES

Read and write a researcher’s Zotero reference library.

urschrei/pyzotero

arxiv.py

PREPRINTS

Search arXiv and fetch paper metadata and PDFs.

lukasschwab/arxiv.py
02

Document & Data Extraction

Turn PDFs, scans, and reports into clean, structured, machine-readable text.

8 TOOLS

Docling

PARSE

Convert PDF, DOCX, PPTX, and HTML into structured Markdown and JSON.

docling-project/docling

Marker

CONVERT

Fast, high-fidelity PDF→Markdown, strong on math and tables.

datalab-to/marker

GROBID

EXTRACT

Pull headers, references, and full text from scholarly PDFs as TEI/XML.

grobidOrg/grobid

Unstructured

INGEST

Partition 25+ document types into clean, LLM-ready chunks.

Unstructured-IO/unstructured

Nougat

OCR

Transformer OCR for academic PDFs, including equations.

facebookresearch/nougat

PyMuPDF

EXTRACT

Fast text, image, and table extraction and PDF rendering.

pymupdf/PyMuPDF

pdfplumber

TABLES

Extract text and tables with exact layout coordinates.

jsvine/pdfplumber

Camelot

TABLES

Extract tables from text-based PDFs straight to DataFrames.

camelot-dev/camelot
03

Data Analysis, Stats & Data Science

Load, query, model, and reason over tabular and experimental data.

10 TOOLS

pandas

DATAFRAMES

Load, transform, and aggregate tabular data.

pandas-dev/pandas

Polars

DATAFRAMES

High-performance columnar DataFrames for large data.

pola-rs/polars

DuckDB

SQL

Run fast analytical SQL over files and DataFrames in-process.

duckdb/duckdb

SciPy

NUMERICS

Fit, optimize, and transform numerical data.

scipy/scipy

statsmodels

STATS

Fit statistical models and return rigorous diagnostics.

statsmodels/statsmodels

scikit-learn

ML

Train and evaluate classical machine-learning models.

scikit-learn/scikit-learn

papermill

NOTEBOOKS

Parameterize and execute analysis notebooks programmatically.

nteract/papermill

nbconvert

NOTEBOOKS

Execute notebooks in a kernel and export results.

jupyter/nbconvert

Data Formulator

EXPLORE

AI-driven, natural-language data exploration and charting.

microsoft/data-formulator

PandasAI

NL QUERY

Ask questions of a DataFrame in plain English.

sinaptik-ai/pandas-ai
04

Scientific Computing & Numerical

Arrays, symbolic math, autodiff, and PDE solvers — the computational core.

6 TOOLS

NumPy

ARRAYS

The foundation for numerical array computation.

numpy/numpy

SymPy

SYMBOLIC

Symbolic algebra, calculus, and equation solving.

sympy/sympy

JAX

AUTODIFF

Differentiable, accelerated array computing on GPU/TPU.

jax-ml/jax

DeepXDE

PINNs

Solve forward and inverse PDEs with physics-informed neural nets.

lululxvi/deepxde

FEniCSx

FEM

Solve partial differential equations by finite elements.

FEniCS/dolfinx

Astropy

ASTRONOMY

Coordinates, units, FITS, tables, and cosmology.

astropy/astropy
05

Chemistry & Materials

Cheminformatics, atomistic simulation, and the materials-discovery toolchain.

9 TOOLS

RDKit

CHEM

Compute molecular properties and handle SMILES.

rdkit/rdkit

Open Babel

CHEM

Interconvert 100+ chemical file formats.

openbabel/openbabel

ASE

SIMULATION

Set up, run, and analyze atomistic simulations.

DTU GitLab - ase

pymatgen

MATERIALS

Materials structure analysis; query the Materials Project.

materialsproject/pymatgen

MDAnalysis

MD

Analyze molecular-dynamics trajectories.

MDAnalysis/mdanalysis

OpenMM

SIMULATION

Run GPU-accelerated molecular dynamics.

openmm/openmm

PySCF

QUANTUM

Run quantum-chemistry electronic-structure calculations.

pyscf/pyscf

matminer

MATERIALS

Featurize materials data for machine learning.

hackingmaterials/matminer

ChemCrow

AGENT

An LLM agent that reasons with chemistry tools.

ur-whitelab/chemcrow-public
06

Biology & Bioinformatics

From sequences and single cells to protein-structure prediction.

6 TOOLS

Biopython

BIO

Sequence I/O, BLAST, and NCBI / PubMed access.

biopython/biopython

Biomni

AGENT

A general-purpose biomedical research agent.

snap-stanford/Biomni

scanpy

SINGLE-CELL

Analyze single-cell RNA-seq and annotate cell types.

scverse/scanpy

pysam

GENOMICS

Read and manipulate sequencing alignments (BAM / VCF).

pysam-developers/pysam

ColabFold

STRUCTURE

Fast AlphaFold2 protein-structure prediction.

sokrypton/ColabFold

OpenFold

STRUCTURE

Trainable PyTorch reproduction of AlphaFold2.

aqlaboratory/openfold
07

Physics, Earth & Neuro

Astronomy, climate, seismology, and neurophysiology — each in its native format.

4 TOOLS

MNE-Python

NEURO

Analyze EEG / MEG neuro time-series.

mne-tools/mne-python

xarray

GEO

Work with NetCDF and labeled, gridded N-D data.

pydata/xarray

ObsPy

SEISMOLOGY

Fetch and process seismic waveform data.

obspy/obspy

yt

SIMULATION

Analyze and visualize 3D volumetric simulation data.

yt-project/yt
08

Workflow, Orchestration & Reproducibility

Make it reproducible, scale it across the cluster, and track every run.

8 TOOLS

Snakemake

WORKFLOW

Define and run reproducible HPC / cloud pipelines.

snakemake/snakemake

Nextflow

WORKFLOW

Run portable, container-native bioinformatics pipelines.

nextflow-io/nextflow

MLflow

TRACKING

Log experiments, parameters, and models.

mlflow/mlflow

DVC

VERSIONING

Version datasets and models; reproduce pipeline stages.

iterative/dvc

Prefect

ORCHESTRATION

Orchestrate dependent tasks with retries and scheduling.

PrefectHQ/prefect

Dagster

DATA ASSETS

Build typed, observable data-asset pipelines.

dagster-io/dagster

Apache Airflow

SCHEDULER

Schedule and monitor batch DAG workflows.

apache/airflow

Hydra

CONFIG

Compose and sweep experiment configurations.

facebookresearch/hydra
09

Visualization & Reporting

Turn results into publication-quality figures, dashboards, and reports.

7 TOOLS

Matplotlib

PLOTTING

Generate publication-quality plots.

matplotlib/matplotlib

Plotly

INTERACTIVE

Generate interactive, shareable web charts.

plotly/plotly.py

Bokeh

INTERACTIVE

Build interactive browser visualizations and dashboards.

bokeh/bokeh

seaborn

STATS VIZ

Make statistical plots quickly with great defaults.

mwaskom/seaborn

Streamlit

WEB APP

Expose a result as an interactive web app.

streamlit/streamlit

Gradio

UI

Wrap a model or tool in a UI and API endpoint.

gradio-app/gradio

Quarto

REPORTING

Render reproducible scientific reports to PDF / HTML.

quarto-dev/quarto-cli
10

Orchestration Glue & MCP

The plumbing: a uniform API, model serving, and the MCP standard that connects it all.

12 TOOLS

MCP Python SDK

MCP

Expose any tool to the agent over the open MCP standard.

modelcontextprotocol/python-sdk

FastMCP

MCP

Turn any Python function into an MCP skill.

jlowin/fastmcp

FastAPI

API

Wrap a tool behind a uniform, async HTTP API.

fastapi/fastapi

LangChain

FRAMEWORK

Tool-calling and agent glue.

langchain-ai/langchain

LangGraph

ORCHESTRATION

Stateful, multi-agent orchestration graphs.

langchain-ai/langgraph

LlamaIndex

RAG

Build retrieval and data agents.

run-llama/llama_index

vLLM

SERVING

Serve local models behind an OpenAI-compatible API.

vllm-project/vllm

Ollama

LOCAL LLM

Run local models for offline or air-gapped use.

ollama/ollama

Ray

DISTRIBUTED

Scale tool execution across a cluster.

ray-project/ray

BioMCP

MCP

Query biomedical databases (PubMed, trials, variants) over MCP.

genomoncology/biomcp

arXiv MCP

MCP

arXiv search, download, and analysis as an MCP tool.

blazickjp/arxiv-mcp-server

mcp.science

MCP

Ready-made scientific MCP servers (Materials Project, DFT, Jupyter).

pathintegral-institute/mcp.science
11

Code Execution & Sandboxing

Run agent-generated code safely, in isolation.

2 TOOLS

E2B

SANDBOX

Safely execute agent-generated code in secure sandboxes.

e2b-dev/code-interpreter

Open Interpreter

EXECUTE

Run tasks via natural-language code execution.

openinterpreter/open-interpreter

Don't see a tool you rely on? That's what our custom integration service is for — we'll vet it, wrap it, and connect it to your agent.

We're not another
walled garden —
we're the layer between.

Other tools can orchestrate. Our difference is singular: we focus on science and scientists. We know your environment, your review processes, and what "trustworthy" has to mean in a lab.

Why the Scientist Assistant Agent

Built by people who
know scientists.

01

Science-native

The tools are curated for research, not generic business workflows. We know what a staff scientist actually needs on the bench.

02

Yours, on your terms

Every environment is unique to the scientist and can run fully on-prem — including air-gapped networks. Your data never has to leave.

03

Vetted, not just bundled

Every tool in a stack has been reviewed by our team for fit, quality, and security — so IT can trust what's running.

04

Never obsolete

Because each tool evolves with its own community, your agent keeps improving as the field does. No single vendor to outpace it.

05

Connected by MCP

Skills speak the open Model Context Protocol — an interoperable standard, not a proprietary format you're locked into.

06

Custom integration

Need something niche or in-house? Our custom integration service vets, wraps, and connects it to your agent.

For IT & Security

Deployable inside
your walls.

Designed to pass review at a national lab first — and to hold up anywhere security and provenance matter.

On-prem & air-gapped

Runs entirely on lab-controlled infrastructure. No dependency on outside services for sensitive work.

You own the keys & data

Bring-your-own-key and full data residency. Your data and credentials stay under your control.

Open standard transport

Tools connect over MCP — interoperable and inspectable, with no proprietary lock-in.

Auditable & licensed

Every tool is open source with a known license and community. What runs is transparent and reviewable.

Deployment profile

The agent is the thin part.

We are the orchestration layer and web UI. The heavy lifting is done by mature open-source tools, each maintained by its own community — so there's less bespoke code for your team to trust, and more battle-tested software underneath.

DeploymentOn-prem · air-gapped
TransportMCP (open standard)
ToolsOpen source · vetted
Keys & dataCustomer-owned
IntegrationCustom service available
Casey Cobb · CEO
Who's behind it

We know scientists.
That's the whole point.

Ricochet is led by Casey Cobb — CEO, Berkeley Lab Affiliate, conference keynote speaker, and author of Flipping the Script. For close to two decades, Ricochet has helped science organizations turn information chaos into well-managed, integrated systems.

The Scientist Assistant Agent is that philosophy applied to the AI era: don't reinvent the wheel, connect the best of what exists, and wrap it in the judgment and governance a lab can actually trust.

— Let's assemble the agent your science deserves. Together.

Ready to assemble your agent?

Tell us about your science and your environment. We'll show you the stack we'd build — and how it runs inside your walls.

Browse the libraryRequest a Demo ↗
Join our newsletter for up-to-date industry insights.

By subscribing you agree to with our Privacy Policy and provide consent to receive updates from our Ricochet.

Resources
BlogTerms & Privacy
Get in touch
Contact us
LinkedIn
© 2024 Ricochet. All rights reserved.