QuantLens Get Started
Healthcare & Life Sciences

BioTrials Clinical Catalyst

559K trials, 1,515 tickers, 168K labeled catalyst events with returns. The clinical trials–to–equity bridge for biotech catalyst modeling.

1999–2025 coverage · 48K+ sponsors · FDA application links · Patent cross-references · Trial-level outcome labels.

Note: BioTrials Pro includes 168,131 labeled catalyst events joined to forward equity returns and volatility metrics.

559K
Clinical Trials
1,515
Mapped Tickers
168K
Catalyst Events
26
Years History

Domain

Healthcare & Life Sciences

Scale

~560k trials, 1.5k tickers

Updated

Daily (Last: 2025-12-06)

Deep Dive

Why this dataset matters for biotech investors

BioTrials Clinical Catalyst is an end-to-end dataset linking the global clinical trials universe to public biotech and pharma equities. It covers 558,973 trials (1999–2025), 48K+ sponsors, and 1,515 mapped public tickers, enriched with FDA application links, patent cross-references, therapeutic area tags, and trial-level outcome labels.

The Pro layer includes 168,131 labeled catalyst events joined to forward equity returns and volatility metrics. It serves as a clinical trials–to–equity bridge for biotech catalyst modeling and drug pipeline research, enabling investors to systematically analyze the impact of trial outcomes on stock performance.

Feature Engineering: v1 & Roadmap

The current Pro release ships with a robust set of trial and catalyst signals. The roadmap extends this into deep outcome analysis and investigator networks.

Included in v1 Release

Trial Status

Recruiting, Active, Completed, Terminated, Withdrawn status flags.

Sponsor Type

Industry, NIH, University, Federal, Other classifications.

Phase Indicators

Phase 1, 2, 3, 4, Early Phase 1, Not Applicable mapping.

Enrollment Data

Target vs Actual enrollment, gender breakdown, age eligibility.

Condition Mapping

MeSH terms, ICD-10 codes, rare disease flags.

Intervention Types

Drug, Device, Biological, Procedure, Genetic, Dietary Supplement.

Roadmap: Ultra Pro Extensions

Outcome Analysis

Primary/Secondary endpoint success, p-value extraction.

Adverse Events

Serious vs Non-serious counts, organ system classification.

Publication Linkage

PubMed citations, journal impact factor, results publication lag.

Investigator Network

PI ranking, site performance, key opinion leader (KOL) mapping.

Editions

Choose the depth of intelligence you need

BioTrials Core

Clean clinical trial history, ready for your own models.

$799/license
  • 1999-2025 ClinicalTrials.gov registrations
  • ~559k trials + sponsor + catalyst tables
  • Clean clinical trial history
  • Sponsor entities with ticker coverage
Contact Sales

Technical Specifications

Delivery Format

  • Columnar Parquet
  • Partitioned by year/event date
  • DuckDB / Polars ready

History & Coverage

  • 1999–2025 History
  • Global Coverage
  • Daily Updates

Integration

  • Presigned URL downloads
  • Python / R compatible
  • SHA-256 verification