Background
The goal of assessing ADME properties is to understand how a potential drug candidate interacts with the human body, including absorption, distribution, metabolism, and excretion. This understanding is essential for evaluating the drug's efficacy, safety, and clinical potential, guiding drug development towards optimal therapeutic outcomes. Fang et al. (2023) disclosed DMPK datasets collected over 20 months, covering six ADME in vitro endpoints: human and rat liver microsomal stability, MDR1-MDCK efflux ratio, solubility, and human and rat plasma protein binding. The dataset includes between 885 and 3,087 measurements for each corresponding endpoint.
Description of readout
- Readouts:
LOG HLM_CLint (mL/min/kg)
, LOG RLM_CLint (mL/min/kg)
, LOG HPPB (mL/min/kg)
, LOG RPPB (mL/min/kg)
, LOG_MDR1-MDCK_ER
, LOG_SOLUBILITY
- Bioassay readout: Intrinsic clearance
- Optimization objective: Higher value
Benchmarking
The goal of this benchmark is to perform multitask learning and select the best models for predicting six ADME endpoints altogether.
Train/test split
To discover more potential hits that are similar to the discovered hits, a random splitting was applied.
Distribution of the train/test in the chemical space