Introducing Certified DatasetsReviewed against basic checks, certified datasets are more visible on Polaris.

This dataset has not yet been certified by approved reviewers. It may contain issues related to data completeness and quality.

Dataset

Drug mutagenicity.

Created on: July 22, 2024Dataset size: 164 KBNumber of datapoints: 7,278
Public

Tags

TOX

Modalities

MOLECULE

Related benchmarks

2024-07-22

Details

README

Background

Mutagenicity means the ability of a drug to induce genetic alterations. Drugs that can cause damage to the DNA can result in cell death or other severe adverse effects. Nowadays, the most widely used assay for testing the mutagenicity of compounds is the Ames experiment which was invented by a professor named Ames. The Ames test is a short-term bacterial reverse mutation assay detecting a large number of compounds which can induce genetic damage and frameshift mutations. The dataset is aggregated from four papers.

Description of readout

Task Description: Binary classification. Given a drug SMILES string, predict whether it is mutagenic (1) or not mutagenic (0).

Data resource

Reference: [1] In silico Prediction of Chemical Ames Mutagenicity