The Antiviral Competition ResultsSee the results of the Polaris competition, organized by ASAP Discovery and OpenADMET.

Benchmark

Preview
plinder-org/runs-n-poses

A protein-ligand co-folding benchmark with 2,073 high-resolution protein-ligand systems released after 30 September 2021 with at most 100 similar ligands in the train set. The test set is further split in multiple subsets with increasing distance to the train set to test for generalization. Performance is measured using a compound metric of BiSyRMSD and LDDT-PLI.

Created on: February 03, 2025Train size: 0Test size: 76 + 106 + 149 + 272 + 301 + 364 + 328 + 477
Public
V2

Participants

Tags

structural-biology
co-folding
protein-ligand
pose-prediction

Related dataset

This is a Preview

As the authors of this benchmark are actively looking for feedback, this benchmark has been released as a preview. Interacting with it through the Polaris client is currently not possible. Share your thoughts on this work in the #runs-n-poses channel in our Discord!

Leaderboard

Test set
Task
#
NameContributors
success_rate
rmsd_success_rate
mean_lddt_pli
coverage
References
1

AlphaFold3

15.068
17.808
0.304
96.053
2

Boltz-1

5.556
9.259
0.250
71.053
3

Chai-1

4.688
18.750
0.281
84.211
4

Protenix

1.471
11.765
0.251
89.474

Details

README

🌹 Runs N' Poses 🌹

A protein-ligand co-folding prediction dataset and benchmark

image Figure last updated on February 4th, 2025

A protein-ligand co-folding benchmark with 2,073 high-resolution protein-ligand systems released after 30 September 2021 with at most 100 similar ligands in the train set. The test set is further split in multiple subsets with increasing distance to the train set to test for generalization. Performance is measured using a compound metric of BiSyRMSD and LDDT-PLI.

See the GitHub repository for more details on how these methods were run and evaluated.

Metrics

We measure the following 4 performance measures

  • success_rate: The percentage of predictions with a LDDT-PLI > 0.8 and a BiSyRMSD < 2.
  • rmsd_success_rate : The percentage of predictions with a BiSyRMSD < 2.
  • mean_lddt_pli : The average LDDT-PLI.
  • coverage : The percentage of systems that the method could produce a prediction for.

Split

Train

We don't provide the train set here. The train set should only include PLI systems from before 30 September 2021.

Test

The test set consists of 2,073 high-resolution protein-ligand systems released after 30 September 2021 with at most 100 systems with similar ligands in the train set (same as Figure 1B in the paper cited below). It is further divided in 8 subsets, binned by their similarity (as defined by sucos_shape_pocket_qcov) to the training set:

  • Test 0 to 20 (n=76)
  • Test 20 to 30 (n=106)
  • Test 30 to 40 (n=149)
  • Test 40 to 50 (n=272)
  • Test 50 to 60 (n=301)
  • Test 60 to 70 (n=364)
  • Test 70 to 80 (n=328)
  • Test 80 to 100 (n=477)

Citation

If you use this dataset in your research, please cite Runs N' Poses and PLINDER:

Škrinjar, P., Eberhardt, J., Durairaj, J., & Schwede, T. "Have protein-ligand co-folding methods moved beyond memorisation?" bioRxiv (2025): 2025-02
Durairaj, Janani, et al. "PLINDER: The protein-ligand interactions dataset and evaluation resource." bioRxiv (2024): 2024-07.

User Attributes

These are custom, user-defined attributes that are not required by the Polaris data model.

AttributeValue
Paper
Githubhttps://github.com/plinder-org/runs-n-poses
Zenodohttps://doi.org/10.5281/zenodo.14794786
runs-n-poses | Polaris