Drug-target binding kinetics with mechanism-grounded discipline.
Curated datasets and physics-grounded ML for predicting selectivity in domains where the literature has rich mechanism and thermodynamic coverage but kinetic measurements are sparse.
In active research. Methodology paper in progress.
A research-and-product effort focused on a specific gap in drug discovery infrastructure: the layer where binding kinetics, target selectivity, and mechanism-grounded prediction intersect.
We do three things:
Curate kinetic and binding-affinity data to publication-grade quality, with provenance, multi-modality reproducibility weighting, and explicit data-quality flagging.
Model with formalism chosen by substrate mechanism — not GKSL by default, not deep learning by default. Empirical rate laws, two-state equilibrium, multi-state classical master equations, or genuine quantum dynamics — whichever the physics supports.
Pre-register methodology, training data, and acceptance criteria before evaluation. Document honest negative results. No post-hoc reinterpretation.
Active research substrate: carbonic anhydrase (CA) inhibitor selectivity.
We're building a PLBD-trained ML predictor for CA II vs CA IX selectivity, using multi-modality experimental data (SPR + ITC + FTSA + SFA) combined via reproducibility weighting. To our knowledge, no such predictor exists in the literature; the methodology paper is in progress.
Datasets currently curated:
Methodology:
The four-phase research protocol — mechanism study, formalism mapping, textbook rate laws, ML — runs separately for each substrate. Cross-substrate parameter inheritance is not assumed. Each substrate paper is its own claim.
Browse the curated kinase dataset. Click any row for compound details.
| Target | Compound | koff (s⁻¹) | τres | Assay | Mode | Source |
|---|---|---|---|---|---|---|
Loading dataset… | ||||||
To prevent overinterpretation:
Now: Run 3 + Run 4 evaluation on CA II vs CA IX selectivity. Pre-registered acceptance bars (R² ≥ 0.78 / 0.55 / 0.40 tiered). All results reported with equal prominence; no cherry-picking.
Next: CA selectivity preprint, then peer-reviewed venue. Post-publication: scoped API access for collaborating labs.
Beyond: Methodology applies to additional substrate domains (GPCRs, enzymes, ion channels) on the same mechanism-first → pre-registered → ML evaluated path. No commitments until each substrate's Phase 1 mechanism study completes.
Working in a domain where this might apply, or curious about specific datasets we've curated?