[Question] Does DrugEx v3 support R-group enumeration with specified attachment points? #25

guosijia · 2025-05-29T06:53:02Z

Question

We are trying to perform R-group enumeration using DrugEx v3 and would like to clarify whether DrugEx supports specifying attachment points for fragment-based molecule generation.

Background

We understand that DrugEx v3 can generate molecules based on input fragments, but we're uncertain about its capability to handle specific attachment points for R-group enumeration tasks.

Current Approach and Issues

Approach 1: Using fragments with dummy atoms (`[*:1]`)

Input scaffold:

O=C(CCC(F)(F)F)NC(c1cnn2cc(C(C3CCC(F)(F)CC3)[*:1])nc2c1)C1CC1

Command used:

python -m drugex.dataset \
    -b ${base_dir} \
    -i scaffold_with_star.tsv \
    -mc SMILES \
    -o rgroup_data \
    -mt graph \
    -s

Issue: The generated graph data file (rgroup_data_graph.txt) contains only column headers but no actual data (empty matrix), preventing further molecule generation.

File content example:

C0 C1 C2 C3 C4 ... C399
(no data rows)

Approach 2: Removing dummy atoms from fragments

Modified scaffold:

O=C(CCC(F)(F)F)NC(c1cnn2cc(C(C3CCC(F)(F)CC3))nc2c1)C1CC1

Issue: While this approach generates valid graph data and molecules successfully, most generated molecules do not grow from our intended attachment point. The model seems to modify the scaffold at random positions rather than the specific location where the [*:1] was originally placed.

Questions

Does DrugEx v3 natively support R-group enumeration with specified attachment points?
Is there a correct way to handle dummy atoms ([:1], [:2], etc.) in DrugEx input fragments?
If attachment point specification is not directly supported, what would be the recommended workflow for R-group enumeration tasks?
Are there any plans to support explicit attachment point specification in future versions?

Expected Behavior

We would like to:

Input a scaffold with clearly marked attachment points (e.g., [*:1])
Generate molecules that grow specifically from these marked positions
Maintain the core scaffold structure while only modifying the R-groups at specified locations

Environment

DrugEx version: v3.4.5
Python version: 3.8+
Operating System: Ubuntu

Any guidance or clarification would be greatly appreciated!

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Question] Does DrugEx v3 support R-group enumeration with specified attachment points? #25

[Question] Does DrugEx v3 support R-group enumeration with specified attachment points? #25

guosijia commented May 29, 2025

[Question] Does DrugEx v3 support R-group enumeration with specified attachment points? #25

[Question] Does DrugEx v3 support R-group enumeration with specified attachment points? #25

Comments

guosijia commented May 29, 2025

Question

Background

Current Approach and Issues

Approach 1: Using fragments with dummy atoms ([*:1])

Approach 2: Removing dummy atoms from fragments

Modified scaffold:

Questions

Expected Behavior

Environment

Any guidance or clarification would be greatly appreciated!

Approach 1: Using fragments with dummy atoms (`[*:1]`)