You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We are trying to perform R-group enumeration using DrugEx v3 and would like to clarify whether DrugEx supports specifying attachment points for fragment-based molecule generation.
Background
We understand that DrugEx v3 can generate molecules based on input fragments, but we're uncertain about its capability to handle specific attachment points for R-group enumeration tasks.
Current Approach and Issues
Approach 1: Using fragments with dummy atoms ([*:1])
Issue: The generated graph data file (rgroup_data_graph.txt) contains only column headers but no actual data (empty matrix), preventing further molecule generation.
Issue: While this approach generates valid graph data and molecules successfully, most generated molecules do not grow from our intended attachment point. The model seems to modify the scaffold at random positions rather than the specific location where the [*:1] was originally placed.
Questions
Does DrugEx v3 natively support R-group enumeration with specified attachment points?
Is there a correct way to handle dummy atoms ([:1], [:2], etc.) in DrugEx input fragments?
If attachment point specification is not directly supported, what would be the recommended workflow for R-group enumeration tasks?
Are there any plans to support explicit attachment point specification in future versions?
Expected Behavior
We would like to:
Input a scaffold with clearly marked attachment points (e.g., [*:1])
Generate molecules that grow specifically from these marked positions
Maintain the core scaffold structure while only modifying the R-groups at specified locations
Environment
DrugEx version: v3.4.5
Python version: 3.8+
Operating System: Ubuntu
Any guidance or clarification would be greatly appreciated!
The text was updated successfully, but these errors were encountered:
Question
We are trying to perform R-group enumeration using DrugEx v3 and would like to clarify whether DrugEx supports specifying attachment points for fragment-based molecule generation.
Background
We understand that DrugEx v3 can generate molecules based on input fragments, but we're uncertain about its capability to handle specific attachment points for R-group enumeration tasks.
Current Approach and Issues
Approach 1: Using fragments with dummy atoms (
[*:1]
)Input scaffold:
O=C(CCC(F)(F)F)NC(c1cnn2cc(C(C3CCC(F)(F)CC3)[*:1])nc2c1)C1CC1
Command used:
python -m drugex.dataset \ -b ${base_dir} \ -i scaffold_with_star.tsv \ -mc SMILES \ -o rgroup_data \ -mt graph \ -s
Issue: The generated graph data file (rgroup_data_graph.txt) contains only column headers but no actual data (empty matrix), preventing further molecule generation.
File content example:
C0 C1 C2 C3 C4 ... C399
(no data rows)
Approach 2: Removing dummy atoms from fragments
Modified scaffold:
O=C(CCC(F)(F)F)NC(c1cnn2cc(C(C3CCC(F)(F)CC3))nc2c1)C1CC1
Issue: While this approach generates valid graph data and molecules successfully, most generated molecules do not grow from our intended attachment point. The model seems to modify the scaffold at random positions rather than the specific location where the [*:1] was originally placed.
Questions
Does DrugEx v3 natively support R-group enumeration with specified attachment points?
Is there a correct way to handle dummy atoms ([:1], [:2], etc.) in DrugEx input fragments?
If attachment point specification is not directly supported, what would be the recommended workflow for R-group enumeration tasks?
Are there any plans to support explicit attachment point specification in future versions?
Expected Behavior
We would like to:
Environment
Any guidance or clarification would be greatly appreciated!
The text was updated successfully, but these errors were encountered: