Molecule Design

Hierarchical representation based constrained multi-objective evolutionary optimisation of molecular structures

We propose an efficient algorithm to generate Pareto optimal set of reliable molecular structures represented by group contribution methods. To effectively handle structural constraints we introduce goal oriented genetic operators to the multi-objective Non-dominated Sorting Genetic Algorithm-II (NSGA-II). The constraints are defined based on the hierarchical categorisation of the molecular fragments. The efficiency of the approach is tested on several benchmark problems. The proposed approach is highly efficient to solve the molecular design problems, as proven by the presented benchmark and refrigerant design problems.

The article can be downloaded from the website of the journal or from my ResearchGate profile.

Group Contribution Method-based Multi-objective Evolutionary Molecular Design

The search for compounds exhibiting desired physical and chemical properties is an essential, yet complex problem in the chemical, petrochemical, and pharmaceutical industries. During the formulation of this optimization-based design problem two tasks must be taken into consideration: the automated generation of feasible molecular structures and the estimation of macroscopic properties based on the resultant structures. For this structural characteristic-based property prediction task numerous methods are available. However, the inverse problem, the design of a chemical compound exhibiting a set of desired properties from a given set of fragments is not so well studied. Since in general design problems molecular structures exhibiting several and sometimes conflicting properties should be optimized, we proposed a methodology based on the modification of the multi-objective Non-dominated Sorting Genetic Algorithm-II (NSGA-II). The originally huge chemical search space is conveniently described by the Joback estimation method. The efficiency of the algorithm was enhanced by soft and hard structural constraints, which expedite the search for feasible molecules. These constraints are related to the number of available groups (fragments), the octet rule and the validity of the branches in the molecule. These constraints are also used to introduce a special genetic operator that improves the individuals of the populations to ensure the estimation of the properties is based on only reliable structures. The applicability of the proposed method is tested on several benchmark problems.

The article can be downloaded from the website of the journal or from my ResearchGate profile.