Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Filter by Categories
Biomedical and Pharmaceutical Sciences Original Article
Biomedical and Pharmaceutical Sciences, Review Article
Case Report, Biomedical and Pharmaceutical Sciences
Commentaries, Biomedical and Pharmaceutical Sciences
Ethnomedicine, Review Article
Food Science, Review Article
Global Health Outreach, Review Article
Letter to Editor
Nutrition, Invited Editorial
Nutrition, Review Article
Orginal Article, Research
Original Article, Biomedical and Pharmaceutical Sciences
Original Article, Computational Chemistry
Original Article, Toxicology
Original Research Article, Biomedical and Pharmaceutical Sciences
Perspectives on Current Topics, Public Health
Public Health, Case Report
Research Article, Biomedical and Pharmaceutical Sciences
Research, Original Article
Review Article, Agricultural Sciences
Review Article, Biomedical and Pharmaceutical Sciences
Review Article, Global Health Outreach
Review Article, Research
Review Article, Toxicology
Toxicology Original Article
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Filter by Categories
Biomedical and Pharmaceutical Sciences Original Article
Biomedical and Pharmaceutical Sciences, Review Article
Case Report, Biomedical and Pharmaceutical Sciences
Commentaries, Biomedical and Pharmaceutical Sciences
Ethnomedicine, Review Article
Food Science, Review Article
Global Health Outreach, Review Article
Letter to Editor
Nutrition, Invited Editorial
Nutrition, Review Article
Orginal Article, Research
Original Article, Biomedical and Pharmaceutical Sciences
Original Article, Computational Chemistry
Original Article, Toxicology
Original Research Article, Biomedical and Pharmaceutical Sciences
Perspectives on Current Topics, Public Health
Public Health, Case Report
Research Article, Biomedical and Pharmaceutical Sciences
Research, Original Article
Review Article, Agricultural Sciences
Review Article, Biomedical and Pharmaceutical Sciences
Review Article, Global Health Outreach
Review Article, Research
Review Article, Toxicology
Toxicology Original Article
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Filter by Categories
Biomedical and Pharmaceutical Sciences Original Article
Biomedical and Pharmaceutical Sciences, Review Article
Case Report, Biomedical and Pharmaceutical Sciences
Commentaries, Biomedical and Pharmaceutical Sciences
Ethnomedicine, Review Article
Food Science, Review Article
Global Health Outreach, Review Article
Letter to Editor
Nutrition, Invited Editorial
Nutrition, Review Article
Orginal Article, Research
Original Article, Biomedical and Pharmaceutical Sciences
Original Article, Computational Chemistry
Original Article, Toxicology
Original Research Article, Biomedical and Pharmaceutical Sciences
Perspectives on Current Topics, Public Health
Public Health, Case Report
Research Article, Biomedical and Pharmaceutical Sciences
Research, Original Article
Review Article, Agricultural Sciences
Review Article, Biomedical and Pharmaceutical Sciences
Review Article, Global Health Outreach
Review Article, Research
Review Article, Toxicology
Toxicology Original Article
View/Download PDF

Translate this page into:

Research Article
Biomedical and Pharmaceutical Sciences
2026
:6;
6
doi:
10.25259/AJBPS_24_2025

Predicting the transport of chemicals across the skin: Stochastic calculus and machine learning

Department of Biological and Pharmaceutical Sciences, Touro University, California, United States.
College of Pharmacy, Touro University, California, United States.
Author image
Corresponding author: Kevin Ita, Department of Biological and Pharmaceutical Sciences, Touro University, California, United States. kita@touro.edu
Licence
This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-Share Alike 4.0 License, which allows others to remix, transform, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms.

How to cite this article: Ita K, Lebeke A. Predicting the transport of chemicals across the skin: Stochastic calculus and machine learning. Am J Biopharm Pharm Sci. 2026;6:6. doi: 10.25259/AJBPS_24_2025

Abstract

Objectives:

While quantitative structure–permeability relationships (QSPR) models have enhanced the understanding of structure–permeability relationships, their deterministic characteristics frequently underestimate the variability observed in experimental datasets, stemming from variations in skin shape, experimental circumstances, and measurement noise. Stochastic differential equations (SDEs) can be employed to model data that includes random noise from various sources. In this research project, the Euler–Maruyama (EM) algorithm, the Milstein scheme, and the Heston stochastic model were utilized to predict permeability coefficients.

Materials and Methods:

A curated dataset containing molecular weight, log Kow, Balaban Index, Harary Index, Ramification Index, and Forgotten Index served as inputs.

Results:

The Milstein method produced an MSE of 0.634 and an R2 of 0.74. The Heston model resulted in a mean squared error (MSE) of 0.66 and an R2 of 0.72. In a second set of studies, three machine learning models – gradient boost regression, support vector regression, and the LevenbergMarquardt algorithm – were applied to predict skin permeability coefficients from the same dataset. The models were trained on a dataset of eighty-seven chemicals. The LevenbergMarquardt method yielded a mean squared error (MSE) of 0.94 and a coefficient of determination (R2) of 0.58. Gradient boost regression (GBR) achieved R2 and MSE values of 0.84 and 0.45, respectively. Support vector regression (SVR) produced an R2 of 0.78 and an MSE of 0.62.

Conclusion:

In this research project, we utilized the Euler–Maruyama (EM) algorithm, the Milstein scheme and the Heston stochastic model to predict permeability coefficient . Optimization of machine learning algorithms augments predictive accuracy and diminishes the time, resources, and labor expended on skin transport studies.

Keywords

Calculus
Dermal transport
Machine learning
Risk assessment
Transdermal delivery

Highlights

  • Stochastic differential equations (SDEs) were applied to model the inherent variability in skin permeability data, addressing limitations of deterministic quantitative structure– permeability relationships (QSPR) models.

  • Three stochastic approaches – the Euler–Maruyama algorithm, the Milstein scheme, and the Heston model – were implemented and compared for predicting permeability coefficients.

  • Six molecular descriptors (molecular weight, log Kow, Balaban Index, Harary Index, Ramification Index, and Forgotten Index) served as predictive inputs.

  • The Euler–Maruyama method demonstrated superior stochastic modeling performance with an R2 of 0.81 and MSE of 0.5259.

  • Among the machine learning approaches, gradient boost regression achieved the best predictive accuracy (R2 = 0.84, MSE = 0.45), outperforming both support vector regression and the Levenberg–Marquardt algorithm.

  • Comparative analysis revealed that gradient boost regression and the Euler–Maruyama method offer the most reliable predictions for skin permeability coefficient estimation.

  • This study provides a framework for integrating stochastic modeling with machine learning to account for experimental uncertainty in permeability predictions.

INTRODUCTION

Skin permeability coefficient (log Kp) prediction is essential for the design of transdermal medication delivery devices and for risk assessment.[1] A variety of in silico and in vitro experimental methodologies can be employed to forecast and quantify, respectively, the permeation of chemicals through the skin.[2] The mathematical tools comprise equations employed to forecast skin permeability based on physicochemical characteristics, including molecular weight (MW) and the octanol-water partition coefficient (logKow), among others.[2] The prediction of cutaneous absorption is a significant study area in the pharmaceutical and cosmetics industries, facilitating the optimization of transdermal permeation and aiding in hazard and risk assessment.[3] The primary advantages of theoretical predictions compared to experimental measurements encompass economic efficiency and the mitigation of ethical dilemmas.[3] Furthermore, the models may facilitate a more profound comprehension of absorption mechanisms.[3] The health risk that comes from skin contact with harmful chemicals can be measured by finding the substances’ skin penetration coefficient (Kp). Nonetheless, due to inherent limitations of the technique, Kp evaluation is not performed satisfactorily.[4] Despite the remarkable advancements achieved in recent decades, precise mathematical modeling of skin permeability continues to pose a significant challenge.[5]

A crucial factor determining a molecule’s capacity to permeate the skin is its permeability coefficient (Kp).[6] Quantitative structure–permeability relationships (QSPR) generally employ descriptors[7] such as MW, partition coefficient (log Kow), and topological indices to forecast log Kp.[8] While QSPR models have enhanced our comprehension of structure–permeability correlations, their deterministic characteristics frequently underestimate the variability observed in actual datasets,[9,10] which stems from variations in skin morphology, experimental circumstances, and measurement noise.[11,12] From a therapeutic perspective, it is essential to establish methodologies that can characterize and forecast the uncertainty in transdermal permeation parameters. Uncertainty quantification is receiving heightened attention, and numerous strategies are being devised to address this challenge.[13] Epidermal permeability, commonly represented by the stratum corneum permeability coefficient, Kp, is a significantly variable experimental parameter.[5] The measurements are influenced by the skin’s state and the experimental design.[5] In analogous in vitro tests, Kp-values may fluctuate by several orders of magnitude based on the source of the skin sample, such as whether it is derived from different persons, cadavers, or anatomical regions.[5] Similarly, experimental variables such as temperature, vehicle, pH, and others also affect Kp-values.[5]

Classical QSPR models offer deterministic predictions from molecular descriptors, but they often neglect biological variability and measurement uncertainty.[9,14] The predominant erroneous assumption in quantitative structure-activity relationship (QSAR) and quantitative structure-permeability relationship (as well as QSPR) modeling is that the reported value for any experimental endpoint represents the definitive “true” value for that measurement.[9] It is assumed that the provided experimental value represents the sample mean, which adequately approximates the population mean (actual value) of all potential measurements.[9] Consequently, data modeling often fails to facilitate a comprehensive knowledge of population distribution and associated uncertainties.[9] Uncertainty in physico-chemical data is intrinsic, regardless of whether the data are measured or modeled.[14]

Differential equations have been utilized by researchers to characterize systems.[15] Several scholars have employed ordinary or stochastic differential equations (SDEs) to model various dynamical systems.[16] Stochasticity, or unpredictability, influences mathematical modeling of scientific predictions.[17] SDEs are mathematical models extensively employed to characterize complex phenomena that are sometimes influenced by noise.[18] The dynamics of biological systems are predominantly affected by stochastic processes and are susceptible to random external disruptions.[19] The ramifications of such processes are frequently examined through the formulation and evaluation of stochastic models.[19] A multitude of scientists have employed SDEs to model diverse events.[20] SDEs can concurrently encapsulate the established deterministic behaviors of relevant variables while allowing a modeler to account for the unexplained random variables.[20]

Conventional QSPR models frequently employ deterministic regression based on molecular descriptors, including MW and lipophilicity. The intricate and diverse nature of the skin barrier, along with the random characteristics of molecule diffusion, necessitates the application of SDEs.[18] The Euler–Maruyama (EM) method is a widely used numerical methodology for modeling SDEs.[21] We additionally employed the Milstein scheme (MS) for forecasting. The Milstein approach approximates solutions to SDEs by integrating higher-order Itô’s lemma terms, hence enhancing accuracy compared to basic Euler–Maruyama techniques.[22] MS improves QSPR models of skin permeability by including uncertainty while preserving predicted accuracy.[13] It offers probabilistic predictions that facilitate decision-making in transdermal medication development and safety evaluations. In the Heston model, volatility is permitted to “diffuse” throughout the entire space governed by a diffusion mechanism.[23] The Heston model has two correlated SDEs.[23] The spectrum of Heston’s volatility is continuous. This study project employed the EM algorithm, the MS, and the Heston stochastic technique[13,24,25] to forecast the permeability coefficient.

In addition to the above-mentioned stochastic models, we also utilized machine learning algorithms (Gradient Boost, Support Vector Regression [SVR], as well as the Levenberg– Marquardt [LM] technique) for prediction. Gradient boosting (GB) methods use numerical optimization for classification and regression by iteratively advancing in the approximate direction of the negative gradient of the loss function.[26] Gradient boosting strategies typically utilize a “weak learner” (such as a regression tree) as an approximation of the steepest descent path, owing to the difficulties associated with accurately navigating the negative gradient.[26] The Gradient boosting regression (GBR) is an ensemble model comprising an iterative sequence of increasingly structured tree models, with each successive model learning from the errors of its predecessor.[27] This machine learning approach employs “boosting” of a collection of weak prediction models, usually decision trees (DTs), to develop a more resilient model. A GBR of M trees is conventionally represented as:

Equation 1

fmxi=mMγmhmxj

where fm is the predicted output value, M is the number of DTs, γm is a scaling factor adding the contribution of a tree to the model, and hm is the weak learner.[27] The forecast for a new observation is obtained by aggregating the predictions of all individual trees that comprise the model.[28] Gradient boosting utilizes a statistical method to improve traditional DT models.[29] The objective is to integrate a set of underlying models to develop a unified, robust model. The GB approach progressively constructs new DTs through residual reduction. This iterative method improves estimates by progressively integrating a new tree with a minimized loss function.[29]

SVR is a widely utilized supervised machine learning method.[30] It serves as the regression equivalent of the more widely recognized support vector (SV) machine classifiers.[30] SVR can be employed for non-linear regression (LR) through a technique referred to as the kernel trick.[31] This regression approach aims to estimate a function f(x) based on the principle of structural risk minimization derived from statistical learning theory.[32] It improves generalization capability by decreasing the upper limit of risk, hence alleviating overfitting and leading to a diminished generalization error.[32] The SVR model produces markedly nonlinear functional relationships between the input data and the feature space.[30] This technique enables comparatively straightforward mathematical formulations and calculations.[30] The radial basis (RB) function (RBF) kernel function converts the original input space into a feature space with higher dimensions.[33] The aim is to determine the regression hyperplane created by the SVs

Observations in the calibration set nearest to the hyperplane are referred to as SVs.[34] The direction of the hyperplane is defined by SVs. Epsilon (ε) quantifies the allowable error in the model’s predictions. An elevated ε permits a wider buffer for accommodating larger prediction errors.[34] The cost (C) regulates the equilibrium between error reduction and hyperplane margin enhancement, while sigma (σ) can alter the Gaussian basis functions in the RBF kernel. A diminished σ results in a more complex model.[34] SVR utilizes an error measure characterized by a margin of tolerance (ε-insensitive) to ignore errors that are less than ε.[35] This makes the fitting less vulnerable. SVR differs from LR using an ε-insensitive loss function instead of the traditional L2-norm loss function, hence improving its outof-sample performance.[32] The principle of SVR is to identify the best hyperplane that contains the maximum number of data points within the data space.[33] The effectiveness of an SVR model depends on three essential factors.[36] The parameter C enables the balance between training error and model complexity. Excessively large values of parameter C will result in overfitting. If C is too small, it may result in underfitting and an increase in training errors.The parameter ε governs the width of the ε-insensitive zone, employed to accommodate the training data. An increased ε-value results in a diminished quantity of selected SVs and yields flatter or less intricate regression estimations. If ε is very big, the separation error is substantial, the number of SVs is low, and vice versa. Nonlinear mapping from the input space to a high-dimensional feature space is determined by the kernel parameter (γ).[37]

The LM curve-fitting method combines two minimization methods: the Gauss-Newton method and the gradient descent method.[38] This optimization technique has been used to solve non-linear least squares problems. The identification of a model from a large dataset depends on the parameter optimization method.[36] Gradient-based optimization methods are commonly utilized for solving nonlinear problems because of their high reliability and computational efficiency.[36] The LM method is an iterative approach that can be used to compute unknown parameters.[39] LM includes a damping component that facilitates the dynamic modification of the iterative direction and step size, bridging the Gauss–Newton method with the steepest gradient technique.[36] By carefully selecting the damping factor, suitable convergence can be achieved, hence avoiding the constraint of a full rank (rows or columns are all linearly independent) that is required for a Jacobian matrix (a matrix of all first-order partial derivatives of a system).[36,40]

QSPR models connect some structural properties of molecules (termed molecular descriptors to skin permeability.[41] A topological index is a number assigned to a molecule based solely on its connectivity pattern or topology without taking into consideration angles or bond lengths.[42] Topological indices (a subset of molecular descriptors) are important tools in QSPR studies.[43] A molecular graph represents the structural formula of a chemical compound, where the vertices correspond to atoms and the edges represent atomic bonds.[44] Molecular graphs are applied in cheminformatics, QSAR, and QSPR.[45] Numerous topological indices have been proposed and used in various research domains.[46-49] Topological indices can be categorized into three groups: distance-based indices, degree-based indices, and spectrum-based indices.[50] A multitude of distinct descriptors (also called invariants) numbering in the hundreds have been utilized in QSAR/QSPR studies.[51] Degree-based indices encompass the Zagreb index, Randić index, and connective eccentricity index, among others.[52] Distance-based indices include the Wiener polarity index, Wiener index, Szeged index, Mostar index, Kirchhoff index, ABC index, Harary index, among others.[53,54] Spectrum-based topological descriptors refer to those computed from eigenvalues of valency-related or distance-related chemical matrices.[55] The adjacency energy, signless Laplacian energy, the Estrada index, the distance energy, as well as the Laplacian energy are spectrum-based topological indices.[55]

We employed topological indices as features (independent variables) in the machine learning models for predicting permeability. Topological descriptors are numbers used to analyze various physicochemical properties of molecules.[56] They are generally derived from chemical graphs. Chemical graphs portray molecules, with atoms depicted as vertices while edges represent chemical bonds.[57] According to the chemical graph theory, certain molecular properties can be evaluated using the data reflected in their corresponding chemical graphs.[57] A graph invariant, also known as a topological index, is a function applied to a graph that remains unaffected by the labeling of its vertices.[57] A molecular graph is a fundamental graph linked to a molecular structure.[49] Each vertex of a molecular graph represents an atom of the molecule, while the edges denote the atomic bonds.[49] Certain topological descriptors are utilized by researchers to evaluate drug molecules, facilitating the identification of important pharmacological features.[49] A molecular graph G=(V, E) consists of the edge set E(G) and the vertex set V(G).[49] The degree of a vertex v in graph G, denoted as dG(v), is the number of edges incident to it.[49] The topological index of a graph G is Top(G), and that of a graph H is Top(H). When Top(H) = Top(G), then graph H is considered to be isomorphic to graph G.[49]

The Balaban index of a graph G is a distance-based topological metric that has been effectively employed in many QSPR and QSAR models.[58] It has been demonstrated in comparative studies investigating the Balaban index [J(G)] and the Wiener index [W(G)] for alkanes that the J index decreases the degeneracy of W and imparts markedly improved discriminative power.[58] The Balaban index is an important topological index in chemical and biological research.[59] It is comparable to the Wiener index, but displays diminished degeneracy and improved discriminative power.[59] This is especially beneficial for more complicated and larger molecular structures. Degeneracy occurs when the topological metrics do not differentiate between molecules with equal cycle numbers or atom counts. The Balaban index solves this issue more successfully than competing indices, showcasing its relevance. This can be attributed to its high discriminatory ability and low degeneracy. J is especially useful when investigating complex molecules with numerous heteroatoms and bonds. Several researchers have used this index to study numerous properties such as octane ratings, density, boiling point, and refractive index. It has also been employed in the study of supramolecular substituted sulfonamides.Some investigators have used this index to predict the melting and glass transition temperatures of linear macromolecules and to determine the properties of polymers. J has been utilized in biological research that focuses on optimizing physiologically active lead compounds, alongside other chemical descriptors.[59]

The Harary index [(G)] is a distance-oriented topological index.[60] (G) was independently proposed in 1993 by Ivanciuc et al.[61] and Plavšić et al.[62] In a connected molecular graph, the Harary index is the inverse of the total distance sum between all unordered vertex pairs of G.[63] The Harary index is a useful measure for evaluating the compactness of a molecule. An elevated Harary index signifies enhanced molecular compactness. The Harary index serves to assess branching in alkanes. In compact structures, the number of short distances increases, leading to a Harary index that surpasses that of extended structures. Similarly, a more complex configuration demonstrates greater short distances, leading to an elevated Harary index for these structures.[64]

In 2015, Furtula and Gutman introduced a topological index, usually referred to as the Forgotten Index.[65] The Forgotten topological index is denoted as the summation of the squares of adjacent vertices. This index is regarded as a suitable instrument for examining the medical and biological data of drug molecular structures.[66] Furtula and Gutman proposed the forgotten index (F-index) to predict the characteristics of octane isomers.[65] This index was referenced in the same article that presented the Zagreb indices in 1972, albeit it garnered less notice.[67] After 43 years, Gutman and Furtula noted that the overlooked index is an extraordinarily effective and powerful instrument for ascertaining the characteristics of organic and inorganic compounds.[67] The F-index is defined as the summation of the cubes of the degrees of the vertices in molecular networks.[68] The Ramification Index is computed as the summation of all vertex degrees exceeding two, subtracting two from each vertex degree.[69,70] In this project, we used six different regression approaches. Table 1 shows the comparative analysis of stochastic and machine learning techniques. A working model of this project is shown in Figure 1 and Appendix 1.

Graphical abstract. MSE: Mean squared error, GBR: Gradient boost regression, E-M: Euler–Maruyama algorithm, SDEs: Stochastic differential equations.
Figure 1:
Graphical abstract. MSE: Mean squared error, GBR: Gradient boost regression, E-M: Euler–Maruyama algorithm, SDEs: Stochastic differential equations.
Table 1: Comparative analysis of stochastic and machine learning methods for skin permeability coefficient prediction.
Method Euler–Maruyama method Milstein scheme Heston model Levenberg- Marquardt method Gradient boost regression Support vector regression
Type Stochastic numerical method Stochastic numerical method Stochastic volatility model Optimization algorithm Ensemble machine learning method Kernel-based machine learning method
Primary purpose Numerical approximation of stochastic differential equations (SDEs) Numerical approximation of SDEs with improved accuracy Modeling systems with stochastic volatility Nonlinear least squares curve fitting and neural network training Predictive modeling through sequential ensemble learning Regression through optimal hyperplane fitting in high-dimensional space
Mathematical basis Extension of Euler method to SDEs; incorporates Wiener process Extends Euler–Maruyama by including higher-order Itô-Taylor expansion terms Two-dimensional SDE system coupling asset price with variance Combination of gradient descent and Gauss-Newton methods Sequential addition of weak learners minimizing loss function via gradient descent Minimization of ε-insensitive loss function with regularization; kernel trick for nonlinearity
Order of convergence Strong order 0.5; Weak order 1.0 Strong order 1.0; Weak order 1.0 Depends on discretization scheme applied Iterative; convergence depends on initial conditions and damping parameter Iterative; controlled by learning rate and number of estimators Convergence depends on kernel selection and hyperparameter tuning
Key equation component Drift term+Diffusion term×Random increment Drift term+Diffusion term×Random increment+ Correction term involving diffusion derivative Drift process+ Correlated Wiener processes for price and volatility Jacobian matrix computation; adaptive damping parameter (λ) Negative gradient of loss function; additive combination of decision trees Lagrangian multipliers; kernel function (RBF, linear, polynomial); slack variables
Handling of noise Additive Gaussian noise through Wiener increments More accurate capture of multiplicative noise effects Models time-varying, mean-reverting stochastic volatility Does not inherently model stochastic noise; deterministic optimization Robust to noise through ensemble averaging; regularization parameters Robust to outliers through ε-insensitive tube; soft margin allows tolerance
Computational complexity Low; simple to implement Moderate; requires computation of diffusion derivatives Higher; involves solving coupled SDEs Moderate to high; requires Jacobian computation at each iteration Moderate to high; depends on number of trees and depth Moderate; scales with training data size; kernel computation intensive
Key hyperparameters Time step size (Δt) Time step size (Δt) Mean reversion rate; long-term variance; volatility of volatility; correlation Damping parameter (λ); convergence tolerance Number of estimators; learning rate; maximum depth; minimum samples per leaf Kernel type; regularization parameter (C); epsilon (ε); gamma (for RBF kernel)
Advantages Simplicity; computational efficiency; easy implementation Higher accuracy than Euler–Maruyama for same step size Captures volatility clustering and mean reversion; realistic for financial data Fast convergence near optimum; robust for nonlinear problems High predictive accuracy; handles nonlinear relationships; feature importance ranking Effective in high-dimensional spaces; robust to overfitting; handles nonlinear relationships
Limitations Lower accuracy; may require smaller time steps More complex implementation; requires differentiable diffusion coefficient Parameter estimation can be challenging; computationally intensive May converge to local minima; sensitive to initial parameter estimates Prone to overfitting if not regularized; computationally expensive for large datasets Sensitive to hyperparameter selection; less interpretable; slower training on large datasets
Typical applications Physics simulations; financial modeling; biological systems Financial derivatives pricing; scientific simulations requiring higher precision Options pricing; risk management; volatility forecasting Neural network training; parameter estimation; regression analysis Predictive analytics; ranking systems; anomaly detection; QSAR/QSPR modeling Pattern recognition; bioinformatics; QSAR/QSPR modeling; time series forecasting
Performance in this study R2=0.81; MSE=0.5259 R2=0.74; MSE=0.634 R2=0.72; MSE=0.66 R2=0.58; MSE=0.94 R2=0.84; MSE=0.45 R2=0.78; MSE=0.62
Ranking by R2 value 2nd 4th 5th 6th 1st 3rd
Ranking by MSE value 2nd 4th 5th 6th 1st 3rd

MSE: Mean squared error, RBF: Radial basis function, QSAR: Quantitative structure-activity relationships, QSPR: Quantitative structure–permeability relationships

Supplementary Material 1

Supplementary Material 2

Supplementary Material 3

MODELING FRAMEWORK FOR PERMEABILITY COEFFICIENT PREDICTION

The EM stochastic framework

The EM method was used to improve the prediction of skin permeability by embedding stochastic fluctuations into model forecasts. Classical QSPR models are deterministic[71] while biological systems exhibit stochastic variability.[72] Additional degree of reality is achieved in modeling biological systems when the stochastic method is incorporated compared to the deterministic technique.[73] The EM method introduces randomness, enabling uncertainty quantification.[74] The Ornstein–Uhlenbeck (O–U) process is characterized by its Markov properties, Gaussian distribution, and stationary nature.[75] The O–U process is a more efficient approach, especially when considering noise, compared to linear models.[75] It provides obvious benefits, such as nonnegativity and continuity. In addition, it is less challenging to estimate parameters.[75] An O–U-type SDE[76,77] was adopted to represent the permeability coefficient log Kp (dynamics):

dYt = −θ (Yt−f(X))dt+σdWt Equation 2

where Yt is the skin permeability coefficient at time t, f(X) is the deterministic baseline prediction from molecular descriptors X, θ > 0 is the mean-reversion parameter, σ > 0 is the diffusion coefficient, and Wt is a standard Wiener process. The EM discretization approximates the SDE by:[78]

Yn+1 = Ynθ(Ynf(X))Δt+σΔtxn, Equation 3

where ξnN(0,1) and Δt is the time-step. For each compound, multiple stochastic trajectories of length T are generated. The mean of the terminal values YT provides the stochastic point prediction, and the standard deviation across trajectories quantifies predictive uncertainty. Averaging across multiple stochastic realizations yields the EM mean prediction used in the parity plot.

Milstein method

The Milstein method, a higher-order scheme for SDE simulation, offers an effective numerical approach to capture state-dependent noise and to generate probabilistic forecasts[13] of skin permeability:

The SDE for skin permeability is given in Equation 2. The EM scheme approximates the SDE but fails to fully account for multiplicative noise. The Milstein method[13] introduces a correction term:

Xt+Δt=Xt+aXtΔt+bXiΔWt+12bXtb'XtΔWt2Δt Equation 4

This term improves convergence to order one and captures the effect of state-dependent diffusion.

Heston framework

The Heston model is particularly attractive because it couples a mean-reverting process with a stochastic variance component.[25] We used this method to model variability in skin permeability coefficient (Kp) across chemical compounds. Using a curated dataset of molecular descriptors (MW, Log Kow, Balaban index, Harary index, Ramification index, and the Forgotten index), we compared deterministic regression models with a Heston stochastic framework. We modeled log Kp as an O–U–type process with stochastic volatility:[77]

dYt=κyμxYtdt+vtdWt1,dvt=κvθvvtdt+ξvtdWt2, Equation 5

with correlation n˜=CorrdWt1,dWt2.

Here:, Yt is the log Kp trajectory, μ(x) is a deterministic regression prediction, vt is the stochastic variance process, while the parameters are mean-reversion rates κyv, long-run variance θv, volatility of volatility ξ, and correlation ρ.

METHODS

Simulation

Data and molecular descriptors

The dataset comprised experimentally measured log Kp values together with six molecular descriptors commonly used in permeability modeling: MW, Log Kow, Balaban Index, Harary Index, Ramification Index, and the Forgotten Index. Experimental logKp values for a set of model compounds were extracted from published transdermal studies. The descriptors were included as inputs to f(X). The simulation variables were: step size: Δt = 0.01, iterations: 1,000–5,000 per compound, mean EM predictions obtained as the ensemble average. The evaluation metrics were: Mean squared error (MSE), coefficient of determination (R2). For the Heston model, data were preprocessed, normalized, and split into validation, training, and test sets. As a baseline, a Ridge regression model was trained to approximate the deterministic mapping:[79]

μ(x) = fridge (x), Equation 6

where x denotes the descriptor vector.

Visualization

Plots were generated comparing observed logKp values against EM, Milstein, and Heston mean predictions. The dashed lines represent the ideal y = x relationship.

Machine learning

Data collection and pre-processing

In 1990, Flynn published a dataset that has been cited widely.[80] In our research project, we extracted data from the Flynn dataset [Table 2] to predict skin permeability using gradient boost regression, SVR, and the LM algorithm (LMA). A robust predictive model can only be established using high-quality data samples.[81] Data preprocessing and normalization were performed. Data pre-processing aims to convert raw data into a format that is more accessible and efficient for further processing stages.[82] Data normalization is a method that entails converting the values of a feature or variable into a new range, often between 0 and 1 or −1 and 1.[83] A modified Flynn dataset was used to assess the influence of the physicochemical characteristics of solutes on skin permeability coefficient values. The data were preprocessed by addressing missing values, outliers, and inconsistencies.[84] Numerical data were normalized and standardized to ensure all features are on a uniform scale, hence preventing the dominance of variables with greater magnitudes.[84]

Table 2: Dataset of molecular descriptors and permeability coefficients.
Compound Molecular weight Log Kow Balaban index Harary index Forgotten index logkp
Aldosterone 360.44 1.08 1.74 5.65 41.0 -5.52
Amobarbital 226.27 1.96 2.87 3.67 25.0 -2.64
Atropine 289.38 1.81 1.63 4.69 33.5 -5.07
Barbital 184.19 0.65 3.01 3.08 22.0 -3.95
Benzyl alcohol 108.13 1.1 2.83 2.0 15.5 -2.22
4-bromophenol 173.01 2.59 3.03 1.83 15.5 -1.44
Butanoic acid (butyric acid) 88.1 0.79 2.83 1.58 4.0 -3.0
n-butanol 74.12 0.88 2.19 1.5 7.0 -2.6
2-butanone 72.1 0.28 2.85 1.33 24.0 -2.35
Butobarbital 212.24 1.65 2.89 3.58 16.5 -3.71
2-chlorophenol 128.56 2.15 3.13 1.86 15.5 -1.48
4-chlorophenol 128.56 2.39 3.03 1.83 15.5 -1.44
Chloroxylenol 156.61 3.39 3.31 2.22 17.5 -1.28
Chlorpheniramine 274.8 2.7 2.26 4.31 35.0 -2.66
Codeine 299.3 0.89 1.82 4.62 36.5 -4.31
Cortexolone (11-desoxy-17-hydroxycorticosterone) 346.45 2.52 2.3 4.67 42.0 -4.13
Cortexone (deoxycorticosterone) 330.45 2.88 2.24 4.44 41.0 -3.35
Corticosterone 346.45 1.94 1.7 5.28 37.0 -4.22
Cortisone 360.46 1.42 1.75 5.44 41.0 -5.0
o-cresol 108.14 1.95 3.13 1.86 15.5 -1.8
m-cresol 108.14 1.82 3.08 1.83 15.5 -1.82
p-cresol 108.14 1.95 3.03 1.83 15.5 -1.75
n-decanol 158.28 4.0 2.69 3.0 10.0 -1.1
2,4-dichlorophenol 127.55 3.08 3.17 2.03 16.5 -1.22
Diethylcarbamazine 199.29 1.89 2.44 3.39 17.0 -3.89
Digitoxin 764.92 1.86 0.83 11.16 67.0 -4.89
Ephedrine 165.23 1.03 2.71 2.89 19.5 -2.23
Estradiol 272.37 2.69 1.71 4.15 30.5 -3.52
Estriol 288.37 2.47 1.71 4.35 31.5 -4.4
Estrone 270.36 2.76 1.73 4.15 33.5 -2.44
Ethanol 46.07 -0.31 1.63 1.0 2.0 -3.1
Ethyl benzene 106.12 3.15 2.83 2.0 9.0 3.15
Ethyl ether 74.12 0.83 2.19 1.5 15.5 1.8
4-ethylphenol 122.17 2.4 2.88 2.17 4.0 -1.46
Etorphine 411.5 2.76 1.66 6.26 16.5 -2.1
Fentanyl 336.5 4.37 1.57 5.75 45.5 -3.15
Fluocinonide 494.55 3.19 1.62 7.06 45.0 -2.77
Heptanoic acid (enanthic acid) 130.18 2.5 2.83 2.33 54.0 -1.7
n-heptanol 116.2 2.72 2.53 2.25 11.0 -1.5
Hexanoic acid (caproic acid) 116.16 1.9 2.82 2.08 7.0 -1.85
n-hexanol 102.18 2.08 2.45 2.0 10.0 -1.89
Hydrocortisone 362.47 1.53 1.74 5.44 6.0 -5.52
Hydrocortisone-21-yl-N, N dimethyl succinamate 489.6 2.03 1.51 7.69 38.0 -4.17
Hydrocortisone-21-yl-hemipimelate 504.6 3.26 1.41 7.75 47.0 -2.75
Hydrocortisone-21-yl-hemisuccinate 462.5 2.11 1.53 7.0 63.0 -3.2
Hydrocortisone-21-yl-hexanoate 460.6 4.48 1.52 6.75 60.0 -1.75
Hydrocortisone-21-yl-octanoate 488.7 5.49 1.44 7.25 54.0 -1.21
hydrocortisone-21-yl-proprionate 418.5 3.0 0.0 6.0 54.0 -2.47
Hydromorphone 285.3 1.25 1.83 4.29 49.0 -4.82
Hydroxypregnenolone 330.45 3.0 2.3 4.67 35.5 -3.22
17a-hydroxyprogesterone 330.45 2.74 1.67 4.91 42.0 -3.22
Isoquinoline 129.15 2.03 2.89 2.28 36.0 -1.78
Meperidine 247.0 2.72 2.31 4.17 24.75 -2.43
Methanol 32.04 -0.77 1.0 1.0 29.5 -3.3
Methyl-[hydrocortisone-21-yl succinate 476.6 2.58 0.0 7.25 1.0 -3.68
Methyl-4-hydroxy benzoate 152.14 1.96 2.75 2.86 61.0 -2.04
Morphine 285.3 0.62 1.84 4.29 22.5 -5.03
2-naphthol 144.16 2.84 2.83 2.44 35.5 -1.55
Naproxen 230.26 3.18 2.48 3.83 25.75 -3.4
Nicotine 162.23 1.17 2.19 2.72 34.75 -1.71
Nitroglycerine 227.09 2.0 3.99 3.5 20.5 -1.96
3-nitrophenol 139.11 2.0 3.07 2.28 23.0 -2.25
4-nitrophenol 139.11 1.96 3.0 2.28 20.5 -2.25
n-nitrosodiethanolamine 134.13 -1.05 3.29 2.5 20.5 -5.22
n-nonanol 144.26 3.62 2.65 2.75 11.0 -1.22
Octanoic acid (caprylic acid) 144.21 3.0 2.84 2.58 9.0 -1.6
n-octanol 130.23 2.97 2.6 2.5 12.0 -1.28
Ouabain 584.64 -1.51 1.28 8.53 8.0 -6.11
Pentanoic acid (valeric acid) 102.13 1.3 2.82 1.83 52.0 -2.7
n-pentanol 88.15 1.56 2.34 1.75 9.0 -2.22
Phenobarbital 232.23 1.47 2.54 3.88 5.0 -3.34
Phenol 94.11 1.46 3.02 1.67 34.5 -2.09
Pregnenolone 316.47 3.13 1.66 4.75 14.5 -2.82
Progesterone 314.45 3.77 1.67 4.75 32.0 -2.82
n-propanol 60.1 0.25 1.97 1.25 35.0 -2.85
Resorcinol 110.11 0.8 3.08 1.83 3.0 -3.62
Salicylic acid 138.12 2.26 3.15 2.31 15.5 -2.2
Scopolamine 303.35 1.24 1.42 4.78 20.5 -4.3
Styrene 104.1 2.95 2.99 2.0 35.5 -0.19
Sucrose 342.3 -2.252 2.18 5.29 18.5 -5.28
Sufentanyl 387.5 4.59 1.74 6.26 24.0 -1.92
Testosterone 288.41 3.31 1.69 4.31 45.75 -3.4
Thymol 150.21 3.34 3.09 2.47 30.0 -1.28
2,4,6-trichlorophenol 162.0 3.69 3.31 2.22 17.5 -1.23
Water 18.01 -1.38 0.0 0.0 0.0 -3.3
3,4-xylenol 122.17 2.35 3.17 2.03 22.5 -1.44

Data preprocessing was performed to ensure the data were suitably prepared for training the various models utilized in this study.[84] A dataset was generated to analyze the impact of various molecular descriptors (Balaban distance connectivity index, Harary Index, Ramification Index) as well as the partition coefficient and the MW on the permeability coefficient. The indices were calculated in Python using RDKit techniques. The features included the MW, the Forgotten Index, the Balaban distance connectivity index, the Harary index, the Ramification index, the partition coefficient, while the target was the permeability coefficient (logKp). Regression analysis was conducted through Gradient Boost Regression, SVR, and the LMA. All computations were performed in a JupyterLab environment[85] with Python code.

Data splitting and model training

The data included in this investigation were divided into training and testing subgroups. The partitioning of the dataset into training and test samples is critically important.[86] Initially, it mitigates the risk of overfitting and enhances the generalizability and dependability of the models.[86] Furthermore, by evaluating the performance of several models and parameter combinations on the test set, the optimal model and parameters can be identified.[86] Nonetheless, dataset splitting possesses several possible drawbacks.[86] An uneven data distribution between the training set and the test set might adversely impact the proper assessment of model performance.[86] The training set was used to construct the models in the machine learning method, while the test set was applied to assess the accuracy and reliability of the models.[85]

Regression and optimization

Gradient boost regression (GBR) techniques documented in the literature often operate on precise data.[87] In the GBR model, basic predictors are frequently represented by simple DTs, sometimes referred to as decision stumps.[87] The model’s predictive power enhances with an increase in the number of boosting rounds.[87] In GB, DTs serve as the fundamental models.[29] The gradient boost regressor, SV regressor, and LMA were employed to predict the target (skin permeability coefficient) using a collection of features (MW, Log Kow, topological indices).

Actual and predicted values

The actual and anticipated values derived from the application of the three approaches (GBR, SVR, and LMA) were analyzed. The coefficient of determination (R2) and the mean square error (MSE) values were used to assess the accuracy of the prediction. When gradient boost regression was employed, the MSE was 0.45 while the R-squared was 0.84. Following the use of SVR, the mean squared error (MSE) was 0.6209, and R2 was 0.7793. The application of the LMA resulted in an MSE of 0.94 and an R2 of 0.58.

RESULTS

Simulation and modeling were conducted with techniques from stochastic calculus. Figure 2 shows a parity plot that compares observed log Kp values with the EM mean predictions. The plot from the EM method shows strong agreement between observed and predicted values across the logKp range (−6 to 0). Most predictions cluster near the identity line, with modest deviation at extreme values (very low permeability compounds). The data suggest the EM approach accurately captures permeability trends. The slight underprediction at extremes can be attributed to the limits of molecular descriptors or insufficient sampling in high-variance regions. An MSE of approximately 0.43 and an R2 of about 0.82 confirms the predictive accuracy of the EM model that is comparable to deterministic QSPR models, with the added benefit of variance estimates. By incorporating stochastic variability, the EM method avoids overconfidence in permeability predictions. This is especially critical for risk–benefit analysis in regulatory submissions, where uncertainty quantification strengthens robustness. The EM approach offers several key benefits. Rather than providing only a point estimate, the EM simulation yields a predictive distribution, allowing calculation of credible intervals for log Kp. Second, using a QSPR-based regression as the drift term, the method merges mechanistic insights from molecular descriptors with the flexibility of stochastic calculus. The method can be extended to multi-zone skin models and time-dependent release kinetics. Despite these advantages, some limitations should be recognized. Accurate estimation of theta and sigma is essential; poor calibration can distort both mean predictions and uncertainty estimates.

Parity plot that compares observed logKp values with the Euler–Maruyama (EM) mean predictions.
Figure 2:
Parity plot that compares observed logKp values with the Euler–Maruyama (EM) mean predictions.

When using the Milstein method, a Random Forest (RF) model was used to provide baseline predictions of logKp. Residual variance defined the diffusion term. Milstein simulations (2000 paths per compound) generated distributions of predicted logKp values, providing both mean predictions and confidence intervals. The Milstein stochastic augmentation preserves nearly all of the baseline model’s predictive accuracy while adding a probabilistic framework that provides confidence intervals and quantifies uncertainty. The very small changes in R2 and MSE show that incorporating the Milstein SDE does not degrade the core prediction quality; instead, it enriches the model with meaningful measures of variability that the deterministic baseline cannot supply. Minor deviations at the extremes (e.g., highly negative log Kp) reflect increased difficulty in modeling compounds with very low permeability, where experimental variability is greatest. The plot supports the conclusion that the EM mean retains the accuracy of the baseline regression while offering additional uncertainty quantification. Figure 3 (observed vs. predicted logKp, Milstein mean) compares experimental logKp values on the x-axis to the mean of the Milstein simulations on the y-axis. The dashed 45° line represents perfect agreement. Most points lie close to this line, indicating that the Milstein approach accurately reproduces experimental permeability coefficients. The scatter around the line reflects both biological variability and model error, and is consistent with the 95% confidence intervals derived from the stochastic simulations. Figure 4 (predictive uncertainty) shows a histogram of the widths of the 95% confidence intervals for predicted logKp values. Most intervals are about 1.7–1.9 log units wide, illustrating that the Milstein method provides a stable quantification of predictive uncertainty across different compounds. This information is valuable for risk assessment and for selecting candidates with acceptable variability in skin permeation.

Observed versus predicted logKp, Milstein mean.
Figure 3:
Observed versus predicted logKp, Milstein mean.
Predictive Uncertainty: a histogram of the widths of the 95% confidence intervals (CI) for predicted logKp values.
Figure 4:
Predictive Uncertainty: a histogram of the widths of the 95% confidence intervals (CI) for predicted logKp values.

The Milstein Sample Paths [Figure 5] show multiple stochastic trajectories of logKp versus time for a representative compound. Each colored line represents one Milstein-simulated path, while the dashed horizontal line marks the experimental logKp. The ensemble of paths captures the stochastic fluctuations around the mean prediction and confirms that the experimental value lies within the simulated envelope. This illustrates how the MS captures the inherent variability of the diffusion process. The Milstein method provides several advantages over simpler stochastic schemes. It accounts for state-dependent noise through the inclusion of the derivative term b’(X), offering improved strong convergence. For skin permeability modeling, this means that variability in logKp predictions scales realistically with molecular properties. The predictive distributions and confidence intervals produced enable more informed decisions in pharmaceutical formulation and toxicological risk analysis. Nonetheless, accurate estimation of the drift and diffusion functions remains essential, and computational demands increase with the number of simulated paths.

The Milstein sample paths showing multiple stochastic trajectories of logKp versus time for a representative compound.
Figure 5:
The Milstein sample paths showing multiple stochastic trajectories of logKp versus time for a representative compound.

Results of the Heston model [Figure 6] demonstrated improved handling of uncertainty, with predictive intervals aligning more closely to observed experimental spread. Evaluation metrics of the model included the mean absolute error (MAE), mean squared error (MSE), and the coefficient of determination (R2). The Heston model offers a promising pathway to incorporate stochastic dynamics into permeability predictions, bridging mathematics and percutaneous penetration. Optimal parameters (e.g., κy≈1.2, κv≈0.8, θv≈0.2) yielded predictive distributions consistent with observed scatter. The stochastic variance term captured a wider experimental spread in permeability. The Heston model achieved a mean standard error (MSE) of 0.66 and an R2 of 0.72. Compared with Ridge regression, the Heston framework improved both accuracy and uncertainty calibration. Predictive intervals encompassed ~92% of observed test values, consistent with nominal 90% coverage. The Heston stochastic volatility model introduces a dual dynamic: mean reversion toward molecular-descriptor-based permeability predictions, and variance evolution capturing unexplained variability. This framework mirrors biological realities: skin barrier variability, localized differences in stratum corneum thickness, and fluctuating hydration all introduce heteroskedasticity.

Observed versus predicted logKp values through the Heston model.
Figure 6:
Observed versus predicted logKp values through the Heston model.

Figure 7 compares the predicted versus true Log Kp values obtained using a Gradient Boosting Regression (GBR) model. The points are distributed along a diagonal trend, suggesting a generally good correlation between model predictions and experimental (true) values. The GBR model successfully captures the nonlinear relationships among molecular descriptors. Minor deviations from the diagonal line imply some residual error for compounds with extreme Log Kp values (i.e., highly lipophilic or hydrophilic compounds). Overall, the model exhibits robust predictive performance and generalization capability, typical of ensemble tree-based methods that reduce bias through boosting successive weak learners. The GBR model effectively balances variance and bias through its boosting iterations, making it suitable for QSPR modeling of permeability. Figure 8 presents a similar scatter plot comparing actual (experimental) and predicted Log Kp values using an SVR model. The SVR algorithm employs an RBF kernel that aims to fit the data within a defined error margin (ε-insensitive tube). Compared to GBR, SVR appears to slightly underpredict at higher Log Kp values, which may result from kernel smoothing effects or suboptimal hyperparameter tuning. Figure 9 compares observed (experimental) and predicted Log Kp values obtained through non-LR using the LMA–a hybrid approach combining the gradient descent and Gauss– Newton methods. The red dashed line represents the ideal correlation (y = x). The tighter clustering of points near the central line suggests good predictive precision for mid-range permeability compounds but slightly reduced sensitivity at the extremes. Unlike machine-learning regressors, the LMA-based nonlinear fit emphasizes parametric optimization, typically using sigmoidal or exponential models derived from Fickian diffusion kinetics. The consistency across a wide range of Log Kp values suggests that the model captures underlying physicochemical phenomena (e.g., molecular size, lipophilicity, and polarity) influencing permeability.

LogKp predictions using the gradient boosting regression.
Figure 7:
LogKp predictions using the gradient boosting regression.
Actual versus predicted LogKp values using the support vector regression.
Figure 8:
Actual versus predicted LogKp values using the support vector regression.
Observed versus predicted logKp values through the Levenberg–Marquardt algorithm.
Figure 9:
Observed versus predicted logKp values through the Levenberg–Marquardt algorithm.

DISCUSSION

Permeability models published in the literature emphasize deterministic QSPR.[8,88] Stochastic modeling aligns with emerging views that variability is not just noise but an intrinsic property of the drug absorption processes. We also used machine learning in our study. Machine learning seeks to employ algorithms to identify patterns within data samples, therefore creating models with generalization abilities that can “learn by analogy.”[89] A different method has recently been utilized for transdermal data. Hamadeh et al. employed a systematic methodology to accurately estimate the skin permeability of a topically applied chemical, utilizing descriptors of the permeant, vehicle, and skin conditions, alongside mechanistic insights into permeability across a finely discretized dermal strata.[90] The approach focuses on a direct computation of permeability derived from the mechanical characteristics of the permeation context.[90] The computational method can be seamlessly incorporated into algorithms for parameter discovery, optimization, and sensitivity analysis.[90]

Accurate forecasting of skin permeability is crucial for efficient transdermal drug delivery (TDD).[91] Abdallah et al., addressed this critical necessity to enhance TDD.[91] A dataset of 441 records for 140 substances with diverse LogKp values was examined.[91] The descriptor computation yielded 145 relevant descriptors.[91] Regression analysis was performed utilizing machine learning models, including RF, CatBoost, Multiple LR, XGBoost, LightGBM (LGBM), and Artificial Neural Networks (ANN).[91] XGBoost, LGBM, and gradient boost models demonstrated enhanced predictive accuracy relative to alternative models.[91] In the models, the key attributes with significant impact on skin permeability were hydrogen bond donors, hydrophobicity, topological polar surface area, and hydrogen bond acceptors.[91]

Baba et al. executed several compelling experiments, compiling an extensive dataset (322 entries) of in vitro human skin permeability coefficients.[92] From a chemical standpoint, this was a diverse set of penetrants. Subsequently, the authors created a dataset of several descriptors and analyzed the compounds through Gaussian process regression (GPR) and nonlinear SVR.[92] The SVR model exhibited slight superiority over the GPR model, with root mean square error, squared correlation coefficient, and MAE values of 0.29, 0.94, and 0.21, respectively.[92] Zeng et al. developed a three-descriptor quantitative structure–activity/toxicity relationship model to evaluate the skin permeability of 274 compounds.[93] The authors used an SV machine (SVM) with a genetic algorithm (GA) to analyze the dataset.[93] The investigators evaluated a training set of 139 chemicals and reported that the SVM model had a root mean square error of 0.253 and a coefficient of determination of 0.946.[93] A root mean square error of 0.302 and an R2 of 0.872 were obtained for a test set of 135 compounds.[93]

Rezaei et al. developed a 3D-QSPR model to evaluate the influence of the physicochemical characteristics of 211 compounds on skin permeability.[94] The Kennard-Stone method was used to divide the dataset into a test set of 52 molecules and a training set of 159 molecules.[94] GA, fractional factorial design, and successive projection algorithm (SPA) were utilized to ascertain the most important three-dimensional molecular structure.[94] The indices chosen through different feature selection techniques were correlated with skin permeability parameters utilizing SVM and partial least squares methods.[94] The SPA-SVM model produced a correlation coefficient of 0.96, a Q2 of 0.73, and an R2 of 0.76.[94] The hydrogen bonding acceptor and donor properties of the analyzed compounds may impact their skin transport.[94] Interestingly, it was shown that permeability increased with increased hydrophobicity and diminished with higher MW.[94] Hydrophobicity in the target molecule, as well as the configuration and location, exerted a significant impact on skin permeability.[94]

A recent study sought to assess the effectiveness of nonlinear and linear models in forecasting the skin penetration of 11 compounds.[95] The authors utilized machine learning based on the Gaussian method as well as QSPR to forecast the permeability coefficients of the compounds.[95] Transdermal absorption was evaluated using Franz cells and a standardized protocol that incorporated high-performance liquid chromatography.[95] A statistical analysis comparing expected and experimentally derived values was performed using MSE and the Pearson sample correlation coefficient.[95] The study’s findings revealed that the models poorly anticipated permeation, with certain instances deviating by two or three orders of magnitude from the experimentally obtained values.[95] Nonetheless, this assortment of compounds allowed the models to precisely rank the permeants.[95] The scientists concluded that although the models are unsuitable for accurate permeation predictions, they can successfully establish a rank order of permeation, facilitating the selection of candidate compounds for in vitro screening.[95] It is crucial to recognize that these predictions must take into account the true relative potencies of pharmaceutical alternatives.[95]

Lindh et al. utilized an aggregated conformal prediction (ACP) framework to predict the skin transport (log Kp) of compounds.[96] ACP offers a prediction interval at a set confidence level for each molecule, enhancing informed decision-making, such as suggesting the next product for synthesis.[96] The models were developed using SVM and RF techniques, based on empirically derived skin transport data from 211 different compounds.[96] The resulting models exhibited comparable predictive parameters compared to previously published models, while additionally offering a reliable, individually defined prediction range for each chemical, rather than a single predicted value.[96] The approach employs calculated indices and can swiftly predict the skin penetration rate of new chemicals.[96]

Gradient boosting (GB) is a potent technique used in machine learning for optimization.[26] Leo Breiman created the technique as a “arcing method” to reduce variability in classification.[26] Jerome Friedman also employed classification and regression.[26] GBR develops models incrementally, with each new model addressing the deficiencies of its forerunners.[97] This approach often yields improved predictive precision. Gradient boosting regression (GBR) amalgamates the strengths of multiple weak learners, especially DTs, to alleviate overfitting and improve generalization.[97] In machine learning, GBR adeptly models complex, non-linear relationships between targets and features, making it more versatile and effective for diverse datasets.[97] The method includes factors such as tree complexity and learning rate, which help mitigate overfitting and ensure effective generalization to new data.[97] Gradient boosting regression (GBR) can be used to handle various data types, including numerical and categorical variables, making it a suitable choice for numerous applications.[97]

SVR is an efficient machine learning method that uses SVs to determine a hyperplane that best accommodates the data while minimizing the margin.[98] SVR can be used to investigate nonlinear data and has diminished susceptibility to outliers relative to LR.[98] SVR prioritizes the overarching trend of the data, disregarding specific outliers that significantly deviate, hence improving predictions.[98] It uses several kernel functions such as sigmoid, polynomial, linear, and RB.[98] This adaptability enables the modeling of diverse data patterns and can improve forecast accuracy.[98] The LMA is a non-LR technique utilized in ANN to tackle various problems.[99] The LMA is a conventional optimization method that combines the Gauss–Newton approach with the gradient descent methodology.[99] The calculation of the Jacobian matrix is essential in the LMA, affecting its convergence and performance.[99]

Three regression models were utilized to evaluate the relationship between permeability coefficient (target) and different features – MW, partition coefficient, Balaban distance connection index, Harary index, and Forgotten index. We applied machine learning methodologies to a revised Flynn dataset. We introduced the Forgotten index as an independent variable. After applying GBR, the R2 was 0.84, and the mean square error (MSE) was 0.45. SVR with RBF kernel is a non-linear model that maps input features into a higher-dimensional space through a Gaussian kernel. This allows the model to discern complex, non-linear relationships between targets and features. The RBF kernel is commonly utilized in SVR because it may model many relationships without requiring the specification of the non-linearity’s structure. In the implementation of SVR, the R2 and mean squared error (MSE) were measured at 0.78 and 0.62, respectively. The SVR model was created to predict the logarithm of skin permeability (logkp) using six aforementioned features. The SVR model utilized an RBF kernel, suitable for nonlinear relationships. The R2 and MSE values were 0.58 and 0.94, respectively, following the use of the LMA. This method is an optimization method designed for non-linear least squares problems. The efficient identification of a model from a large dataset is a function of parameter optimization.[36] Gradient-based optimization methods are commonly utilized for nonlinear systems because of their high computational efficiency and strong reliability.[36]

The EM method enhances the predictive modeling of skin permeability by embedding random variability directly into the forecasting process. The presented figures – parity plot, residuals versus prediction, and predictive uncertainty distribution – collectively demonstrate that EM mean predictions align closely with experimental log Kp values, that residuals are largely unbiased, and that predictive uncertainty is well characterized. This approach provides pharmaceutical scientists and biomedical engineers with both accurate point estimates and rigorous quantification of uncertainty, thereby supporting more informed decisions in the design of TDD systems. We also used the Milstein approach. By incorporating the MS into QSPR modeling of skin permeability, researchers can move beyond point estimates to full probabilistic forecasts. This approach enhances the robustness of permeability predictions and provides a quantitative framework for uncertainty, aligning modeling practices with the inherent stochasticity of skin transport processes. The Heston method provides a rigorous framework for probabilistic prediction of skin permeability. Theoretical predictions regarding the cutaneous transport of chemicals are crucial for the development of transdermal therapeutic systems as well as the risk assessment of dangerous substances. In this project, we have used methods from stochastic calculus and machine learning.

The accurate prediction of skin permeability coefficients using SDE methods (EM algorithm, MS, and Heston model) and machine learning approaches (LMA, gradient boost regression, and SVR) holds significant potential for optimizing TDD systems across various therapeutic indications. In the context of chronic pain management, for instance, these predictive models can guide the rational design of transdermal patches by enabling formulators to screen candidate analgesic compounds based on their predicted permeation characteristics, thereby identifying molecules with optimal physicochemical properties for passive diffusion through the stratum corneum. The stochastic methods, particularly the EM algorithm with its superior predictive performance (R2 = 0.81), can account for the inherent biological variability in skin permeability arising from differences in skin hydration, lipid composition, and regional anatomical variations among patients, thus providing more realistic permeability estimates that inform dosing regimen design. Meanwhile, machine learning approaches such as gradient boosted regression, which demonstrated the highest predictive accuracy (R2 = 0.84), can rapidly evaluate extensive compound libraries during early-stage formulation development, enabling pharmaceutical scientists to prioritize lead candidates and optimize excipient selection to enhance drug penetration. These predictive tools facilitate the determination of appropriate drug loading concentrations, patch surface areas, and release kinetics required to achieve therapeutic plasma levels while minimizing local irritation and systemic toxicity. Furthermore, by correlating molecular descriptors such as MW, partition coefficient (log Kow), and topological indices with permeability outcomes, formulators can strategically modify drug molecules through prodrug approaches or select appropriate permeation enhancers that temporarily and reversibly alter skin barrier function. The integration of these computational methods into pharmaceutical development workflows ultimately accelerates the formulation optimization process, reduces the need for extensive in vitro and in vivo permeation studies, and supports the development of patient-friendly transdermal delivery systems for indications ranging from hormone replacement therapy and smoking cessation to the management of cardiovascular diseases and neurological disorders.

CONCLUSION

In this research project, the EM algorithm, the MS, and the Heston stochastic model were used to predict the permeability coefficient. Following the use of the EM method, the MSE was 0.5259, and the R2 was 0.81. When the Milstein method was used, the MSE was 0.634, and the R2 was 0.74. When we employed the Heston model, the mean standard error (MSE) was 0.66, and the R2 was 0.72. Three other regression techniques (gradient boosting regression, SVR, and the LMA) were also employed to analyze skin permeation data. When gradient boost regression was utilized, the MSE was 0.45 while the R-squared was 0.84. Following the use of SVR, the mean squared error (MSE) was 0.62, and R2 was 0.78. The application of the LMA resulted in an MSE of 0.94 and an R2 of 0.58. Optimization of machine learning algorithms augments predictive accuracy and diminishes the time, resources, and labor expended on skin transport studies. The integration of SDE methods (particularly EM) with machine learning approaches (especially Gradient Boost Regression) provides a comprehensive framework for skin permeability prediction that balances predictive accuracy with the ability to model inherent biological variability, ultimately supporting rational TDD system design.

Ethical approval:

Institutional Review Board approval is not required.

Declaration of patient consent:

Patient’s consent not required as there are no patients in this study.

Conflicts of interest:

There is no conflicts of interest.

Use of artificial intelligence (AI)-assisted technology for manuscript preparation:

The authors confirm that there was no use of artificial intelligence (AI)-assisted technology for assisting in the writing or editing of the manuscript, and no images were manipulated using AI.

Financial support and sponsorship: Nil.

References

  1. , , . Surging footprints of mathematical modeling for prediction of transdermal permeability. Asian J Pharm Sci. 2017;12:299-325.
    [CrossRef] [PubMed] [Google Scholar]
  2. , , , , , , et al. Prediction of the skin permeability of topical drugs using in silico and in vitro models. Eur J Pharm Sci. 2019;136:104945.
    [CrossRef] [PubMed] [Google Scholar]
  3. , , , , , , et al. Quantitative structure-skin permeability relationships. Toxicology. 2017;387:27-42.
    [CrossRef] [PubMed] [Google Scholar]
  4. , , . Predicting skin permeability of chemical substances using a quantitative structure-activity relationship. Proced Eng. 2012;45:875-9.
    [CrossRef] [Google Scholar]
  5. , , , , , . Random sampling or 'random' model in skin flux measurements? [Commentary on “investigation of the mechanism of flux across human skin in vitro by quantitative structure-permeability relationships”] Eur J Pharm Sci. 2001;14:197-200.
    [CrossRef] [PubMed] [Google Scholar]
  6. , , , . Evaluating molecular properties involved in transport of small molecules in stratum corneum: A quantitative structure-activity relationship for skin permeability. Molecules. 2018;23:911.
    [CrossRef] [PubMed] [Google Scholar]
  7. , , , . First report on QSAR modelling for chemical penetration enhancement ratio (ER) of different FDA-approved drugs in Poloxamer 407: A next step towards better skin permeability of drugs. Int J Pharm. 2025;669:125083.
    [CrossRef] [PubMed] [Google Scholar]
  8. , . Predicting skin permeability. Pharm Res. 1992;9:663-9.
    [CrossRef] [PubMed] [Google Scholar]
  9. , . The effect of noise on the predictive limit of QSAR models. J Cheminform. 2021;13:92.
    [CrossRef] [PubMed] [Google Scholar]
  10. , , . Identification and description of the uncertainty, variability, bias and influence in quantitative structure-activity relationships (QSARs) for toxicity prediction. Regul Toxicol Pharmacol. 2019;106:90-104.
    [CrossRef] [PubMed] [Google Scholar]
  11. , , , , , , et al. Evaluation of in vitro/in vivo correlations for three fentanyl transdermal delivery systems using in vitro skin permeation testing and human pharmacokinetic studies under the influence of transient heat application. J Control Release. 2022;342:134-47.
    [CrossRef] [PubMed] [Google Scholar]
  12. , , . Noninvasive measurement of transdermal drug delivery by impedance spectroscopy. Sci Rep. 2017;7:44647.
    [CrossRef] [PubMed] [Google Scholar]
  13. , , , , . Milstein-driven neural stochastic differential equation model with uncertainty estimates. Pattern Recognit Lett. 2023;174:71-7.
    [CrossRef] [Google Scholar]
  14. , , . Identifying uncertainty in physical-chemical property estimation with IFSQSAR. J Cheminform. 2024;16:65.
    [CrossRef] [PubMed] [Google Scholar]
  15. , , , . Stochastic delay differential equations: A comprehensive approach for understanding biosystems with application to disease modelling. Appl Math. 2023;3:702-21.
    [CrossRef] [Google Scholar]
  16. , , , . Hybridization of stochastic hydrological models and machine learning methods for improving rainfall-runoff modeling. Results Eng. 2025;25:104079.
    [CrossRef] [Google Scholar]
  17. . Review: The prevailing mathematical modeling classifications and paradigms to support the advancement of sustainable animal production. Animal. 2023;17:100813.
    [CrossRef] [PubMed] [Google Scholar]
  18. , , , , , , et al. Data-driven discovery of stochastic differential equations. Engineering. 2022;17:244-52.
    [CrossRef] [Google Scholar]
  19. , . Exploring behaviors of stochastic differential equation models of biological systems using change of measures. BMC Bioinformatics. 2012;13(Suppl 5):S8.
    [CrossRef] [PubMed] [Google Scholar]
  20. , , , . Statistical inference for stochastic differential equations. WIREs Comput Stat. 2023;15:e1585.
    [CrossRef] [Google Scholar]
  21. , , . Numerical methods for simulation of stochastic differential equations. Adv Differ Equ. 2018;2018:17.
    [CrossRef] [Google Scholar]
  22. , , . On the convergence of the Euler-Maruyama method and Milstein scheme for the solution of stochastic differential equations. Int J Appl Math Model. 2013;1:9-15.
    [Google Scholar]
  23. , . A new simple tree approach for the Heston's stochastic volatility model. Comput Math Appl. 2019;78:1993-2010.
    [CrossRef] [Google Scholar]
  24. , , , , . An Euler-Maruyama method for diffusion equations with discontinuous coefficients and a family of interface conditions. J Comput Appl Math. 2020;368:112545.
    [CrossRef] [Google Scholar]
  25. . A closed-form solution for options with stochastic volatility with applications to bond and currency options. Rev Financ Stud. 2005;6:382-97.
    [CrossRef] [Google Scholar]
  26. , , , , , , et al. Gradient boosting: A computationally efficient alternative to markov chain monte carlo sampling for fitting large Bayesian spatio-temporal binomial regression models. Infect Dis Model. 2025;10:189-200.
    [CrossRef] [PubMed] [Google Scholar]
  27. , , , , . Application of gradient boosting regression model for the evaluation of feature selection techniques in improving reservoir characterisation predictions. J Pet Sci Eng. 2022;208:109244.
    [CrossRef] [Google Scholar]
  28. , , , , , , et al. Machine learning regression algorithms to predict emissions from steam boilers. Heliyon. 2024;10:e26892.
    [CrossRef] [PubMed] [Google Scholar]
  29. , . Modeling and prediction of biodiesel production by using different artificial intelligence methods: Multi-layer perceptron (MLP), Gradient boosting (GB), and Gaussian process regression (GPR) Arab J Chem. 2023;16:104801.
    [CrossRef] [Google Scholar]
  30. , , . A novel framework for crash frequency prediction: Geographic support vector regression based on agent-based activity models in Greater Melbourne. Accid Anal Prev. 2024;207:107747.
    [CrossRef] [PubMed] [Google Scholar]
  31. , , , , . Simulation study of ionic current distribution and Li dendrite growth on the anode in lithium-ion batteries using support vector regression machine learning. J Energy Storage. 2024;92:112115.
    [CrossRef] [Google Scholar]
  32. , , . Convex support vector regression. Eur J Oper Res. 2024;313:858-70.
    [CrossRef] [Google Scholar]
  33. , , . Multilayer perceptron and support vector regression models for feline parturition date prediction. Heliyon. 2024;10:e27992.
    [CrossRef] [PubMed] [Google Scholar]
  34. , , , , , , et al. Support vector regression for prediction of stable isotopes and trace elements using hyperspectral imaging on coffee for origin verification. Food Res Int. 2023;174:113518.
    [CrossRef] [PubMed] [Google Scholar]
  35. , , . Calibrating lab and field reflectance spectra for nutrient estimation in potato plants using local support vector regression models. Smart Agric Technol. 2024;8:100492.
    [CrossRef] [Google Scholar]
  36. , , . Hierarchical recursive LevenbergMarquardt algorithm for radial basis function autoregressive models. Inf Sci. 2023;647:119506.
    [CrossRef] [Google Scholar]
  37. , , , , . Performance evaluation of support vector regression machine models in water level forecasting. Proced Comput Sci. 2024;234:436-47.
    [CrossRef] [Google Scholar]
  38. , , , , , , et al. Efficient and robust Levenberg-marquardt algorithm based on damping parameters for parameter inversion in underground metal target detection. Comput Geosci. 2023;176:105354.
    [CrossRef] [Google Scholar]
  39. , , , , . Solving an inverse problem with four unknown boundary conditions in a lid-driven cavity with heated walls using the levenberg-marquardt method. Applications in Eng Sci. 2025;23:100245.
    [CrossRef] [Google Scholar]
  40. , . Partial derivatives and jacobian matrix In: Encyclopedia of Electrical and Electronic Power Engineering. Oxford: Elsevier; . p. :610-3.
    [CrossRef] [Google Scholar]
  41. , , . Comparison of supercritical fluid chromatographic methods to predict the skin permeability of pharmaceutical and cosmetic compounds. J Chromatogr A. 2023;1692:463855.
    [CrossRef] [PubMed] [Google Scholar]
  42. , , , , , . Exploring expected values of topological indices of random cyclodecane chains for chemical insights. Sci Rep. 2024;14:10065.
    [CrossRef] [PubMed] [Google Scholar]
  43. , , . On entropy measures of molecular graphs using topological indices. Arab J Chem. 2020;13:6285-98.
    [CrossRef] [Google Scholar]
  44. , , , . Numerical and scientific investigation of some molecular structures based on the criterion of super classical average assignments. J Math. 2024;2024:9360076.
    [CrossRef] [Google Scholar]
  45. . Chemical graphs, molecular matrices and topological indices in chemoinformatics and quantitative structure-activity relationships. Curr Comput Aided Drug Des. 2013;9:153-63.
    [CrossRef] [PubMed] [Google Scholar]
  46. , , , , . On physical analysis of topological indices and entropy measures for porphyrazine structure using logarithmic regression model. Sci Rep. 2024;14:28669.
    [CrossRef] [PubMed] [Google Scholar]
  47. , , . Wiener index of some graph operations. Discrete Appl Math. 2012;160:1333-44.
    [CrossRef] [Google Scholar]
  48. , , . Randic index and information. AKCE Int J Graphs Comb. 2018;15:307-12.
    [CrossRef] [Google Scholar]
  49. , , , . Computing the narumi-katayama index and modified narumi-katayama index of some families of dendrimers and tetrathiafulvalene. J Math. 2021;2021:8492991.
    [CrossRef] [Google Scholar]
  50. , , , , , . The degree-based topological indices for two special families of graphs of diameter three. Adv Math Phys. 2021;2021:4051026.
    [CrossRef] [Google Scholar]
  51. , , , . Molecular topological invariants of certain chemical networks. Main Group Metal Chem. 2021;44:141-9.
    [CrossRef] [Google Scholar]
  52. , , , . Degree-based topological indices and polynomials of hyaluronic acid-curcumin conjugates. Saudi Pharm J. 2020;28:1093-100.
    [CrossRef] [PubMed] [Google Scholar]
  53. , , . On distance-based topological indices and co-indices of fractal-type molecular graphs and their respective graph entropies. PLoS One. 2023;18:e0290047.
    [CrossRef] [PubMed] [Google Scholar]
  54. , , , , , , et al. On distance-based topological descriptors of chemical interconnection networks. J Math. 2021;2021:5520619.
    [CrossRef] [Google Scholar]
  55. , , . Quality testing of spectrum-based distance descriptors for polycyclic aromatic hydrocarbons with applications to carbon nanotubes and nanocones. Arab J Chem. 2021;14:102994.
    [CrossRef] [Google Scholar]
  56. , , , . Role of topological indices in predictive modeling and ranking of drugs treating eye disorders. Sci Rep. 2025;15:1271.
    [CrossRef] [PubMed] [Google Scholar]
  57. . Sharp lower bounds on the Narumi-Katayama index of graph operations. Appl Math Comput. 2014;239:409-21.
    [CrossRef] [Google Scholar]
  58. , , , . Several asymptotic bounds on the balaban indices of trees. Math Probl Eng. 2020;2020:2081241.
    [CrossRef] [Google Scholar]
  59. , , , . Group theoretic approach towards the balaban index of catacondensed benzenoid systems and linear chain of anthracene. Symmetry. 2024;16:996.
    [CrossRef] [Google Scholar]
  60. , . On the additively weighted harary index of some composite graphs. Mathematics. 2017;5:16.
    [CrossRef] [Google Scholar]
  61. , , . Design of topological indices. Part 4: Reciprocal distance matrix, related local vertex invariants and topological indices. J Math Chem. 1993;12:309-18.
    [CrossRef] [Google Scholar]
  62. , , , . On the harary index for the characterization of chemical graphs. J Math Chem. 1993;12:235-50.
    [CrossRef] [Google Scholar]
  63. . Harary index, binding number and toughness of graphs. Kuwait J Sci. 2024;51:100176.
    [CrossRef] [Google Scholar]
  64. , , . Extremal harary index of graphs with given number of vertices of odd degree. Math Probl Eng. 2022;2022:4916308.
    [CrossRef] [Google Scholar]
  65. , . A forgotten topological index. J Math Chem. 2015;53:1184-90.
    [CrossRef] [Google Scholar]
  66. , . QSPR study using four novel topological indices for benzenoid hydrocarbons. AIP Conf Proceed. 2023;2797:020005.
    [CrossRef] [Google Scholar]
  67. , , . On QSAR modeling with novel degree-based indices and thermodynamics properties of eye infection therapeutics. Front Chem. 2024;12:1383206.
    [CrossRef] [PubMed] [Google Scholar]
  68. , , , . Forgotten index of generalized operations on graphs. J Chem. 2021;2021:9971277.
    [CrossRef] [Google Scholar]
  69. , , , , , . A comprehensive machine learning-based models for predicting mixture toxicity of azole fungicides toward algae (Auxenochlorella pyrenoidosa) Environ Int. 2024;194:109162.
    [CrossRef] [PubMed] [Google Scholar]
  70. , . Some bounds for the connectivity index of a chemical graph. J Chem Inf Comput Sci. 1998;38:827-831.
    [CrossRef] [Google Scholar]
  71. , , . An evaluation of mathematical models for predicting skin permeability. J Pharm Sci. 2008;97:584-98.
    [CrossRef] [PubMed] [Google Scholar]
  72. , . Stochasticity, individuality and behavior. Curr Biol. 2018;28:R8-12.
    [CrossRef] [PubMed] [Google Scholar]
  73. , , , , . Stochastic dynamics of hepatitis B epidemics. Results Phys. 2021;20:103730.
    [CrossRef] [Google Scholar]
  74. , , . Euler-Maruyama method for solving first order uncertain stochastic differential equations. Afr J Math Comput Sci Res. 2023;16:17-26.
    [CrossRef] [Google Scholar]
  75. , . Modeling HIV/AIDS with the effect of screening in India: A stochastic sex-structured approach incorporating logarithmic mean-reverting OrnsteinUhlenbeck process. Chaos Solitons Fractals. 2025;191:115899.
    [CrossRef] [Google Scholar]
  76. , . Parameter estimation for threshold OrnsteinUhlenbeck processes from discrete observations. J Comput Appl Math. 2022;411:114264.
    [CrossRef] [Google Scholar]
  77. , , . Estimation of stochastic volatility by using ornstein-uhlenbeck type models. Phys A Stat Mech Its Appl. 2018;491:167-76.
    [CrossRef] [Google Scholar]
  78. , , . Data informed solution estimation for forward backward stochastic differential equations. Anal Appl. 2020;19:p439-64.
    [CrossRef] [Google Scholar]
  79. , . Ridge regression: Biased estimation for nonorthogonal problems. Technometrics. 1970;12:55-67.
    [CrossRef] [Google Scholar]
  80. . Physiochemical determinants of skin absorption In: Principles of Route-to-Route Extrapolation for Risk Assessment. New York: Elsevier; . p. :93-127.
    [Google Scholar]
  81. , , , , . In silico prediction of skin permeability using a two-QSAR approach. Pharmaceutics. 2022;14:961.
    [CrossRef] [PubMed] [Google Scholar]
  82. , . Data preprocessing and feature selection for machine learning intrusion detection systems. ICIC Express Lett. 2019;13:93-101.
    [Google Scholar]
  83. , , . Optimizing acidizing design and effectiveness assessment with machine learning for predicting post-acidizing permeability. Sci Rep. 2023;13:11851.
    [CrossRef] [PubMed] [Google Scholar]
  84. , . A high accurate user-friendly energy audit platform of a university building using ANN Bayesian regularization and Levenberg-Marquardt algorithm. Energy Rep. 2024;11:2220-35.
    [CrossRef] [Google Scholar]
  85. , . Artificial intelligence for skin permeability prediction: Deep learning. J Drug Target. 2024;32:1-20.
    [CrossRef] [PubMed] [Google Scholar]
  86. , . Construction of neural network model for exercise load monitoring based on yoga training data and rehabilitation therapy. Heliyon. 2024;10:e32679.
    [CrossRef] [PubMed] [Google Scholar]
  87. , . FyzzyGBR-a gradient boosting regression software with fuzzy target values. Softw Impacts. 2022;14:100430.
    [CrossRef] [Google Scholar]
  88. , , , , , , et al. Mathematical models of skin permeability: An overview. Int J Pharm. 2011;418:115-29.
    [CrossRef] [PubMed] [Google Scholar]
  89. , , , , , , et al. A self-targeting, dual ROS/pH-responsive apoferritin nanocage for spatiotemporally controlled drug delivery to breast cancer. Biomacromolecules. 2018;19:1026-36.
    [CrossRef] [PubMed] [Google Scholar]
  90. , . Efficient large-scale mechanism-based computation of skin permeability. Comput Toxicol. 2023;26:100263.
    [CrossRef] [Google Scholar]
  91. , , . Predictive modeling of skin permeability for molecules: Investigating FDA-approved drug permeability with various AI algorithms. PLOS Digit Health. 2024;3:e0000483.
    [CrossRef] [PubMed] [Google Scholar]
  92. , , , . Quantitative prediction of ionization effect on human skin permeability. Int J Pharm. 2017;522:222-33.
    [CrossRef] [PubMed] [Google Scholar]
  93. , , , . Correlation between the structure and skin permeability of compounds. Sci Rep. 2021;11:10076.
    [CrossRef] [PubMed] [Google Scholar]
  94. , , , . Exploring 3D-QSPR models of human skin permeability for a diverse dataset of chemical compounds. J Recept Signal Transduct Res. 2019;39:442-50.
    [CrossRef] [PubMed] [Google Scholar]
  95. , , , , , , et al. An evaluation of the potential of linear and nonlinear skin permeation models for the prediction of experimentally measured percutaneous drug absorption. J Pharm Pharmacol. 2012;64:566-77.
    [CrossRef] [PubMed] [Google Scholar]
  96. , , . Predicting the rate of skin penetration using an aggregated conformal prediction framework. Mol Pharm. 2017;14:1571-6.
    [CrossRef] [PubMed] [Google Scholar]
  97. , , , , . Exploring and understanding the microwave-assisted pyrolysis of waste lignocellulose biomass using gradient boosting regression machine learning model. Renew Energy. 2024;231:120968.
    [CrossRef] [Google Scholar]
  98. , , , . A QSPR analysis of physical properties of antituberculosis drugs using neighbourhood degree-based topological indices and support vector regression. Heliyon. 2024;10:e28260.
    [CrossRef] [PubMed] [Google Scholar]
  99. , , , , , , et al. A combination of library search and Levenberg-Marquardt algorithm in optical scatterometry. Thin Solid Films. 2023;767:139670.
    [CrossRef] [Google Scholar]
Show Sections