Objective: In the petrochemical industry, molecular reconstruction is crucial for understanding and optimizing the compositions of complex crude oil and petroleum products. As the first step of process simulation, quality control, and economic evaluation, precise molecular reconstruction approaches usually employ mathematical models to calculate the molecular compositions of petroleum products that align with their macroscopic properties. Traditional molecular reconstruction methods employ the gamma distribution to represent the carbon number distributions of homologs, but the coupling effects between the parameters "shape (α)" and "scale (β)" pose notable challenges in achieving desired interpretability and optimization efficiency. This study addresses these challenges by introducing a novel shape-decoupled parameter method that enhances the model's interpretability and simplifies the optimization process. Methods: The proposed shape-decoupled parameter method modifies a traditional gamma distribution by replacing the parameter's shape and scale with two new independent variables called peak position (m) and variance (σ2). Notably, m provides direct control over the zenith of the distribution, whereas σ2 independently determines the spread or width of the distribution, effectively reducing the coupling issue between parameters that exists in conventional gamma distribution models. Aiming at enhancing the stability and convergence speed during optimization, a multivariate linear regression (MLR) model was employed to estimate the initial parameter values. This regression model was trained on historical data of molecular compositions to provide reasonable initial values and decrease the probability of being trapped in local minima. The molecule-type homologous series (MTHS) matrix is used to represent the molecular composition of hydrocarbons, namely paraffins, isoparaffins, olefins, naphthenes, and aromatics (PIONA), with a comprehensive depiction of their multiple homologs. Moreover, an optimization problem was developed to minimize the prediction errors of the macroscopic properties, including molecular weight, density, PIONA group composition, and true boiling point curves. Upon a comparative analysis of multiple deterministic and heuristic optimization techniques, the differential evolution (DE) algorithm was determined as a favorable optimization tool by virtue of its superior accuracy and robustness. Results: Experimental evaluations showed that the shape-decoupled parameter method outperformed traditional methods in accuracy and optimization efficiency. Specifically, the density error decreased from 0.012 to 0.0059 g/cm3, and the average percentage relative error for the PIONA group composition also exhibits notable reductions. Moreover, the decoupled approach achieves faster convergence, requiring fewer iterations—reducing from 1 000 to as few as 20—without compromising accuracy. This reduction highlights the computational efficiency of the proposed method, which is a notable advantage in industrial applications with limited computational resources and time. Moreover, the proposed method exhibits enhanced robustness in addressing extreme molecular composition distributions, maintaining low errors in peak position and molecular composition predictions. This robustness becomes particularly evident when managing scenarios considered challenging by conventional methods, such as distributions with narrow ranges or hydrocarbons with approximately zero components at the boundary. Furthermore, the decoupled method provides better interpretability via independent control strategies for peak position and distribution width. The overall optimization performance was enhanced by the appropriate integration of the DE algorithm and effective initial parameter estimation by the MLR model. Conclusions: Compared with traditional methods, the proposed shape-decoupled parameter method provides a more interpretable, efficient, and accurate approach to the molecular reconstruction of petroleum products. By reducing the coupling effect between the parameters controlling the peak position and distribution width, this method simplifies the optimization process and achieves superior prediction accuracy and faster convergence. The results indicate the feasibility of its application for complex or extreme homolog distributions of hydrocarbons, revealing its higher reliability and robustness compared with traditional approaches. Future work is expected to focus on incorporating advanced machine learning techniques to further increase the accuracy and applicability of the model across a wider range of petroleum compositions, potentially enabling real-time molecular reconstruction for dynamic process optimization.
Dong QIU, Qiming ZHAO, Yijiong HU, Tong QIU. Molecular reconstruction of petroleum products based on shape-decoupled gamma distribution parameters[J]. Journal of Tsinghua University(Science and Technology), 2025, 65(5): 813-824. DOI: 10.16511/j.cnki.qhdxxb.2024.21.034
为解决这一问题,研究者们提出了基于常见宏观物性的分子重构方法。这些方法试图通过平均分子量、蒸馏曲线、密度和PIONA(paraffin,iso-paraffin,olefin,naphthene,aromatics)族组成等易获得的性质,推断可能的分子组成。现有的分子重构方法主要包括随机重构(stochastic reconstruction,SR)[5-8]、预定义确定性分子方法[9-10]、熵最大化重构(reconstruction by entropy maximization, REM)方法[11-12]、基于不同分子表达方式如结构导向集总(structure-oriented lumping,SOL)[13-18]、分子类型同源序列(molecule type homologous series,MTHS)矩阵[19-25]和状态空间[26]的衍生方法,以及基于多元线性回归(multiple linear regression,MLR)模型[27]和人工神经网络(artificial neural network,ANN)[27-28]等黑箱模型的数据驱动方法。
STRATIEVD, SHISHKOVAI, TANKOVI, et al. Challenges in characterization of residual oils. A review[J]. Journal of Petroleum Science and Engineering, 2019, 178, 227- 250.
HUDEBINED, VERSTRAETEJ J. Molecular reconstruction of LCO gasoils from overall petroleum analyses[J]. Chemical Engineering Science, 2004, 59 (22-23): 4755- 4763.
NEUROCKM, LIBANATIC, NIGAMA, et al. Monte Carlo simulation of complex reaction systems: Molecular structure and reactivity in modelling heavy oils[J]. Chemical Engineering Science, 1990, 45 (8): 2083- 2088.
ALBAHRIT A. Molecularly explicit characterization model (MECM) for light petroleum fractions[J]. Industrial & Engineering Chemistry Research, 2005, 44 (24): 9286- 9298.
10
ASTARITA G, SANDLER S I. Kinetic and thermodynamic lumping of multicomponent mixtures[M]. Amsterdam: Elsevier, 2012.
11
XUEX D, JIANGH B, OUYANGF S, et al. Molecular reconstruction of naphtha by maximum information entropy method[J]. Petroleum Science and Technology, 2019, 37 (5): 535- 542.
HUDEBINED, VERSTRAETEJ J. Reconstruction of petroleum feedstocks by entropy maximization. Application to FCC gasolines[J]. Oil & Gas Science and Technology —Revue d'IFP Energies nouvelles, 2011, 66 (3): 437- 460.
13
JAFFES B, FREUNDH, OLMSTEADW N. Extension of structure-sriented lumping to vacuum residua[J]. Industrial & Engineering Chemistry Research, 2005, 44 (26): 9840- 9852.
14
CHENJ C, FANGZ, QIUT. Molecular reconstruction model based on structure oriented lumping and group contribution methods[J]. Chinese Journal of Chemical Engineering, 2018, 26 (8): 1677- 1683.
ZHANGC, BIK X, QIUT. Molecular reconstruction of crude oil: Novel structure-oriented homologous series lumping with a cloud model[J]. Industrial & Engineering Chemistry Research, 2022, 61 (51): 18810- 18820.
16
YANGB L, ZHOUX W, CHENC, et al. Molecule simulation for the secondary reactions of fluid catalytic cracking gasoline by the method of structure oriented lumping combined with Monte Carlo[J]. Industrial & Engineering Chemistry Research, 2008, 47 (14): 4648- 4657.
17
QUANNR J, JAFFES B. Structure-oriented lumping: Describing the chemistry of complex hydrocarbon mixtures[J]. Industrial & Engineering Chemistry Research, 1992, 31 (11): 2483- 2497.
QIUT, CHENJ C, FANGZ. Molecular reconstruction model for petroleum fractions based on structure oriented lumping[J]. Journal of Tsinghua University (Science and Technology), 2016, 56 (4): 424- 429.
ZHANG Y. A molecular approach for characterization and property predictions of petroleum mixtures with applications to refinery modeling[D]. Manchester: The University of Manchester, 1999.
20
AYEM M S, ZHANGN. A novel methodology in transforming bulk properties of refining streams into molecular information[J]. Chemical Engineering Science, 2005, 60 (23): 6702- 6717.
PYLS P, HOUZ, VAN GEEMK M, et al. Modeling the composition of crude oil fractions using constrained homologous series[J]. Industrial & Engineering Chemistry Research, 2011, 50 (18): 10850- 10858.
22
WUY W, ZHANGN. Molecular characterization of gasoline and diesel streams[J]. Industrial & Engineering Chemistry Research, 2010, 49 (24): 12773- 12782.
23
AHMADM I, ZHANGN, JOBSONM. Molecular components-based representation of petroleum fractions[J]. Chemical Engineering Research and Design, 2011, 89 (4): 410- 420.
PENG B. Molecular modelling of petroleum processes[D]. Manchester: University of Manchester, 1999.
25
RENY, LIAOZ W, SUNJ Y, et al. Molecular reconstruction of naphtha via limited bulk properties: Methods and comparisons[J]. Industrial & Engineering Chemistry Research, 2019, 58 (40): 18742- 18755.
26
MEIH, CHENGH, WANGZ L, et al. Molecular characterization of petroleum fractions using state space representation and its application for predicting naphtha pyrolysis product distributions[J]. Chemical Engineering Science, 2017, 164, 81- 89.
PYLS P, VAN GEEMK M, REYNIERSM F, et al. Molecular reconstruction of complex hydrocarbon mixtures: An application of principal component analysis[J]. AIChE Journal, 2010, 56 (12): 3174- 3188.