Strategies to develop robust neural network models: Prediction of flash point as a case study.

Alibakshi, Amin (2018) Strategies to develop robust neural network models: Prediction of flash point as a case study. Analytica Chimica Acta, 1026 . pp. 69-76. DOI 10.1016/j.aca.2018.05.015.

[img] Text
Alibakshi.pdf - Published Version
Restricted to Registered users only

Download (555Kb) | Contact
[img] Other (Excel File)
Alibakshi_Suppl1.xlsx - Supplemental Material
Restricted to Registered users only

Download (9Kb) | Contact
[img] Other (Excel File)
Alibakshi_Suppl2.xlsx - Supplemental Material
Restricted to Registered users only

Download (13Mb) | Contact
[img] Other (Excel File)
Alibakshi_Suppl3.xlsx - Supplemental Material
Restricted to Registered users only

Download (64Mb) | Contact
[img] Text
Alibakshi_AcceptedManuscript.pdf - Accepted Version
Restricted to Registered users only until 4 May 2020.

Download (824Kb) | Contact

Supplementary data:

Abstract

Artificial neural network (ANN) is one of the most widely used methods to develop accurate predictive models based on artificial intelligence and machine learning. In the present study, the important practical aspects of developing a reliable ANN model e.g. appropriate assignment of the number of neurons, number of hidden layers, transfer function, training algorithm, dataset division and initialization of the network are discussed. As a case study, predictability of the flash point for a dataset of 740 organic compounds using ANNs was investigated via a total number of 484220ANNs to allow covering a wide range of parameters affecting the performance of an ANN. Among all studied parameters, the number of neurons or layers was found to be the most important parameters to develop a reliable ANN with low overfitting risk. To evaluate appropriate number of neurons and layers, a value of equal or greater than 10 for the ratio of the training samples to the ANN constants was suggested as a rule of thumb. More ever, a strategy for evaluation of the authentic performance of ANNs and deciding about the reliability of an ANN model was proposed which is applicable to other models developed by supervised learning. Based on the introduced considerations, an ANN model was proposed for predicting the flash point of pure organic compounds. According to the results, the new model was found to produce the lowest error compared to other available models.

Document Type: Article
Keywords: Artificial neural networks, Supervised learning, Overfitting, Appropriate training, Group contribution method, QSPR, Flash point, Regression, Model developement
Research affiliation: OceanRep > GEOMAR > FB2 Marine Biogeochemistry > FB2-MG Marine Geosystems
Refereed: Yes
Open Access Journal?: No
DOI etc.: 10.1016/j.aca.2018.05.015
ISSN: 0003-2670
Date Deposited: 04 Jun 2018 06:59
Last Modified: 04 Jun 2018 11:12
URI: http://oceanrep.geomar.de/id/eprint/43232

Actions (login required)

View Item View Item

Document Downloads

More statistics for this item...