Oil yield prediction for sunflower hybrid selection using different machine learning algorithms
Autori
Cvejić, SandraHrnjaković, Olivera
Jocković, Milan
Kupusinac, Aleksandar
Doroslovački, Ksenija
Gvozdenac, Sonja
Jocić, Siniša
Miladinović, Dragana
Članak u časopisu (Objavljena verzija)
Metapodaci
Prikaz svih podataka o dokumentuApstrakt
Due to the increased demand for sunfower production, its breeding assignment is the ntensifcation
of the development of highly productive oil seed hybrids to satisfy the edible oil industry. Sunfower
Oil Yield Prediction (SOYP) can help breeders to identify desirable new hybrids with high oil yield and their characteristics using machine learning (ML) algorithms. In this study, we developed ML models to predict oil yield using two sets of features. Moreover, we evaluated the most relevant features for accurate SOYP. ML algorithms that were used and compared were Artifcial Neural Network (ANN), Support Vector Regression, K-Nearest Neighbour, and Random Forest Regressor (RFR). The dataset consisted of samples for 1250 hybrids of which 70% were randomly selected and were used to train the model and 30% were used to test the model and assess its performance. Employing MAE, MSE, RMSE and R2 evaluation metrics, RFR consistently outperformed in all datasets, achieving a peak of 0.92 for R2 ...in 2019. In contrast, ANN recorded the lowest MAE, reaching 65 in 2018 The paper revealed that in addition to seed yield, the following characteristics of hybrids were important for SOYP: resistance to broomrape (Or) and downy mildew (Pl) and maturity. It was also disclosed that the locality feature could be used for the estimation of sunfower oil yield but it is highly dependable on weather conditions that afect the oil content and seed yield. Up to our knowledge, this is the frst study in which ML was used for sunfower oil yield prediction. The obtained results indicate that ML has great potential for application in oil yield prediction, but also selection of parental lines for hybrid production, RFR algorithm was found to be the most efective and along with locality feature is going to be further evaluated as an alternative method for genotypic selection.
Ključne reči:
sunflower / oil yield prediction / machine learning / oil yield / hybrids / hybrid selection / algorithmsIzvor:
Scientific Reports, 2023, 13, 1, 17611-Izdavač:
- Nature Portfolio
Finansiranje / projekti:
- Ministarstvo nauke, tehnološkog razvoja i inovacija Republike Srbije, institucionalno finansiranje - 200032 (Naučni institut za ratarstvo i povrtarstvo, Novi Sad) (RS-MESTD-inst-2020-200032)
- SmartSun - Creating climate smart sunflower for future challenges (RS-ScienceFundRS-Ideje-7732457)
- CROPINNO - Stepping up scientific excellence and innovation capacity for climate-resilient crop improvement and production (EU-HE-101059784)
- COST Action CA18111: Genome editing in plants - a technology with transformative potential
- COST Action CA19125: EPIgenetic mechanisms of Crop Adaptation To Climate cHange (EPI-CATCH)
- Climate Crops - Centre of Excellence for Innovations in Breeding of Climate-Resilient Crops, Institute of Field and Vegetable Crops
Kolekcije
Institucija/grupa
FiVeRTY - JOUR AU - Cvejić, Sandra AU - Hrnjaković, Olivera AU - Jocković, Milan AU - Kupusinac, Aleksandar AU - Doroslovački, Ksenija AU - Gvozdenac, Sonja AU - Jocić, Siniša AU - Miladinović, Dragana PY - 2023 UR - http://fiver.ifvcns.rs/handle/123456789/3918 AB - Due to the increased demand for sunfower production, its breeding assignment is the ntensifcation of the development of highly productive oil seed hybrids to satisfy the edible oil industry. Sunfower Oil Yield Prediction (SOYP) can help breeders to identify desirable new hybrids with high oil yield and their characteristics using machine learning (ML) algorithms. In this study, we developed ML models to predict oil yield using two sets of features. Moreover, we evaluated the most relevant features for accurate SOYP. ML algorithms that were used and compared were Artifcial Neural Network (ANN), Support Vector Regression, K-Nearest Neighbour, and Random Forest Regressor (RFR). The dataset consisted of samples for 1250 hybrids of which 70% were randomly selected and were used to train the model and 30% were used to test the model and assess its performance. Employing MAE, MSE, RMSE and R2 evaluation metrics, RFR consistently outperformed in all datasets, achieving a peak of 0.92 for R2 in 2019. In contrast, ANN recorded the lowest MAE, reaching 65 in 2018 The paper revealed that in addition to seed yield, the following characteristics of hybrids were important for SOYP: resistance to broomrape (Or) and downy mildew (Pl) and maturity. It was also disclosed that the locality feature could be used for the estimation of sunfower oil yield but it is highly dependable on weather conditions that afect the oil content and seed yield. Up to our knowledge, this is the frst study in which ML was used for sunfower oil yield prediction. The obtained results indicate that ML has great potential for application in oil yield prediction, but also selection of parental lines for hybrid production, RFR algorithm was found to be the most efective and along with locality feature is going to be further evaluated as an alternative method for genotypic selection. PB - Nature Portfolio T2 - Scientific Reports T1 - Oil yield prediction for sunflower hybrid selection using different machine learning algorithms IS - 1 SP - 17611 VL - 13 DO - 10.1038/s41598-023-44999-3 ER -
@article{ author = "Cvejić, Sandra and Hrnjaković, Olivera and Jocković, Milan and Kupusinac, Aleksandar and Doroslovački, Ksenija and Gvozdenac, Sonja and Jocić, Siniša and Miladinović, Dragana", year = "2023", abstract = "Due to the increased demand for sunfower production, its breeding assignment is the ntensifcation of the development of highly productive oil seed hybrids to satisfy the edible oil industry. Sunfower Oil Yield Prediction (SOYP) can help breeders to identify desirable new hybrids with high oil yield and their characteristics using machine learning (ML) algorithms. In this study, we developed ML models to predict oil yield using two sets of features. Moreover, we evaluated the most relevant features for accurate SOYP. ML algorithms that were used and compared were Artifcial Neural Network (ANN), Support Vector Regression, K-Nearest Neighbour, and Random Forest Regressor (RFR). The dataset consisted of samples for 1250 hybrids of which 70% were randomly selected and were used to train the model and 30% were used to test the model and assess its performance. Employing MAE, MSE, RMSE and R2 evaluation metrics, RFR consistently outperformed in all datasets, achieving a peak of 0.92 for R2 in 2019. In contrast, ANN recorded the lowest MAE, reaching 65 in 2018 The paper revealed that in addition to seed yield, the following characteristics of hybrids were important for SOYP: resistance to broomrape (Or) and downy mildew (Pl) and maturity. It was also disclosed that the locality feature could be used for the estimation of sunfower oil yield but it is highly dependable on weather conditions that afect the oil content and seed yield. Up to our knowledge, this is the frst study in which ML was used for sunfower oil yield prediction. The obtained results indicate that ML has great potential for application in oil yield prediction, but also selection of parental lines for hybrid production, RFR algorithm was found to be the most efective and along with locality feature is going to be further evaluated as an alternative method for genotypic selection.", publisher = "Nature Portfolio", journal = "Scientific Reports", title = "Oil yield prediction for sunflower hybrid selection using different machine learning algorithms", number = "1", pages = "17611", volume = "13", doi = "10.1038/s41598-023-44999-3" }
Cvejić, S., Hrnjaković, O., Jocković, M., Kupusinac, A., Doroslovački, K., Gvozdenac, S., Jocić, S.,& Miladinović, D.. (2023). Oil yield prediction for sunflower hybrid selection using different machine learning algorithms. in Scientific Reports Nature Portfolio., 13(1), 17611. https://doi.org/10.1038/s41598-023-44999-3
Cvejić S, Hrnjaković O, Jocković M, Kupusinac A, Doroslovački K, Gvozdenac S, Jocić S, Miladinović D. Oil yield prediction for sunflower hybrid selection using different machine learning algorithms. in Scientific Reports. 2023;13(1):17611. doi:10.1038/s41598-023-44999-3 .
Cvejić, Sandra, Hrnjaković, Olivera, Jocković, Milan, Kupusinac, Aleksandar, Doroslovački, Ksenija, Gvozdenac, Sonja, Jocić, Siniša, Miladinović, Dragana, "Oil yield prediction for sunflower hybrid selection using different machine learning algorithms" in Scientific Reports, 13, no. 1 (2023):17611, https://doi.org/10.1038/s41598-023-44999-3 . .