Journal of Animal Breeding and Genomics (J Anim Breed Genom)
Indexed in KCI
OPEN ACCESS, PEER REVIEWED
pISSN 1226-5543
eISSN 2586-4297
Review Article

Study for Applications of Machine Learning Method using Livestock Genomic Data

Department of Animal Science and Technology, Chung-Ang University, Anseong, Gyeonggi-do, 17546, Republic of Korea

Correspondence to Jun-Mo Kim, E-mail: junmokim@cau.ac.kr

Volume 6, Number 2, Pages 45-56, June 2022.
Journal of Animal Breeding and Genomics 2022, 6(2), 45-56. https://doi.org/10.12972/jabng.20220006
Received on March 21, 2022, Revised on June 06, 2022, Accepted on June 27, 2021, Published on June 30, 2022.
Copyright © 2022 Korean Society of Animal Breeding and Genetics.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0).

ABSTRACT

Currently, we have entered the era of big data in which a lot of information and data are produced, and artificial intelligence technology is in the spotlight as a technology that can be used as an intelligent service. With the development of machine learning and deep learning technologies, artificial intelligence has been able to be grafted into various fields from signal processing and speech recognition to medical care and welfare. Furthermore, the application of machine learning to the behavioural observation of animals is being made in the livestock industry. If machine learning is applied to the behavioural observation of animals, it is possible to predict and analyze normal and abnormal behaviour through self-learning. In other words, it became possible to predict the phenotype. In addition to applying machine learning to phenotypes such as behavioural observations of animals in the livestock industry, genomic analysis can find relevant trait gene markers and apply them to machine learning. The application of machine learning through such genomic analysis is expected to be possible not only to predict genetic ability, but also to study desired breed improvement for excellent livestock breeds by trait. Genome-Wide Association Study is one of the good analysis methods to discover markers for related traits. In the application of machine learning, various strategies and methods are used, rather than simply learning data on a machine. It is important to consider and apply which strategies and methods will achieve the optimal results.

KEYWORDS

Machine Learning, Livestock, Genomics, Genome-Wide Association Study, DNA marker

ACKNOWLEDGEMENTS

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (NRF2022R1A2C1005830).

REFERENCES

Abdi, Hervé, and Lynne J. Williams. Principal component analysis. Wiley interdisciplinary reviews: computational statistics 2.4 (2010): 433459. https://doi.org/10.1002/wics.101
[DOI]

Albawi, Saad, Tareq Abed Mohammed, and Saad Al-Zawi. Understanding of a convolutional neural network. 2017 international conference on engineering and technology (ICET). Ieee, 2017. https://doi.org/10.1109/ICEngTechnol.2017.8308186
[DOI]

Arac, Ahmet, et al. DeepBehavior: A deep learning toolbox for automated analysis of animal and human behavior imaging data. Frontiers in systems neuroscience 13 (2019): 20. https://doi.org/10.3389/fnsys.2019.00020
[DOI][PubMed][PMC]

Arganda-Carreras, Ignacio, et al. Trainable Weka Segmentation: a machine learning tool for microscopy pixel classification. Bioinformatics 33.15 (2017): 2424-2426. https://doi.org/10.1093/bioinformatics/btx180
[DOI][PubMed]

Behjati, Sam, and Patrick S. Tarpey. What is next generation sequencing? Archives of Disease in Childhood-Education and Practice 98.6 (2013): 236-238. https://doi.org/10.1136/archdischild-2013-304340
[DOI][PubMed][PMC]

Black, Michael, Wenzhi Wang, and Wei Wang. Ischemic stroke: From next generation sequencing and GWAS to community genomics? OMICS: A Journal of Integrative Biology 19.8 (2015): 451-460. https://doi.org/10.1089/omi.2015.0083
[DOI][PubMed]

Bzdok, Danilo, Martin Krzywinski, and Naomi Altman. Machine learning: supervised methods. Nature methods 15.1 (2018): 5. https://doi.org/10.1038/nmeth.4551
[DOI][PubMed][PMC]

Chanock, Stephen J., et al. Replicating genotype-phenotype associations. (2007).

Chen, Chien-Chang, et al. Unsupervised learning and pattern recognition of biological data structures with density functional theory and machine learning. Scientific reports 8.1 (2018): 1-11. https://doi.org/10.1038/s41598-017-18931-5
[DOI][PubMed][PMC]

Chen, Shouhong, et al. Parameter selection algorithm of DBSCAN based on K‐means two classification algorithm. The Journal of Engineering 2019.23 (2019): 8676-8679. https://doi.org/10.1049/joe.2018.9082
[DOI]

Ciaburro, Giuseppe, and Balaji Venkateswaran. Neural Networks with R: Smart models using CNN, RNN, deep learning, and artificial intelligence principles. Packt Publishing Ltd, 2017.

Corradin, Olivia, et al. Combinatorial effects of multiple enhancer variants in linkage disequilibrium dictate levels of gene expression to confer susceptibility to common traits. Genome research 24.1 (2014): 1-13. https://doi.org/10.1101/gr.164079.113
[DOI][PubMed][PMC]

Criminisi, Antonio, Jamie Shotton, and Ender Konukoglu. Decision forests: A unified framework for classification, regression, density estimation, manifold learning and semi-supervised learning. Foundations and trends® in computer graphics and vision 7.2-3 (2012): 81-227. https://doi.org/10.1561/0600000035
[DOI]

Deng, Li, and Dong Yu. Deep learning: methods and applications. Foundations and trends in signal processing 7.3-4 (2014): 197-387. https://doi.org/10.1561/2000000039
[DOI]

Edwards, K., C. Johnstone, and C1 Thompson. A simple and rapid method for the preparation of plant genomic DNA for PCR analysis. Nucleic acids research 19.6 (1991): 1349. https://doi.org/10.1093/nar/19.6.1349
[DOI][PubMed][PMC]

Ghahramani, Zoubin. Unsupervised learning. Summer school on machine learning. Springer, Berlin, Heidelberg, 2003.

Goddard, M. E., and B. J. Hayes. Genomic selection. Journal of Animal breeding and Genetics 124.6 (2007): 323-330. https://doi.org/10.1111/j.1439-0388.2007.00702.x
[DOI][PubMed]

Greene, Derek, Pádraig Cunningham, and Rudolf Mayer. Unsupervised learning and clustering. Machine learning techniques for multimedia. Springer, Berlin, Heidelberg, 2008. 51-90. https://doi.org/10.1007/978-3-540-75171-7_3
[DOI]

Javaid, Ahmad, et al. A deep learning approach for network intrusion detection system. Eai Endorsed Transactions on Security and Safety 3.9 (2016): e2. https://doi.org/10.4108/eai.3-12-2015.2262516
[DOI]

Jones, David T. Setting the standards for machine learning in biology. Nature Reviews Molecular Cell Biology 20.11 (2019): 659-660. https://doi.org/10.1038/s41580-019-0176-5
[DOI][PubMed]

Jordan, Michael I., and Tom M. Mitchell. Machine learning: Trends, perspectives, and prospects. Science 349.6245 (2015): 255-260. https://doi.org/10.1126/science.aaa8415
[DOI][PubMed]

Kotsiantis, Sotiris B., Ioannis Zaharakis, and P. Pintelas. Supervised machine learning: A review of classification techniques. Emerging artificial intelligence applications in computer engineering 160.1 (2007): 3-24.

Kyriazakis, Ilias, and Colin T. Whittemore. Whittemore’s science and practice of pig production. John Wiley & Sons, 2008.

LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. Deep learning. nature 521.7553 (2015): 436-444. https://doi.org/10.1038/nature14539
[DOI][PubMed]

Lee, Eun-A., et al. Evaluation of whole pork belly qualitative and quantitative properties using selective belly muscle parameters. Meat science 137 (2018): 92-97. https://doi.org/10.1016/j.meatsci.2017.11.012
[DOI][PubMed]

Libbrecht, Maxwell W., and William Stafford Noble. Machine learning applications in genetics and genomics. Nature Reviews Genetics 16.6 (2015): 321-332. https://doi.org/10.1038/nrg3920
[DOI][PubMed][PMC]

Men, Artem E., et al. Sanger DNA sequencing. Next Generation Genome Sequencing: Towards Personalized Medicine (2008): 1-11. https://doi.org/10.1002/9783527625130.ch1
[DOI]

Niculescu-Mizil, Alexandru, and Rich Caruana. Predicting good probabilities with supervised learning. Proceedings of the 22nd international conference on Machine learning. 2005. https://doi.org/10.1145/1102351.1102430
[DOI]

Nielsen, Rasmus, et al. Genotype and SNP calling from next-generation sequencing data. Nature Reviews Genetics 12.6 (2011): 443-451. https://doi.org/10.1038/nrg2986
[DOI][PubMed][PMC]

Sathya, Ramadass, and Annamma Abraham. Comparison of supervised and unsupervised learning algorithms for pattern classification. International Journal of Advanced Research in Artificial Intelligence 2.2 (2013): 34-38. https://doi.org/10.14569/IJARAI.2013.020206
[DOI]

Schmidhuber, Jürgen. Deep learning in neural networks: An overview. Neural networks 61 (2015): 85-117. https://doi.org/10.1016/j.neunet.2014.09.003
[DOI][PubMed]

Schweikert, Gabriele, et al. mGene: accurate SVM-based gene finding with an application to nematode genomes. Genome research 19.11 (2009): 2133-2143. https://doi.org/10.1101/gr.090597.108
[DOI][PubMed][PMC]

Valpola, Harri. From neural PCA to deep unsupervised learning. Advances in independent component analysis and learning machines. Academic Press, 2015. 143-171. https://doi.org/10.1016/B978-0-12-802806-3.00008-7
[DOI]

Wang, Hongru, et al. The power of inbreeding: NGS-based GWAS of rice reveals convergent evolution during rice domestication. Molecular Plant 9.7 (2016): 975-985. https://doi.org/10.1016/j.molp.2016.04.018
[DOI][PubMed]

Winston, Patrick Henry. Artificial intelligence. Addison-Wesley Longman Publishing Co., Inc., 1992.

Yang, Jian, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nature genetics 44.4 (2012): 369-375. https://doi.org/10.1038/ng.2213
[DOI][PubMed][PMC]

Yin, Chuanlong, et al. A deep learning approach for intrusion detection using recurrent neural networks. Ieee Access 5 (2017): 21954-21961. https://doi.org/10.1109/ACCESS.2017.2762418
[DOI]

Zhu, Xiaojin Jerry. Semi-supervised learning literature survey. (2005).

Section