Analysis of Plant Breeding on Hadoop and Spark

作者:Shuangxi, Chen; Chunming, Wu; Yongmao, Yu
来源:Advances in Agriculture, 2016, 2016: 1-6.
DOI:10.1155/2016/7081491

摘要

<jats:p>Analysis of crop breeding technology is one of the important means of computer-assisted breeding techniques which have huge data, high dimensions, and a lot of unstructured data. We propose a crop breeding data analysis platform on Spark. The platform consists of Hadoop distributed file system (HDFS) and cluster based on memory iterative components. With this cluster, we achieve crop breeding large data analysis tasks in parallel through API provided by Spark. By experiments and tests of Indica and Japonica rice traits, plant breeding analysis platform can significantly improve the breeding of big data analysis speed, reducing the workload of concurrent programming.</jats:p>