Analysis of Plant Breeding on Hadoop and Spark

Analysis of crop breeding technology is one of the important means of computer-assisted breeding techniques which have huge data, high dimensions, and a lot of unstructured data. We propose a crop breeding data analysis platform on Spark. The platform consists of Hadoop distributed file system (HDFS...

Full description

Saved in:
Bibliographic Details
Main Authors: Shuangxi Chen, Chunming Wu, Yongmao Yu
Format: Article
Language:English
Published: Wiley 2016-01-01
Series:Advances in Agriculture
Online Access:http://dx.doi.org/10.1155/2016/7081491
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Analysis of crop breeding technology is one of the important means of computer-assisted breeding techniques which have huge data, high dimensions, and a lot of unstructured data. We propose a crop breeding data analysis platform on Spark. The platform consists of Hadoop distributed file system (HDFS) and cluster based on memory iterative components. With this cluster, we achieve crop breeding large data analysis tasks in parallel through API provided by Spark. By experiments and tests of Indica and Japonica rice traits, plant breeding analysis platform can significantly improve the breeding of big data analysis speed, reducing the workload of concurrent programming.
ISSN:2356-654X
2314-7539