DeepGS:Predicting phenotypes from genotypes using Deep Learning
The R package ‘DeepGS’ can be used to perform genomic selection (GS), which is a promising
breeding strategy in plants and animals. DeepGS predicts phenotypes using genomewide
genotypic markers with an advanced machine learning technique (deep learning). The effectiveness
of DeepGS has been demonstrated in predicting eight phenotypic traits on a population
of 2000 Iranian bread wheat (Triticum aestivum) lines from the wheat gene bank of the International
Maize and Wheat Improvement Center (CIMMYT).
Version and download
Version 1.0 -First version released on Feb, 15th, 2017
Version 1.1 -Second version released on Oct, 12th, 2017
Version 1.2 -Third version released on Jun, 6th, 2018
1.’ELBPSO’ funtion was added for ensemble learning based on particle swarm optimization (ELBPSO)
2.Update package document
3.Function optimization for building deep learning Genomic selection prediction model
DeepGS-CPU Installation
Docker installation and start
For Windows (Test on Windows 10 Enterprise version):
Drag the docker into Applications and complete installation;
Start docker from Launchpad by click it.
For Ubuntu (Test on Ubuntu 14.04 LTS and Ubuntu 16.04 LTS):
Go to Docker, choose your Ubuntuversion, browse to pool/stable and choose amd64, armhf, ppc64el or s390x. Download the DEB file for the Docker version you want to install;
Install Docker, supposing that the DEB file is download into following path:“/home/docker-ce~ubuntu_amd64.deb”
Once Docker installation is completed, we can run hello-world image to verify if Docker is installed correctly. Open terminal in Mac OS X and Linux operating system and open CMD for Windows operating system, then type the following command:
$ docker run hello-world
Note: root permission is required for Linux operating system.
Note: considering that differences between different computers may exist, please refer to official installation manual if instructions above don’t work.
DeepGS-CPU Docker image installation and quickly start
$ docker pull malab/deepgs_cpu
$ docker run -it -v /host directory of dataset:/home/data malab/deepgs_cpu R
Note: Supposing that users’ private dataset is located in directory /home/test, then change the words above (/host directory of dataset) to host directory (/home/test)
library(DeepGS)
setwd("/home/data/")
Important: the directory (/home/data/) is a virtual directory in DeepGS Docker image. In order to use private dataset more easily, the parameter “-v” is strongly recommended to mount host directory of dataset to DeepGS image.
DeepGS:Predicting phenotypes from genotypes using Deep Learning
The R package ‘DeepGS’ can be used to perform genomic selection (GS), which is a promising breeding strategy in plants and animals. DeepGS predicts phenotypes using genomewide genotypic markers with an advanced machine learning technique (deep learning). The effectiveness of DeepGS has been demonstrated in predicting eight phenotypic traits on a population of 2000 Iranian bread wheat (Triticum aestivum) lines from the wheat gene bank of the International Maize and Wheat Improvement Center (CIMMYT).
Version and download
1.’ELBPSO’ funtion was added for ensemble learning based on particle swarm optimization (ELBPSO)
2.Update package document
3.Function optimization for building deep learning Genomic selection prediction model
DeepGS-CPU Installation
Docker installation and start
For Windows (Test on Windows 10 Enterprise version):
For Mac OS X (Test on macOS Sierra version 10.12.6 and macOS High Sierra version 10.13.3):
For Ubuntu (Test on Ubuntu 14.04 LTS and Ubuntu 16.04 LTS):
Verify if Docker is installed correctly
Once Docker installation is completed, we can run hello-world image to verify if Docker is installed correctly. Open terminal in Mac OS X and Linux operating system and open CMD for Windows operating system, then type the following command:
Note: root permission is required for Linux operating system. Note: considering that differences between different computers may exist, please refer to official installation manual if instructions above don’t work.
DeepGS-CPU Docker image installation and quickly start
Note: Supposing that users’ private dataset is located in directory
/home/test, then change the words above (/host directory of dataset) to host directory (/home/test)Important: the directory (
/home/data/) is a virtual directory in DeepGS Docker image. In order to use private dataset more easily, the parameter “-v” is strongly recommended to mount host directory of dataset to DeepGS image.DeepGS-GPU Installation
The details of DeepGS installation are available at: https://github.com/cma2015/DeepGS/blob/master/DeepGS_GPU_installation.md
Data preparation and paramaters setting
Training DeepGS model
Prediction
Performance evaluation
ELBPSO
Ask questions
Please use DeepGS/issues for how to use DeepGS and reporting bugs.
Citation
Ma, W., Qiu, Z., Song, J., Li, J., Cheng, Q., Zhai, J., & Ma, C. (2018). A deep convolutional neural network approach for predicting phenotypes from genotypes. Planta, 248(5): 1307-1318.