Data & Resource Sharing Plan

We will disseminate information as widely, freely, and rapidly as possible. All sequence and expression data will be made publicly available as soon as they have passed our filtering and quality controls by submission to GenBank and Gene Expression Omnibus, respectively (Table 1). Phenotypic data will be made publicly available through thePhenome Networks,Project Unity Platform. Other kinds of data such as soil characteristics will be made available by deposition onDryad and via prompt publication. Information will also be distributed to the general public and to our end users through our project website, which will involve extending theSunflower Genome Database to accommodate new data, as well as to provide information about the current project.

The germplasm resources produced under this award (i.e., the multi-species MAGIC populations) will be deposited in the USDA sunflower germplasm repository at the North Central Regional Plant Introduction Station in Ames, IA for ongoing maintenance and distribution. The germplasm will be made freely available, subject to the standard SMTA required by the Treaty. We also will provide sufficient seed for immediate distribution to interested parties. We have already made our sunflower association mapping (SAM) population available via this mechanism, and it has since been distributed to numerous research groups throughout the world.

Table 1. Data release and resource sharing plan.

Data/resource typeTime of data/resource generationTime of data/resource releaseData/resource location
WGS sequence dataYear 3Year 3GenBank
Expression dataYears 2-4Years 2-4Gene Expression Omnibus
Phenotypic dataYears 1-4Years 1-4Project Unity Platform
OtherYears 1-4Years 1-4Dryad
GermplasmYear 3Year 3USDA-ARS sunflower genebank


Also, we will provide a central data mining resource on SAP’s cloud computing network. Different genomics aspects of the reference genome, elite varieties and wild relatives of sunflower will be publicly accessible including analytical tools (e.g., association mapping), data mining (e.g., sequences, variants, annotations, expression profiles) and interactive visualization tools (e.g., genome browser and many plotting functions). Using an interactive HTML5 UIs in the cloud instance will provide an independent and efficient platform for browsing and analyzing complex datasets in a simple and friendly environment. An advantage of having this resource on the Cloud is that analyses can be done extremely rapidly from any place on the globe.