Welcome to AgBioData!

We are a consortium of agricultural biological databases with the mission of consolidating standards and best practices for acquiring, displaying, and reusing genomic, genetic, and breeding (GGB) data (Harper et al., 2018). Formed in 2015, our consortium involves 44 GGB databases (the complete list here) and over 200 members, including database curators, researchers, librarians, and anyone who works with agricultural data.

The AgBioData consortium embraces the Findable, Accessible, Interoperable, and Reusable (FAIR) principles to facilitate and maximize the accessibility and reuse of large-scale data in agricultural research. We recently received funding from the National Science Foundation (NSF) for a Research Coordination Network project that aims to:

  1. define community-based standards for FAIR agricultural data;
  2. expand our network by recruiting anyone who generates, uses, curates, archives, or publishes data;
  3. provide educational material to train scientists on FAIR data sharing;
  4. develop a roadmap for a sustainable GGB database ecosystem.

If you are interested in our work and want to be part of our community, you can:

Participating Databases and Resources

  • The Enterprise Breeding System (EBS) is open-source breeding informatics software.

  • GOBii (Genomic Open-source Breeding informatics initiative) is an open-source genomic data management and analysis tool.

  • SorghumBase is a web portal for comparative plant genomics focused on Sorghum crop varieties.

  • The central goal of the Plant Metabolic Network (PMN) is to bring together biochemical pathway databases and research communities focused on plant metabolism.

  • The Pulse Crop Database (PulseDB) is being developed by the Main Bioinformatics Laboratory at WSU.

  • The Genome Database for Rosaceae is a curated and integrated web-based relational database...

  • MaizeGDB is a community-oriented, long-term, federally funded informatics service to researchers focused on the crop plant and model organism Zea mays.

  • WheatIS provides a single-access, web-based system for the wheat research community

  • GrainGenes, a database for Triticeae and Avena, is a comprehensive resource for molecular and phenotypic information

  • AgBase is a curated, open-source, Web-accessible resource for functional analysis of agricultural plant and animal gene products.

  • The Arabidopsis Information Resource (TAIR) collects information and maintains a database of genetic and molecular biology data for Arabidopsis thaliana, a widely used model plant.

  • The SoyBase database was established in the 1990s as the USDA Soybean Genetics Database. Originally...

  • The Sol Genomics Network (SGN) is a clade-oriented database dedicated to the biology of the Solanaceae family...

  • In the new age of comparative plant biology, we are looking at datasets from numerous inter and intra-specific...

  • This web portal is designed to provide convenient access to peanut genetic and genomic data...

  • It is the mission of NADC to conduct basic and applied research on selected diseases of economic importance to the U.S. livestock and poultry industries.

  • The TreeGenes database and Dendrome project provide custom informatics tools to manage...

  • The i5k Workspace@NAL is a platform for communities around ‘orphaned’ arthropod genome projects to access, visualize, curate and disseminate their data. 

  • CottonGen is a new cotton community genomics, genetics and breeding database being developed to enable basic, translational and applied research in cotton. 

  • The Animal Quantitative Trait Loci (QTL) Database (Animal QTLdb) strives to collect all publicly available trait mapping data, i.e. QTL

  • The Genome Database for Vaccinium (GDV) is being developed to house and integrate genomic, genetic and breeding data for blueberry, cranberry and other Vaccinium species.

  • The Citrus Genome Database, known as CGD, is a USDA and NSF funded resource to enable basic...

  • The Germplasm Resources Information Network (GRIN) web server provides germplasm...

  • The Triticeae Toolbox (T3) is a repository for public wheat data generated by the Wheat Coordinated Agricultural Project...

  • A collaborative, community resource to facilitate crop improvement by integrating genetic, genomic, and trait data across legume species.

  • The Hardwood Genomics Web serves forest tree scientists by providing online access to hardwood tree genomic...

  • CyVerse is funded by the National Science Foundation’s Directorate for Biological Sciences.

  • Gramene is a curated, open-source, integrated data resource for comparative functional genomics in crops and model plant species.