Current language will be adjusted as these plans and guidance evolve.
Resources for Promoting Inclusion and Equity in Research (PIER) Plans
Recruitment Strategy for Promoting Inclusive and Equitable Research
If you need resources for building a diverse team and expanding collaborations to promote equity and inclusion within your proposal, the following programs are available.
DOE-sponsored and collaborating National Lab programs:
- Department of Energy’s Office of Science (DOE-SC) Workforce Development for Teachers and Scientists (WDTS) programs (https://science.osti.gov/wdts)
- ORNL: Educational Programs (https://education.ornl.gov/)
- ANL: Educational Programs and Outreach (https://www.anl.gov/education)
- BNL: Workforce Development and Science Education (https://www.bnl.gov/education/)
KBase supported and affiliated programs:
- KBase Educators supports and collaborates with educators and students at approximately 30 Minority-Serving Institutions in the US and in 19 countries across 6 continents.
- Regularly hosts summer high school, undergraduate, and graduate interns.
- Involved with the NSF Research Experience for Undergraduates National Summer Undergraduate Research Project (NSURP; https://nsurp.org/).
Creating and Sustaining an Inclusive, Safe, and Professional Environment
We recommend referring to your home-institution programs for resources on Diversity, Equity, and Inclusion training, including bias and mentoring.
Example language for proposals that include KBase staff or support the KBase community:
Examples of programs and events can be found on respective Lab sites:
- LBNL: https://diversity.lbl.gov/
- ORNL: https://diversity.ornl.gov/
- ANL: https://www.anl.gov/diversity-and-inclusion
- BNL: https://www.bnl.gov/diversity/
Data Management Plans (DMPs)
Required for all publishable data, metadata, and software from research funded by the the Genomic Science Program (https://genomicscience.energy.gov/datasharing/) and Department of Energy (https://www.energy.gov/datamanagement/doe-policy-digital-research-data-management).
Data Management Plans, or DMPs, should address how samples, data, software, and any other research products generated by the project will be accessed, shared, and preserved, and include ownership/responsibility considerations. These topics support the generation of Findable, Accessible, Interoperable, and Reusable (FAIR, Wilkinson et al. 2016) scientific research products. Quality metadata about the research products, especially samples, data, and the provenance on how each were generated, also support comparability and reproducibility. We recommend reviewing the 10 Simple Rules and Getting and Giving Credit for Data (Wood-Charlson et al. 2022) for additional details and resources.
For each topic, we have provided information and example text on how KBase adheres to community standards, enables sharing and public access, and fits into the broader architecture of DOE-funded resources that support a complete data lifecycle (Figure 1).
Fig. 1 Data and Analysis Ecosystem BER researchers and facilities produce a wealth of data for predictively understanding complex biological systems. To fully leverage this data, BSSD supports integrative computational and data science platforms to facilitate community access, analysis, and sharing of omics, imaging, and other data types. [Courtesy Lawrence Berkeley National Laboratory] (U.S. DOE. 2021. Biological Systems Science Division Strategic Plan, DOE/SC-0205. U.S. Department of Energy Office of Science (genomicscience.energy.gov/2021bssdstrategicplan/).
Samples and data
Describe the types of samples that will be generated; the protocols, tools, and instruments used to generate data; and any analysis the data will undergo. Include descriptions (size, format, frequency, etc) of any existing or planned datasets.
While not a sample repository, KBase does support community standards used to describe sample metadata. Examples include: International Geo/General Sample Number (IGSN) – a community-standard persistent identifier for physical samples, and NCBI Biosamples (see documentation on KBase samples for more information). For microbiome-specific samples, we recommend also reviewing the National Microbiome Data Collaborative’s DMP page on “Microbiome Community Standards & Repositories.”
KBase supports a wide range of data types and file formats related to systems biology research, including genomes, annotations, metagenomes, expression, protein-protein interactions, models of organismal and community metabolism, gene regulation, and sample metadata. See links for a complete list of KBase data types and file formats and KBase Data Sources.
KBase supports open source software and best practices as a resource developed to provide public access to cutting-edge software tools relevant to the mission of the Department of Energy Office of Science. It also provides a means for users to share their data and optimized software workflows. All KBase code is available in the KBase Github repository, as a community-standard, open source platform for making software and code widely available.
Access, sharing, and preservation
KBase is free and open to anyone able to access a web browser and create an account. Data and analysis workflows can be made accessible outside the KBase login by creating a “static” HTML snapshot of the Narrative. Full access (e.g., add data, run tools, or download) requires viewers to create an account, as that supports provenance and reuse tracking.Therefore, it is recommended to provide sufficient documentation in the Narrative such that readers can explore your results prior to being asked to login.
A specific version of a KBase Narrative can be saved and assigned a DOI, by request. The Narrative DOI. The default reuse policy for Narratives with a DOI is a Creative Commons CC0 license, allowing unlimited reuse. Narrative authors may request a CC-BY-4.0 license (reuse with attribution) for a Narrative as an analysis workflow, but all data within the Narrative remain CC0.
All KBase user data are backed up to the Amazon S3 cloud services, ensuring that the full system could be restored if necessary. If KBase were to no longer be supported, KBase has a partnership with the California Digital Library to migrate all data associated with a Narrative with a DOI to Dryad for long-term access and storage.
Ownership and responsibilities
The DMP should ensure that roles and responsibilities are accounted for and it is clear how the requirements for data management will be met. It should also be clear how responsibilities will be transferred if personnel leave the project. In KBase, the creator of a Narrative or Organization is permanently assigned. However, full administrative privileges can be granted (and revoked for) to other KBase users.
KBase also supports bioinformaticians to contribute code and analysis tools to the KBase open source project. Community developers are trained in the KBase software development kit (SDK) and supported throughout the KBase software release process. It is expected that community developers maintain and update their code as necessary.
How to cite KBase: Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, et al. KBase: The United States Department of Energy Systems Biology Knowledgebase. Nature Biotechnology. 2018; 36: 566. doi: 10.1038/nbt.4163
How to cite the KBase DMP: Arkin AP, et al. DOE Systems Biology Knowledgebase (KBase) Data Management Plan. DMPTool. 2024. doi: 10.48321/D13BA96b4d