HuBMAP Frequently Asked Questions (FAQ)
General Questions
What is HuBMAP & what are its goals?
HuBMAP is the Human BioMolecular Atlas Program. The vision for HuBMAP is to catalyze the development of a framework for mapping of the human body at single-cell resolution to transform our understanding of normal tissue organization and function. This will be achieved by:- Accelerating the development of the next generation of tools and techniques for constructing high resolution spatial tissue maps that quantify multiple types of biomolecules either sequentially or simultaneously.
- Generating foundational 3D human tissue maps using validated high-content, high-throughput imaging and omics assays.
- Establishing an open data platform that will develop novel approaches to integrating, visualizing, and modelling imaging and omics data to build multi-dimensional tissue maps, and making data rapidly findable, accessible, interoperable, and reusable by the global research community.
- Coordinating and collaborating with other funding agencies, programs, and the biomedical research community to build the framework and tools for mapping the human body at single-cell resolution.
- Supporting pilot projects that demonstrate the value of the resources developed by the program to study normal individual variations and tissue changes across the lifespan and the health-disease continuum.
- Potentially lead to better understanding of the significance of normal inter-individual variability and changes across the lifespan.
- Inform about the emergence of disease at the biomolecular level before the appearance of clinical symptoms.
- Integrating high content, high resolution spatial and omics information to comprehensively profile biomolecular distribution and morphology of tissues in a high throughput manner.
- Arranging this information into 3D tissue maps amenable to modelling.
- For HuBMAP, a high-resolution assay is one that can reliably and reproducibly assign detected biomolecules to individual cells or extracellular compartments of a tissue.
- A high content approach is one that maximizes identification of tissue features through a combination of biomolecular depth, spatial resolution and multiplexing of complementary, multi-parameter assays.
- A high throughput pipeline is one that maximizes the bandwidth of data production to result in any or all of the following:
- Accelerated speed of analysis, so that hundreds or thousands of samples can be analyzed simultaneously.
- Greater depth of analysis, so that hundreds or thousands of molecules can be analyzed in a single sample.
- Enhanced capacity for volume, so that a given set of molecules can be analyzed in all the cells within a larger tissue sample.
HuBMAP, which made its first external awards in Fall 2018, is funded through the NIH Common Fund as a short-term (8 years), goal-driven strategic investment, with deliverables intended to catalyze research across multiple biomedical research disciplines. The NIH Common Fund supports cross-cutting programs that are expected to have exceptionally high impact. All Common Fund initiatives invite investigators to develop bold, innovative, and often risky approaches to address problems that may seem intractable in isolation or to seize new opportunities that offer the potential for rapid progress.
For a more in depth understanding, read the HuBMAP marker paper, see the course on HuBMAP data acquisition, analysis, and visualization, the Visible Human MOOC, or see this video: HuBMAP Overview. Stay in touch by subscribing to our mailing list and YouTube channel.
Who are the intended users of HuBMAP resources?
HuBMAP's rich datasets and associated resources are intended for broad use by the research community, including:- Computational researchers exploring organizing principles of human tissues, new structural-functional relationships, and biomolecular networks
- Biologists exploring hypotheses using publicly available HuBMAP datasets prior to or in parallel with work in their own labs
- Experimentalists interested in using the same protocols or computational tools in their labs
- Educators developing new teaching materials
- Technology developers interested in developing new assays with enhanced performance
How does HuBMAP fit into the ecosystem of other single cell efforts within and outside the NIH?
HuBMAP is part of a rich ecosystem of established and emerging atlasing programs supported by NIH and globally by other funding organizations, many of which are focused on specific organs or diseases.- HuBMAP connects with these programs to ensure data interoperability, avoid duplication of work, and leverage and synergize gained knowledge.
- The consortium has organized a number of events to bring together these communities to discuss topics of shared interest (e.g. CCF meeting, NIH-HCA meeting) and is committed to improving coordination and collaboration among different programs.
- In addition, many HuBMAP PIs actively participate in these efforts, helping with cross-pollination and advancing our global understanding.
- Techniques for integrating disparate, multi-dimensional and multi-scale datasets
- The development of a Common Coordinate Framework (CCF) for integrating data across many individuals
- The development and validation of these assays
Introduction to HuBMAP resources
We welcome all to explore the HuBMAP Consortium website and HuBMAP Data Portal.
Visitors can provide feedback through the Contact form.
The Visible Human MOOC provides an overview of HuBMAP and an introduction to data acquisition, analysis, and visualization.
Getting Started with the HuBMAP Data Portal
How can I start using the portal?
Here's a link to the HuBMAP Data Portal. HuBMAP Data Portal (and related) documentation can be found on the HuBMAP Documentation page and also the HuBMAP FAQ - this page.
How can I learn more about HuBMAP data acquisition, analysis, and visualization tools?
The Visible Human MOOC provides an overview of HuBMAP and introduction to data acquisition, analysis, and visualization.
Can I sign up to be notified of HuBMAP news and events?
Yes, you can sign up for our mailing list to keep informed on everything that is happening in HuBMAP.
Do you track my activities on your portal?
Yes, interactions with the site are recorded in server logs and on Google Analytics and are mapped to your IP address.
In that regard the HuBMAP portal is no different from the rest of the internet.
What web-browsers are supported in the portal?
All modern, mainstream browsers are supported (i.e. Chrome, Edge, Firefox, Safari, etc.).
Where should I report errors, make suggestions for new features or functionality, or ask questions about using the portal?
The HuBMAP Consortium welcomes your comments, feedback, and help in identifing errors on the HuBMAP Data Portal.
- You can provide error reports, make suggestions, or ask questions through our contact-form.
- For help with specific issues related to the portal, please contact the HuBMAP Helpdesk and submit a support ticket.
HuBMAP Data and Metadata Questions
Can I use HuBMAP data for my own research?
- Yes, follow the guidelines outlined in the HuBMAP External Data Sharing Policy.
- Access to NIH HuBMAP data is guided by the NIH Genomic Data Sharing policy.
- This acknowledgement helps justify and sustain funding needed to continue providing open access to a growing set of data and tools.
- Suggested language for such an acknowledgment is: “The results (published or shown) here are in whole or part based upon data generated by the HuBMAP Program."
- When referencing a specific HuBMAP data set, include the following information:
- Creator(s) (Year published). Data set name. HuBMAP Consortium. DOI or HuBMAP ID (with link) For example...
- Smith, J., Jones S., et al. (2024). Auto-fluorescence Microscopy Left Kidney. HuBMAP Consortium. doi10.99999/HBM999.XYZQ.111
Can I use your data for another project that I am working on?
Yes! The Consortium provides raw and processed data (at multiple levels) for the community to access through the HuBMAP Data Portal.
HuBMAP products are broadly available to the research community to establish the foundations for a human body map that other programs and the international community can build upon, including methods, tools, reagents, biospecimens, datasets, and software.
To acknowledge HuBMAP in your findings, the Consortium suggests language of the form:
- “The results (published or shown) here are in whole or part based upon data generated by the HuBMAP Program."
- When referencing a specific HuBMAP data set, include the following information:
- Creator(s) (Year published). Data set name. HuBMAP Consortium. DOI or HuBMAP ID (with link) For example...
- Smith, J., Jones S., et al. (2024). Auto-fluorescence Microscopy Left Kidney. HuBMAP Consortium. doi10.99999/HBM999.XYZQ.111
Can I submit my own data to HuBMAP? Why would I want to?
- Yes, HuBMAP allows investigators to submit their own data via the HuBMAP Data Portal.
- Why share? Having your own data on HuBMAP will ...
- Allow other researchers access to your results. In this way, others may extend and interact with your scientific work.
- Provide additional resources for creating cellular and molecular level anatomical maps of the healthy human.
Can I add my own tools to HuBMAP portal? Why would I want to submit my tools to HuBMAP?
- Yes, HuBMAP seeks to host relevant tools and welcomes community input to help with feature prioritization and development for the HuBMAP Portal.
- Adding your Tools to the HuBMAP Portal will help you get others to use your tools and provide feedback to improve the scientific impact of your work.
- One of the first tools released was Azimuth, an app for reference-based single-cell analysis, that lets users annotate cell-types in their own data based on HuBMAP approaches.
What do I need to do to gain access to HuBMAP data? How can I download HuBMAP data?
Access to data on HuBMAP's Data Portal is open to all interested viewers, without additional barriers (account creation, login, etc.).How can I download HuBMAP data?
To download HuBMAP data from the Data Portal you will need to register as a member of HuBMAP. Note that downloads of specific datasets (e.g., raw genetic data) require NIH approval. Contact the HuBMAP Helpdesk for assitance with accessing this type of data.
How can I register as a member of HuBMAP?
1. Go to the HuBMAP Consortium website2. Under Member Services select Member Register
3. Complete the registration form (you will need a valid institutional or eRA Commons ID)
Can I make HuBMAP data available through my resource?
- You may use HuBMAP data for any purposes permitted by the Data Sharing Policy.
- The CCF 3D Reference Object Library provides anatomically correct reference organs. The organs are developed by a specialist in 3D medical illustration and approved by organ experts.
- As of December 2023, the 6th HRA release included 30 organ objects that can be freely used in teaching, research, or commercial applications.
How would I cite and reference HuBMAP and related data?
To acknowledge HuBMAP data in publications or presentations, we suggest:
- “The results (published or shown) here are in whole or part based upon data generated by the HuBMAP Program."
- When referencing a specific HuBMAP data set, include the following information:
- Creator(s) (Year published). Data set name. HuBMAP Consortium. DOI or HuBMAP ID (with link) For example...
- Smith, J., Jones S., et al. (2024). Auto-fluorescence Microscopy Left Kidney. HuBMAP Consortium. doi10.99999/HBM999.XYZQ.111
The HuBMAP marker paper should be cited as:
- Snyder, M.P., Lin, S., Posgai, A. et al. The human body at cellular resolution: the NIH Human Biomolecular Atlas Program. Nature 574, 187–192 (2019).
The Visible Human reference organs are freely available via the CCF 3D Reference Object Library. Please cite as:
- Browne K, Cross LE, Herr, II BW, Record EG, Quardokus EM, Bueckle A, Börner K. 2020. HuBMAP CCF 3D Reference Object Library
Can I use figures from HuBMAP in my publications?
Yes, as long as you cite the source of the figure. See the preceeding question for more details.
What metadata is included in HuBMAP? Will HuBMAP include additional metadata in future releases?
- HuBMAP accepts donor, sample (block, section, or suspension), and assay metadata.
- See this HuBMAP metadata resource for more information.
- In future releases, metadata will be linked to various ontologies to make integration more efficient.
How does HuBMAP curate and validate metadata?
HuBMAP data submitters can access lists of metadata schemas (and related information) and download metadata templates from HuBMAP’s Data Upload Guidelines page. Data submitters can validate their own metadata using a Validator tool.- Click on any assay or sample type on the lists to jump to a page for that type.
- Select the Excel template on that page.
- Download and complete the Excel template.
- Validate your metadata spreadsheet (template) using the Metadata Spreadsheet Validator.
- The Validator tool compares entries in your metadata template against specifications stored in the CEDAR repository.
- It categorizes any errors found and provides hints on how to fix those errors.
- If errors are detected, use the Validator tools to correct any errors.
- When no errors are found, you may safely upload your spreadsheet.
- Once all errors are corrected, download the corrected metadata spreadsheet (TSV format).
- Send validated (organ or sample) metadata TSV files to the HuBMAP Helpdesk
- A Data Curator at the HuBMAP Helpdesk will manually validate and upload the files.
- Access Help documentation for the Validator.
- TMCs are required to validate their metadata BEFORE uploading it to the HuBMAP Helpdesk.
Is there a “map” in HuBMAP?
The Human Reference Atlas (HRA) is a comprehensive, high-resolution, three-dimensional atlas of all the cells in a healthy human body.- The HRA provides standard terminologies and data structures for describing:
- specimens
- biological structures
- spatial positions
- Tools in the HRA include:
- Anatomical Structures, Cell Types and Biomarkers (ASCT+B) Reporter
- Cell Population Graphs
- Registration User Interface (RUI): Register and annotate organs
- Exploration User Interface (EUI): Interact with registered organs
- VR Organ Gallery: Immersive experience for exploring organs
- Millitome: 3D-printed tool for organ sectioning
- APIs for querying and interacting with the HRA
How can I run queries? What type queries are currently supported? Can you show me example queries?
You can query and interact with HuBMAP (and other) data registered in the Human Reference Atlas (HRA) using APIs.Can I suggest types of queries I am interested in?
Contact the HRA Team with feedback or suggestions regarding APIs and queries for HuBMAP HRA data.
What controlled vocabularies or ontologies are being used by HuBMAP?
Each donor metadata item uses Unified Medical Language System (UMLS) Concept Unique Identifiers (CUIs) and related SNOMEDCT_US codes.- This list will be expanded as clinical data transactions, not just metadata, are added for donors for which data is available.
- Similarly, other metadata will be encoded with applicable ontologies.
- The HuBMAP Knowledge Graph underpins all ontologies used in HuBMAP but is not yet deployed.
- The current CCF ontology uses Uberon, Kidney Tissue Atlas Ontology (KTAO) and Cell Ontology (CL), see details
API (Application Programming Interface) Questions
Do you support an API for programmatic access?
The HuBMAP portal is built using an extensible API structure that supports all component interactions.- APIs are registered at SmartAPI.
- For external access to APIs, please submit a request to the HuBMAP Helpdesk.
What APIs are currently supported by HuBMAP?
The following APIs are currently supported by HuBMAP:
- Ingest API supports writing data and metadata to HuBMAP. TMCs use Ingest API to contribute (ingest) data and the HIVE uses it to deposit (processed) data derived via pipelines.
- UUID API supports donor and tissue sample registration, submission of data, and collection of provenance information via the Ingest UI.
- Search & Index API supports searching and reindexing of HuBMAP metadata and data.
- Ontology API accesses an instance of a UBKG, a neo4j knowledge graph that links information from a variety of biomedical vocabulary systems.
- Entity API returns information about HuBMAP data entities.
Human Reference Atlas (HRA) APIs
The HuBMAP-produced HRA provides a comprehensive, high-resolution, three-dimensional atlas of cells in the healthy human body. The HRA provides standard terminologies and data structures for describing specimens, biological structures, and spatial positions linked to existing ontologies. In addition to user interfaces, you can also query the HRA via APIs.- CCF API returns reference information about the HRA and related experimental data registered with the HRA.
- HRA via SPARQL: The HRA is published as an RDF knowledge graph and may be queried using a multitude of available libraries. It is published at biomedical ontology websites including OLS and BioPortal.
What are the licensing requirements to use HuBMAP APIs, software, and data?
- The HuBMAP APIs, SmartAPI, and CCF 3D Reference Object Library (data) are released under Creative Commons Attribution 4.0 International (CC BY 4.0).
- Most of HuBMAP's software is licensed under the MIT License or GPL v3 License.
- Most of HuBMAP's source code can be found in GitHub.
- A few source code repositories use different open source licensing. You can verify this by viewing the LICENSE file in the respective repository.
How do you plan to handle error fixes and updates? Will I be notified of these?
- You can submit a bug or request a new Data Portal feature through this form.
- For help with specific issues related to the portal, please contact the HuBMAP Helpdesk and submit a support ticket.
- Be sure you are up-to-date on all HuBMAP news, sign up for our mailing list.
Experimental Design & Data Analysis Questions
Where can I find information about experimental design associated with HuBMAP data?
An overview of the Information on the experimental design and choice of modalities can be found within this reference:- Snyder, M.P., Lin, S., Posgai, A. et al. The human body at cellular resolution: the NIH Human Biomolecular Atlas Program. Nature 574, 187–192 (2019). https://doi.org/10.1038/s41586-019-1629-x (PMC6800388)
University of Florida:
Vanderbilt University:
UCSD:
Where can I find information about HuBMAP experimental protocols?
All published protocols that are used in HuBMAP are available on protocols.io
How do you define raw and processed data?
- We define raw data as the data that comes directly off of the instrument (e.g. mass spectrometer, microscopy, etc.)
- Processed data has been transformed in some manner (e.g. normalization, background subtracted, aligned, etc.).
- The level of processing is defined by the data state as detailed below.
- Data states are dependent upon the modality.
- In general, data state 0 (raw data) and state 1 (processed data) are available on the portal for downloading.
Data State | Description | Example File Type |
---|---|---|
0 | Raw image data: This is the data that comes directly off the instrument without preprocessing. (may not always be included). | CZI, TIFF |
1 | Processed data: Can include stitching, thresholding, background subtraction, z-stack alignment, deconvolution | CZI, TIFF, OME-TIFF |
2 | Segmentation: Computationally predicted cell (nucleus, cytoplasm) and/or structural boundaries (tubules, ventricles, etc.) | CSV, TIFF |
3 | Annotation (Cells and Structures): Interpretation of microscopy image and/or segmentation in terms of biology (e.g. unhealthy vs healthy, cell-type, function, functional region). | TIFF, PNG |
Data State | Description | Example File Type |
---|---|---|
0 | Raw image data: This is the data that comes directly off the mass spectrometer without preprocessing; sometimes referred to as raw spectral data. | imzML |
1 | Processed imaging MS data: Can include peak alignment, intensity normalization, m/z recalibration. | CSV, OME-TIFF |
Data State | Description | Example File Type |
---|---|---|
0 | Raw data: This is the raw sequence data (unprocessed) generated directly by the sequence instrument in files either with Phred quality scores (fastq). | FASTQ |
1 | Aligned data: SAM files contain sequence data that has been aligned to a reference genome and includes chromosome coordinates. BAM files are compressed binary versions of SAM files. The reference genome used is hg38. | SAM, BAM |
In what formats is your data available?
- Imaging based raw or processed data is available as TIFF or OME.TIFF formats
- Segmented imaging data is generated as csv and TIFF formats
- Annotated imaging data is TIFF, PNG, and PDF
- Raw sequence data is provided as fastq and metadata via tsv
- Imaging mass spectrometry raw data is provided as a .d and processed data is imzml, or a csv and a series of ome-tiffs.
Can I get the code that you used to process data?
All available code can be found on the HuBMAP github page.
Can I find an explanation for the choice of HuBMAP pipelines and algorithms?
- See brief descriptions of the HuBMAP data analysis pipelines.
- All code made available to users can be found on the HuBMAP github page.
Which samples should I consider to be technical replicates?
- Technical replicates are repeated measurements of the same existing sample.
- As even serial tissue sections represent distinct samples, we do not consider any images of tissues to be technical replicates.
- Technical replicates for sequencing assays would be any sequencing libraries generated from the same sample or aliquot of cells or nuclei.
Which samples should I consider to be biological replicates?
- Biological replicates are datasets from samples that originate from the same organ and organ donor.
- As such, each dataset within the HuBMAP database that is provided for a given donor organ for a comparable anatomical region/structure would be a biological replicate.
How are each organ/tissue handled to maintain high reproducibility and validity for the assay?
Protocols.io detailed processing with QA/QC HuBMAP Method Development Community timeline
How have antibodies been verified?
Initially, all antibodies were validated by individual groups. With later data releases, complete antibody information, including antibody clone, vendor, RRID, conjugation information, etc. became available. Additional antibody validation standards were also implemented. For the development of our antibody validation levels, we followed the antibody verification guidelines established in the following manuscripts: A proposal for validation of antibodies- Uhlen M, Bandrowski A, Carr S, Edwards A, Ellenberg J, Lundberg E, Rimm DL, Rodriguez H, Hiltke T, Snyder M, Yamamoto T. Nat Methods. 2016 Oct;13(10):823-7. doi: 10.1038/nmeth.3995. Epub 2016 Sep 5.PMID: 27595404
- Voskuil, J., Bandrowski, A., Begley, C. G., Bradbury, A., Chalmers, A. D., Gomes, A. V., Hardcastle, T., Lund-Johansen, F., Plückthun, A., Roncador, G., Solache, A., Taussig, M. J., Trimmer, J. S., Williams, C., & Goodman, S. L. MAbs. 2020;12(1):1794421. doi:10.1080/19420862.2020.1794421. PMID: 32748696