Getting started

Input files

You can download our sample dataset used in the following Getting Started there: CEU Trio NA12878 (1.4 GB)

This is a Gemini database containing the NA12878 individuals with her parents called using the GATK best practices. This trio is commonly used for testing genomics tools. For demo purpose we also added CNVs of NA12878 (not for the parents) extracted from Mills et al. 2011.

Load a dataset

On the left side, in the gravity menu, click the load button a select your Gemini file.


To produce the network as shown below, the following prefilters were applied for loading our sample dataset:

  • impacts: HIGH and MED only
  • maximum allele frequency <= 0.01
  • ignore multiallelic variants
  • remove CNVs with repeat masker

Loading the data can take a few minutes depending on your computer and the size of your dataset. The prefilters make the loading faster but can only be changed by reloading the data.

Visualise your first network

  • Select your patient (and eventually his parents if your pedigree file is incomplete)


In our sample dataset, the patient is NA12878, her father is NA12891 and her mother is 12892.

  • In the Network tab, check the “run on all genes instead” option below the Pathways/Genes list.
  • In the Filter tab, you need 2 filters: cadd_scaled >= 15 (found in ‘Pathogenecity’) and sub_type != DUP (found in ‘Other’). Those will greatly reduce the number of variants displayed in the first network.
  • Hit the create a new network button

Your first Gravity network!



For the sample dataset, we made the BAM visualization available online. If you select a gene and double click a variant on the right panel, your web browser should open and show you the BAM section of this variant. This visualisation is performed using pileup.js: Vanderkam, Dan, et al. “pileup. js: a JavaScript library for interactive and in-browser visualization of genomic data.” Bioinformatics 32.15 (2016): 2378-2379.

Instructional video