Home > Resources > Autism Cohorts > Simons Simplex Collection

Simons Simplex Collection

SSC family participants

The Simons Simplex Collection (SSC) is a core project and resource of the Simons Foundation Autism Research Initiative (SFARI). The SSC achieved its primary goal to establish a permanent repository of genetic samples from 2,600 simplex families, each of which has one child affected with an autism spectrum disorder, and unaffected parents and siblings.

Each genetic sample has an associated collection of data that provides a precise characterization, or phenotype, of the individual. Rigorous phenotyping maximizes the value of the resource for a wide variety of future research projects on the causes and mechanisms of autism. Roughly 1,500 families are available for recontacting for additional research projects via a registry called the Simons Simplex Community at the Interactive Autism Network ([email protected]).

The SSC was operated by SFARI in collaboration with 12 university-affiliated research clinics. The clinics identified and assessed potential SSC participants, with guidance from the University of Michigan Autism and Communication Disorders Center, to ensure uniformity across clinics. Active enrollment of participants ended in 2011.

Previous pioneering efforts to collect genetic samples focused on families that include multiple individuals with autism, most notably the Autism Genetic Resource Exchange, an ongoing effort to identify these so-called ‘multiplex’ families. The SSC differs from those efforts in its focus on simplex families, and in its clinic-based assessment and diagnosis.


SSC data:

All SSC data are available by request after logging into SFARI Base.

SSC genetic data and gene-expression data analyzed in the following publications are also available via the NIH’s National Database of Autism Research (NDAR) or NCBI's Gene Expression Omnibus (GEO):

Array-based comparative genomic hybridization (CGH) data

  1. Levy D.et al. Neuron 70, 886-897 (2011) PubMed, GEO data

Single nucleotide polymorphism (SNP) genotype data

  1. Sanders S.J. et al. Neuron 70, 863-885 (2011) PubMed
  2. Sanders S.J. et al. Neuron 87,1215-1233 (2015) PubMed

Whole-exome sequencing data

  1. O'Roak B.J. et al. Nat. Genet. 43, 585-589 (2011) PubMed
  2. Sanders S.J. et al. Nature 485, 237-241 (2012) PubMed, NDAR data
  3. O'Roak B.J. et al. Nature 485, 246-250 (2012) PubMed, NDAR data
  4. Iossifov I. et al. Neuron 74, 285-299 (2012) PubMed, NDAR data
  5. Iossifov I. et al. Nature 515, 216-221 (2014) PubMed, NDAR data
  6. Krumm N. et al. Nat. Genetics 47, 582-588 (2015) PubMed, NDAR data

Whole-genome sequencing data

  1. Turner T. et al. Am. J. Hum. Genet. 98, 58-74 (2016) PubMed, Data available through SFARI Base (please reference accessions SFARI_SSC_WGS_P [40 quad families] and SFARI_SSC_WGS_trioP [13 trio families]), variant calls are also available in NDAR.

Gene-expression data in lymphoblastoid cell lines

  1. Luo R. et al. Am. J. Hum. Genet. 91, 38-55 (2012) PubMed, GEO data

DNA methylation profiling array-based data

  1. Alisch R.S. et al. Genome Res. 22, 623-632 (2012) PubMed, Data only available through GEO

Additional links: