The Sequences of 150,119 Genomes in the UK Biobank

Bjarni V. Halldorsson*, Hannes P. Eggertsson, Kristjan H. S. Moore, Hannes Hauswedell, Ogmundur Eiriksson, Magnus O. Ulfarsson, Gunnar Palsson, Marteinn T. Hardarson, Asmundur Oddsson, Brynjar O. Jensson, Snaedis Kristmundsdottir, Brynja D. Sigurpalsdottir, Olafur A. Stefansson, Doruk Beyter, Guillaume Holley, Vinicius Tragante, Arnaldur Gylfason, Pall I. Olason, Florian Zink, Margret AsgeirsdottirSverrir T. Sverrisson, Brynjar Sigurdsson, Sigurjon A. Gudjonsson, Gunnar T. Sigurdsson, Gisli H. Halldorsson, Gardar Sveinbjornsson, Kristjan Norland, Unnur Styrkarsdottir, Droplaug N. Magnusdottir, Steinunn Snorradottir, Kari Kristinsson, Emilia Sobech, Helgi Jonsson, Arni J. Geirsson, Isleifur Olafsson, Palmi Jonsson, Ole Birger Pedersen, Christian Erikstrup, Søren Brunak, Sisse Rye Ostrowski, Steffen Andersen, Karina Banasik, Kristoffer Sølvsten Burgdorf, Maria Didriksen, Khoa Manh Dinh, Daniel F. Gudbjartsson, Thomas Folkmann Hansen, Henrik Hjalgrim, Gregor Borut Ernst Jemec, Poul Jennum, Pär Ingemar Johansson, Margit Anita Hørup Larsen, Susan Mikkelsen, Kasper René Nielsen, Mette Nyegaard, Susanne Sækmose, Erik Sørensen, Unnur Thorsteinsdottir, Mie Topholmt Brun, Henrik Ullum, Thomas Werge, Gudmar Thorleifsson, Frosti Jonsson, Pall Melsted, Ingileif Jonsdottir, Thorunn Rafnar, Hilma Holm, Hreinn Stefansson, Jona Saemundsdottir, Daniel F. Gudbjartsson, Olafur T. Magnusson, Gisli Masson, Agnar Helgason, Hakon Jonsson, Patrick Sulem, Kari Stefansson*, DBDS Genetic Consortium

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

28 Downloads (Pure)

Abstrakt

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.
OriginalsprogEngelsk
TidsskriftNature
Vol/bind607
Udgave nummer7920
Sider (fra-til)732–740
Antal sider9
ISSN0028-0836
DOI
StatusUdgivet - 28 jul. 2022

Bibliografisk note

Published online: 20 Jul 2022.

Emneord

  • DNA sequencing
  • Genetic markers
  • Genetic variation
  • Genetics research
  • Genome-wide association studies

Citationsformater