Software and data resources
Research overview
The development of software tools and data platforms is central to the work we do.
OpenGWAS
In collaboration with Gibran Hemani (IEU Mendelian randomization programme), Philip Haycock (CRUK Integrative Cancer Epidemiology Programme), Ben Elsworth, Matt Lyon, Tom Palmer and others, we developed the OpenGWAS platform. This comprises one of the largest open access databases of genome-wide association studies (GWAS) in the world (ElasticSearch running on Oracle cloud), and is integrated with a suite of R and Python tools to enable post-GWAS analysis (including for Mendelian randomization).
EpiGraphDB
A core output of the group, the EpiGraphDB platform is a knowledge graph comprising a graph database (Neo4J), a web interface and API to query the graph, and an R package. EpiGraphDB includes data from several of our drug-target prioritization projects, is the datasource for the ASQ application, and has been used to systematically identify disease risk factors.
Other tools
We have a number of other software tools and data platforms (some listed below), and the MRCIEU software page provides a more extensive list of MRC IEU software tools.
We aim to make all software open source and data resources open access to maximize the impact of our research.