Introduction
Code in this repository is provided under an MIT license. This documentation is provided under a CC-BY-4.0 license.
Visit our lab website here. Contact Benjamin Hogan at ben.hogan@petermac.org.
Highlights
We provide a pipeline with the following features:
Takes FASTQ files and a samplesheet as input
Returns candidate SNPs for further investigation
Includes interactive visualisation
Detailed documentation on multiple case studies
Graphical abstract
For more information, visit our poster here:
Abstract
Forward genetic screening in model organisms remains a powerful tool for identifying unexpected new genes involved in any biological process of interest. While biological and computational protocols for identifying genetic variation are mature, many workflows are designed for human and mouse data. We therefore developed a species-agnostic pipeline that compares any combination of mutant and reference samples. We demonstrate its capabilities with a case study in a common laboratory model organism, the zebrafish (Danio rerio). A large-scale forward genetic screen was performed using the chemical mutagen N-ethyl-N-nitrosourea (ENU) to discover new autosomal recessive mutations in biological processes of interest. To identify candidate causative mutations in the screened mutants, low-coverage (~10x) whole-genome sequencing (WGS) was performed. WGS data were generated from both mutant animals and control samples so that homozygous mutants could be mapped to the genome before candidate mutations were identified. We optimised a pipeline that (a) maps mutations to a region of genomic linkage, (b) identifies candidate SNPs predicted to be damaging to gene function, and (c) requires candidates to match the essential criteria: homozygous, ENU-induced mutations that are absent from control reference samples. Other features of the pipeline include modern, interactive visualisations. Future plans include AlphaFold integration as an additional predictor of SNP impact, as well as improved scalability with Nextflow. The pipeline is documented with a case study for reference and is hosted in an open-source software repository. The tool has minimal dependencies, with Apptainer images and Conda environments available for ease of use.
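The candidate criteria described in the abstract (homozygous in the mutant, consistent with ENU mutagenesis, absent from controls, and predicted damaging) can be illustrated with a minimal Python sketch. This is not the pipeline's implementation. The `Variant` record, its field names, and the helper functions are hypothetical. The sketch also assumes that "ENU-induced" can be approximated by keeping only single-base substitutions, and that predicted impact is available as a `HIGH`/`MODERATE`/`LOW`/`MODIFIER` label of the kind produced by common variant annotators; the annotator actually used by the pipeline is not stated here.

```python
from dataclasses import dataclass

# Hypothetical, simplified variant record. The pipeline's real data model
# and field names may differ; this is for illustration only.
@dataclass(frozen=True)
class Variant:
    chrom: str
    pos: int
    ref: str
    alt: str
    mutant_genotype: str           # VCF-style genotype, e.g. "1/1"
    control_genotypes: tuple = ()  # genotypes of control samples at this site
    impact: str = "MODIFIER"       # assumed annotator impact label

# Assumption: these labels are treated as "predicted damaging".
DAMAGING_IMPACTS = {"HIGH", "MODERATE"}


def split_alleles(genotype: str) -> list:
    """Split a VCF-style genotype such as '0/1' or '1|1' into allele codes."""
    return genotype.replace("|", "/").split("/")


def is_homozygous_alt(genotype: str) -> bool:
    """True when both alleles are the same non-reference, non-missing allele."""
    alleles = split_alleles(genotype)
    return (
        len(alleles) == 2
        and alleles[0] == alleles[1]
        and alleles[0] not in {"0", "."}
    )


def is_single_base_substitution(variant: Variant) -> bool:
    """Assumed proxy for 'ENU-induced': keep point substitutions only."""
    bases = set("ACGT")
    return (
        variant.ref in bases
        and variant.alt in bases
        and variant.ref != variant.alt
    )


def absent_in_controls(variant: Variant) -> bool:
    """True when every control genotype carries only reference alleles.

    Missing control calls ('./.') are conservatively treated as not absent.
    """
    return all(
        set(split_alleles(gt)) <= {"0"} for gt in variant.control_genotypes
    )


def is_candidate(variant: Variant) -> bool:
    """Combine the four criteria into a single filter."""
    return (
        is_homozygous_alt(variant.mutant_genotype)
        and is_single_base_substitution(variant)
        and absent_in_controls(variant)
        and variant.impact in DAMAGING_IMPACTS
    )


# Made-up example records, one per failure mode plus one passing variant.
variants = [
    Variant("chr5", 1000, "A", "T", "1/1", ("0/0", "0/0"), "HIGH"),  # passes
    Variant("chr5", 2000, "A", "G", "0/1", ("0/0",), "HIGH"),        # heterozygous
    Variant("chr5", 3000, "T", "C", "1/1", ("0/1",), "MODERATE"),    # seen in a control
    Variant("chr5", 4000, "AT", "A", "1/1", ("0/0",), "HIGH"),       # deletion, not a substitution
    Variant("chr5", 5000, "T", "A", "1/1", ("0/0",), "LOW"),         # not predicted damaging
]

candidates = [v for v in variants if is_candidate(v)]
print([(v.chrom, v.pos) for v in candidates])
```

In this toy data only the first record survives all four checks. A real workflow would additionally restrict candidates to the region of genomic linkage identified in step (a), which this sketch does not attempt.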
Cite us with
Manuscript and Zenodo citation to follow.