Interpretable Machine Learning in Plant Genomes

Interpretable Machine Learning in Plant Genomes
Author: Christina Brady Azodi
Publisher:
Total Pages: 217
Release: 2019
Genre: Electronic dissertations
ISBN: 9781392717943


Download Interpretable Machine Learning in Plant Genomes Book in PDF, Epub and Kindle

Complex systems are ubiquitous in genetics and genomics. From the regulation of gene expression to the genetic basis of complex traits, we see that complex networks of diverse cellular molecules underpin the natural world. Driven by technological advances, today's researchers have access to large amounts of omics data from diverse species. At the same time, improvements in computer processing and algorithms have produced more powerful computational tools. Taken together, these advances mean that those working at the interface of data science and biology are poised to better model and understand complex biological systems. The research in this dissertation demonstrates how a data-driven approach can be used to better understand three complex systems: (1) transcriptional response to single and combined heat and drought stress in Arabidopsis thaliana, (2) the genetic basis of flowering time, a complex trait, in Zea mays, and (3) the social basis for opinions and beliefs about biotechnology products.To study the first system, we generated models of the cis-regulatory code from information about DNA sequence and additional omics levels using both classic machine learning and deep learning algorithms. We identified 1,061 putative cis-regulatory elements associated with different patterns of response to single and combined heat and drought stress and found that information about additional levels of regulation, especially chromatin accessibility and known transcription factor binding, improved our models of the cis-regulatory code. To study the second system, we generated phenotype prediction models for flowering time, height, and yield based on either genetic markers or transcript levels at the seedling stage. We found that, while genetic marker-based models performed better than transcript level-based models, models that integrated both types of data performed best. Furthermore, transcript-based models were more useful for finding genes known to be associated with flowering time, highlighting how using additional levels of omics data can improve our ability to understand the genetic basis of complex traits. Finally, to study the third system, we integrated 29 characteristics about a person (e.g. age, political ideology, education, values, environmental beliefs) into a machine learning model that would predict an individual's beliefs and opinions about five different types of biotechnology products (e.g. biofortification, biopharmaceuticals). While this approach was particularly usefully for identifying individuals that were broadly supportive of biotechnology, finding characteristics of individuals with negative or conditional (i.e. support product A, but not B) opinions was more challenging, highlighting the complexity of public opinions about biotechnology.


Interpretable Machine Learning in Plant Genomes
Language: en
Pages: 217
Authors: Christina Brady Azodi
Categories: Electronic dissertations
Type: BOOK - Published: 2019 - Publisher:

GET EBOOK

Complex systems are ubiquitous in genetics and genomics. From the regulation of gene expression to the genetic basis of complex traits, we see that complex netw
Interpretable Machine Learning
Language: en
Pages: 320
Authors: Christoph Molnar
Categories: Artificial intelligence
Type: BOOK - Published: 2020 - Publisher: Lulu.com

GET EBOOK

This book is about making machine learning models and their decisions interpretable. After exploring the concepts of interpretability, you will learn about simp
Interpretable Machine Learning Methods for Regulatory and Disease Genomics
Language: en
Pages:
Authors: Peyton Greis Greenside
Categories:
Type: BOOK - Published: 2018 - Publisher:

GET EBOOK

It is an incredible feat of nature that the same genome contains the code to every cell in each living organism. From this same genome, each unique cell type ga
Interpretable Machine Learning for Scientific Discovery in Regulatory Genomics
Language: en
Pages:
Authors: Avanti Shrikumar
Categories:
Type: BOOK - Published: 2020 - Publisher:

GET EBOOK

All cells in our body have approximately the same DNA sequence, yet different cell-types have distinct behavior due to differential expression of genes. This ce
Elements of Causal Inference
Language: en
Pages: 289
Authors: Jonas Peters
Categories: Computers
Type: BOOK - Published: 2017-11-29 - Publisher: MIT Press

GET EBOOK

A concise and self-contained introduction to causal inference, increasingly important in data science and machine learning. The mathematization of causality is