Computational Methods for Large-scale Single-cell RNA-seq and Multimodal Data

Download Computational Methods for Large-scale Single-cell RNA-seq and Multimodal Data PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : pages
Book Rating : 4.:/5 (128 download)

DOWNLOAD NOW!


Book Synopsis Computational Methods for Large-scale Single-cell RNA-seq and Multimodal Data by : Văn Hoàn Đỗ

Download or read book Computational Methods for Large-scale Single-cell RNA-seq and Multimodal Data written by Văn Hoàn Đỗ and published by . This book was released on 2021 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt:

Benchmarking Statistical and Machine-Learning Methods for Single-cell RNA Sequencing Data

Download Benchmarking Statistical and Machine-Learning Methods for Single-cell RNA Sequencing Data PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : 203 pages
Book Rating : 4.:/5 (129 download)

DOWNLOAD NOW!


Book Synopsis Benchmarking Statistical and Machine-Learning Methods for Single-cell RNA Sequencing Data by : Nan Xi

Download or read book Benchmarking Statistical and Machine-Learning Methods for Single-cell RNA Sequencing Data written by Nan Xi and published by . This book was released on 2021 with total page 203 pages. Available in PDF, EPUB and Kindle. Book excerpt: The large-scale, high-dimensional, and sparse single-cell RNA sequencing (scRNA-seq) data have raised great challenges in the pipeline of data analysis. A large number of statistical and machine learning methods have been developed to analyze scRNA-seq data and answer related scientific questions. Although different methods claim advantages in certain circumstances, it is difficult for users to select appropriate methods for their analysis tasks. Benchmark studies aim to provide recommendations for method selection based on an objective, accurate, and comprehensive comparison among cutting-edge methods. They can also offer suggestions for further methodological development through massive evaluations conducted on real data. In Chapter 2, we conduct the first, systematic benchmark study of nine cutting-edge computational doublet-detection methods. In scRNA-seq, doublets form when two cells are encapsulated into one reaction volume by chance. The existence of doublets, which appear as but are not real cells, is a key confounder in scRNA-seq data analysis. Computational methods have been developed to detect doublets in scRNA-seq data; however, the scRNA-seq field lacks a comprehensive benchmarking of these methods, making it difficult for researchers to choose an appropriate method for their specific analysis needs. Our benchmark study compares doublet-detection methods in terms of their detection accuracy under various experimental settings, impacts on downstream analyses, and computational efficiency. Our results show that existing methods exhibited diverse performance and distinct advantages in different aspects. In Chapter 3, we develop an R package DoubletCollection to integrate the installation and execution of different doublet-detection methods. Traditional benchmark studies can be quickly out-of-date due to their static design and the rapid growth of available methods. DoubletCollection addresses this issue in benchmarking doublet-detection methods for scRNA-seq data. DoubletCollection provides a unified interface to perform and visualize downstream analysis after doublet-detection. Additionally, we created a protocol using DoubletCollection to execute and benchmark doublet-detection methods. This protocol can automatically accommodate new doublet-detection methods in the fast-growing scRNA-seq field. In Chapter 4, we conduct the first comprehensive empirical study to explore the best modeling strategy for autoencoder-based imputation methods specific to scRNA-seq data. The autoencoder-based imputation method is a family of promising methods to denoise sparse scRNA-seq data; however, the design of autoencoders has not been formally discussed in the literature. Current autoencoder-based imputation methods either borrow the practice from other fields or design the model on an ad hoc basis. We find that the method performance is sensitive to the key hyperparameter of autoencoders, including architecture, activation function, and regularization. Their optimal settings on scRNA-seq are largely different from those on other data types. Our results emphasize the importance of exploring hyperparameter space in such complex and flexible methods. Our work also points out the future direction of improving current methods.

Computational Methods for Single-Cell Data Analysis

Download Computational Methods for Single-Cell Data Analysis PDF Online Free

Author :
Publisher : Humana Press
ISBN 13 : 9781493990566
Total Pages : 271 pages
Book Rating : 4.9/5 (95 download)

DOWNLOAD NOW!


Book Synopsis Computational Methods for Single-Cell Data Analysis by : Guo-Cheng Yuan

Download or read book Computational Methods for Single-Cell Data Analysis written by Guo-Cheng Yuan and published by Humana Press. This book was released on 2019-02-14 with total page 271 pages. Available in PDF, EPUB and Kindle. Book excerpt: This detailed book provides state-of-art computational approaches to further explore the exciting opportunities presented by single-cell technologies. Chapters each detail a computational toolbox aimed to overcome a specific challenge in single-cell analysis, such as data normalization, rare cell-type identification, and spatial transcriptomics analysis, all with a focus on hands-on implementation of computational methods for analyzing experimental data. Written in the highly successful Methods in Molecular Biology series format, chapters include introductions to their respective topics, lists of the necessary materials and reagents, step-by-step, readily reproducible laboratory protocols, and tips on troubleshooting and avoiding known pitfalls. Authoritative and cutting-edge, Computational Methods for Single-Cell Data Analysis aims to cover a wide range of tasks and serves as a vital handbook for single-cell data analysis.

Computational Methods for the Analysis of Single-Cell RNA-Seq Data

Download Computational Methods for the Analysis of Single-Cell RNA-Seq Data PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : pages
Book Rating : 4.:/5 (119 download)

DOWNLOAD NOW!


Book Synopsis Computational Methods for the Analysis of Single-Cell RNA-Seq Data by : Marmar Moussa

Download or read book Computational Methods for the Analysis of Single-Cell RNA-Seq Data written by Marmar Moussa and published by . This book was released on 2019 with total page pages. Available in PDF, EPUB and Kindle. Book excerpt: Single cell transcriptional profiling is critical for understanding cellular heterogeneity and identification of novel cell types and for studying growth and development of tissues and tumors. Leveraging recent advances in single cell RNA sequencing (scRNA-Seq) technology requires novel methods that are robust to high levels of technical and biological noise and scale to datasets of millions of cells. In this work, we address several challenges in the analysis work-flow of scRNA-Seq data: First, we propose novel computational approaches for unsupervised clustering of scRNA-Seq data based on Term Frequency - Inverse Document Frequency (TF-IDF) transformation that has been successfully used in text analysis. Here, we present empirical experimental results showing that TF-IDF methods consistently outperform commonly used scRNA-Seq clustering approaches. Second, we study the so called 'drop-out' effect that is considered one of the most notable challenges in scRNA-Seq analysis, where only a fraction of the transcriptome of each cell is captured. The random nature of drop-outs, however, makes it possible to consider imputation methods as means of correcting for drop-outs. In this part we study existing scRNA-Seq imputation methods and propose a novel iterative imputation approach based on efficiently computing highly similar cells. We then present results of a comprehensive assessment of existing and proposed methods on real scRNA-Seq datasets with varying per cell sequencing depth. Third, we present a computational method for assigning and/or ordering cells based on their cell-cycle stages from scRNA-Seq. And finally, we present a web-based interactive computational work-flow for analysis and visualization of scRNA-seq data.

Computational Methods for Studying Cellular Differentiation Using Single-cell RNA-sequencing

Download Computational Methods for Studying Cellular Differentiation Using Single-cell RNA-sequencing PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : 176 pages
Book Rating : 4.:/5 (122 download)

DOWNLOAD NOW!


Book Synopsis Computational Methods for Studying Cellular Differentiation Using Single-cell RNA-sequencing by : Hui Ting Grace Yeo

Download or read book Computational Methods for Studying Cellular Differentiation Using Single-cell RNA-sequencing written by Hui Ting Grace Yeo and published by . This book was released on 2020 with total page 176 pages. Available in PDF, EPUB and Kindle. Book excerpt: Single-cell RNA-sequencing (scRNA-seq) enables transcriptome-wide measurements of single cells at scale. As scRNA-seq datasets grow in complexity and size, more complex computational methods are required to distill raw data into biological insight. In this thesis, we introduce computational methods that enable analysis of novel scRNA-seq perturbational assays. We also develop computational models that seek to move beyond simple observations of cell states toward more complex models of underlying biological processes. In particular, we focus on cellular differentiation, which is the process by which cells acquire some specific form or function. First, we introduce barcodelet scRNA-seq (barRNA-seq), an assay which tags individual cells with RNA ‘barcodelets’ to identify them based on the treatments they receive. We apply barRNA-seq to study the effects of the combinatorial modulation of signaling pathways during early mESC differentiation toward germ layer and mesodermal fates. Using a data-driven analysis framework, we identify combinatorial signaling perturbations that drive cells toward specific fates. Second, we describe poly-adenine CRISPR gRNA-based scRNA-seq (pAC-seq), a method that enables the direct observation of guide RNAs (gRNAs) in scRNA-seq. We apply it to assess the phenotypic consequences of CRISPR/Cas9-based alterations of gene cis-regulatory regions. We find that power to detect transcriptomic effects depend on factors such as rate of mono/biallelic loss, baseline gene expression, and the number of cells per target gRNA. Third, we propose a generative model for analyzing scRNA-seq containing unwanted sources of variation. Using only weak supervision from a control population, we show that the model enables removal of nuisance effects from the learned representation without prior knowledge of the confounding factors. Finally, we develop a generative modeling framework that learns an underlying differentiation landscape from population-level time-series data. We validate the modeling framework on an experimental lineage tracing dataset, and show that it is able to recover the expected effects of known modulators of cell fate in hematopoiesis.

efficient statistical and computational methods for large scale sequencing data

Download efficient statistical and computational methods for large scale sequencing data PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : 0 pages
Book Rating : 4.:/5 (143 download)

DOWNLOAD NOW!


Book Synopsis efficient statistical and computational methods for large scale sequencing data by :

Download or read book efficient statistical and computational methods for large scale sequencing data written by and published by . This book was released on with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt:

Machine Learning in Single-Cell RNA-seq Data Analysis

Download Machine Learning in Single-Cell RNA-seq Data Analysis PDF Online Free

Author :
Publisher : Springer Nature
ISBN 13 : 9819767032
Total Pages : 104 pages
Book Rating : 4.8/5 (197 download)

DOWNLOAD NOW!


Book Synopsis Machine Learning in Single-Cell RNA-seq Data Analysis by : Khalid Raza

Download or read book Machine Learning in Single-Cell RNA-seq Data Analysis written by Khalid Raza and published by Springer Nature. This book was released on with total page 104 pages. Available in PDF, EPUB and Kindle. Book excerpt:

Relative Distribution Methods in the Social Sciences

Download Relative Distribution Methods in the Social Sciences PDF Online Free

Author :
Publisher : Springer Science & Business Media
ISBN 13 : 0387226583
Total Pages : 272 pages
Book Rating : 4.3/5 (872 download)

DOWNLOAD NOW!


Book Synopsis Relative Distribution Methods in the Social Sciences by : Mark S. Handcock

Download or read book Relative Distribution Methods in the Social Sciences written by Mark S. Handcock and published by Springer Science & Business Media. This book was released on 2006-05-10 with total page 272 pages. Available in PDF, EPUB and Kindle. Book excerpt: This monograph presents methods for full comparative distributional analysis based on the relative distribution. This provides a general integrated framework for analysis, a graphical component that simplifies exploratory data analysis and display, a statistically valid basis for the development of hypothesis-driven summary measures, and the potential for decomposition - enabling the examination of complex hypotheses regarding the origins of distributional changes within and between groups. Written for data analysts and those interested in measurement, the text can also serve as a textbook for a course on distributional methods.

Statistical Simulation and Analysis of Single-cell RNA-seq Data

Download Statistical Simulation and Analysis of Single-cell RNA-seq Data PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : 0 pages
Book Rating : 4.:/5 (141 download)

DOWNLOAD NOW!


Book Synopsis Statistical Simulation and Analysis of Single-cell RNA-seq Data by : Tianyi Sun

Download or read book Statistical Simulation and Analysis of Single-cell RNA-seq Data written by Tianyi Sun and published by . This book was released on 2023 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: The recent development of single-cell RNA sequencing (scRNA-seq) technologies has revolutionized transcriptomic studies by revealing the genome-wide gene expression levels within individual cells. In contrast to bulk RNA sequencing, scRNA-seq technology captures cell-specific transcriptome landscapes, which can reveal crucial information about cell-to-cell heterogeneity across different tissues, organs, and systems and enable the discovery of novel cell types and new transient cell states. According to search results from PubMed, from 2009-2023, over 5,000 published studies have generated datasets using this technology. Such large volumes of data call for high-quality statistical methods for their analysis. In the three projects of this dissertation, I have explored and developed statistical methods to model the marginal and joint gene expression distributions and determine the latent structure type for scRNA-seq data. In all three projects, synthetic data simulation plays a crucial role. My first project focuses on the exploration of the Beta-Poisson hierarchical model for the marginal gene expression distribution of scRNA-seq data. This model is a simplified mechanistic model with biological interpretations. Through data simulation, I demonstrate three typical behaviors of this model under different parameter combinations, one of which can be interpreted as one source of the sparsity and zero inflation that is often observed in scRNA-seq datasets. Further, I discuss parameter estimation methods of this model and its other applications in the analysis of scRNA-seq data. My second project focuses on the development of a statistical simulator, scDesign2, to generate realistic synthetic scRNA-seq data. Although dozens of simulators have been developed before, they lack the capacity to simultaneously achieve the following three goals: preserving genes, capturing gene correlations, and generating any number of cells with varying sequencing depths. To fill in this gap, scDesign2 is developed as a transparent simulator that achieves all three goals and generates high-fidelity synthetic data for multiple scRNA-seq protocols and other single-cell gene expression count-based technologies. Compared with existing simulators, scDesign2 is advantageous in its transparent use of probabilistic models and is unique in its ability to capture gene correlations via copula. We verify that scDesign2 generates more realistic synthetic data for four scRNA-seq protocols (10x Genomics, CEL-Seq2, Fluidigm C1, and Smart-Seq2) and two single-cell spatial transcriptomics protocols (MERFISH and pciSeq) than existing simulators do. Under two typical computational tasks, cell clustering and rare cell type detection, we demonstrate that scDesign2 provides informative guidance on deciding the optimal sequencing depth and cell number in single-cell RNA-seq experimental design, and that scDesign2 can effectively benchmark computational methods under varying sequencing depths and cell numbers. With these advantages, scDesign2 is a powerful tool for single-cell researchers to design experiments, develop computational methods, and choose appropriate methods for specific data analysis needs. My third project focuses on deciding latent structure types for scRNA-seq datasets. Clustering and trajectory inference are two important data analysis tasks that can be performed for scRNA-seq datasets and will lead to different interpretations. However, as of now, there is no principled way to tell which one of these two types of analysis results is more suitable to describe a given dataset. In this project, we propose two computational approaches that aim to distinguish cluster-type vs. trajectory-type scRNA-seq datasets. The first approach is based on building a classifier using eigenvalue features of the gene expression covariance matrix, drawing inspiration from random matrix theory (RMT). The second approach is based on comparing the similarity of real data and simulated data generated by assuming the cell latent structure as clusters or a trajectory. While both approaches have limitations, we show that the second approach gives more promising results and has room for further improvements.

Statistical and Computational Methods for Single-cell Transcriptome Sequencing and Metagenomics

Download Statistical and Computational Methods for Single-cell Transcriptome Sequencing and Metagenomics PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : 246 pages
Book Rating : 4.:/5 (16 download)

DOWNLOAD NOW!


Book Synopsis Statistical and Computational Methods for Single-cell Transcriptome Sequencing and Metagenomics by : Fanny Perraudeau

Download or read book Statistical and Computational Methods for Single-cell Transcriptome Sequencing and Metagenomics written by Fanny Perraudeau and published by . This book was released on 2018 with total page 246 pages. Available in PDF, EPUB and Kindle. Book excerpt: I propose statistical methods and software for the analysis of single-cell transcriptome sequencing (scRNA-seq) and metagenomics data. Specifically, I present a general and flexible zero-inflated negative binomial-based wanted variation extraction (ZINB-WaVE) method, which extracts low-dimensional signal from scRNA-seq read counts, accounting for zero inflation (dropouts), over-dispersion, and the discrete nature of the data. Additionally, I introduce an application of the ZINB-WaVE method that identifies excess zero counts and generates gene and cell-specific weights to unlock bulk RNA-seq differential expression pipelines for zero-inflated data, boosting performance for scRNA-seq analysis. Finally, I present a method to estimate bacterial abundances in human metagenomes using full-length 16S sequencing reads.

Technology and Method Developments for High-throughput Translational Medicine

Download Technology and Method Developments for High-throughput Translational Medicine PDF Online Free

Author :
Publisher : Stanford University
ISBN 13 :
Total Pages : 122 pages
Book Rating : 4.F/5 ( download)

DOWNLOAD NOW!


Book Synopsis Technology and Method Developments for High-throughput Translational Medicine by : Junhee Seok

Download or read book Technology and Method Developments for High-throughput Translational Medicine written by Junhee Seok and published by Stanford University. This book was released on 2011 with total page 122 pages. Available in PDF, EPUB and Kindle. Book excerpt: Translation of knowledge from basic science to medicine is essential to improving both clinical research and practice. In this translation, high-throughput genomic approaches can greatly accelerate our understanding of molecular mechanisms of diseases. A successful high-throughput genomic study of disease requires, first, comprehensive and efficient platforms to collect genomic data from clinical samples, and second, computational analysis methods that utilize databases of prior biological knowledge together with experimental data to derive clinically meaningful results. In this thesis, we discuss the development of a new microarray platform as well as computational methods for knowledge-based analysis along with their applications in clinical research. First, we and other colleagues have developed a new high-density oligonucleo-tide array of the human transcriptome for high-throughput and cost-efficient analysis of patient samples in clinical studies. This array allows comprehensive examination of gene expression and genome-wide identification of alternative splicing, and also pro-vides assays for coding SNP detection and non-coding transcripts. Compared with high-throughput mRNA sequencing technology, we show that this array is highly re-producible in estimating gene and exon expression, and sensitive in detecting expres-sion changes. In addition, the exon-exon junction feature of this array is shown to im-prove detection efficiency for mRNA alternative splicing when combined with an ap-propriate computational method. We implemented the use of this array in a multi-center clinical program and have obtained comparable levels of high quality and re-producible data. With low costs and high throughputs for sample processing, we antic-ipate that this array platform will have a wide range of applications in high-throughput clinical studies. Second, we investigated knowledge-based methods that utilize prior know-ledge from biology and medicine to improve analysis and interpretation of high-throughput genomic data. We have developed knowledge-based methods to enrich our prior knowledge, illustrate dynamic response to external stimulus, and identify distur-bances in cellular pathways by chemical exposure, as well as discover hidden biological signatures for the prediction of patient outcomes. Finally, we applied a knowledge-based approach in a large scale genomic study of trauma patients. Cooperating with clinical information, prior knowledge improved the interpretation of common and dif-ferential genomic response to injury, and provided efficient risk assessment for patient outcomes. The clinical and genomic data as well as analysis results in this trauma study were systematically organized and provided to research communities as new knowledge of traumatic injury. The microarray platform and knowledge-based methods presented in this thesis provide appropriate research tools for high-throughput translational medicine in a large clinical setting. This thesis is expected to advance understanding and treatment for dis-eases, and finally, improve public health.

Machine Learning Methods for Multi-Omics Data Integration

Download Machine Learning Methods for Multi-Omics Data Integration PDF Online Free

Author :
Publisher : Springer Nature
ISBN 13 : 303136502X
Total Pages : 171 pages
Book Rating : 4.0/5 (313 download)

DOWNLOAD NOW!


Book Synopsis Machine Learning Methods for Multi-Omics Data Integration by : Abedalrhman Alkhateeb

Download or read book Machine Learning Methods for Multi-Omics Data Integration written by Abedalrhman Alkhateeb and published by Springer Nature. This book was released on 2023-12-15 with total page 171 pages. Available in PDF, EPUB and Kindle. Book excerpt: The advancement of biomedical engineering has enabled the generation of multi-omics data by developing high-throughput technologies, such as next-generation sequencing, mass spectrometry, and microarrays. Large-scale data sets for multiple omics platforms, including genomics, transcriptomics, proteomics, and metabolomics, have become more accessible and cost-effective over time. Integrating multi-omics data has become increasingly important in many research fields, such as bioinformatics, genomics, and systems biology. This integration allows researchers to understand complex interactions between biological molecules and pathways. It enables us to comprehensively understand complex biological systems, leading to new insights into disease mechanisms, drug discovery, and personalized medicine. Still, integrating various heterogeneous data types into a single learning model also comes with challenges. In this regard, learning algorithms have been vital in analyzing and integrating these large-scale heterogeneous data sets into one learning model. This book overviews the latest multi-omics technologies, machine learning techniques for data integration, and multi-omics databases for validation. It covers different types of learning for supervised and unsupervised learning techniques, including standard classifiers, deep learning, tensor factorization, ensemble learning, and clustering, among others. The book categorizes different levels of integrations, ranging from early, middle, or late-stage among multi-view models. The underlying models target different objectives, such as knowledge discovery, pattern recognition, disease-related biomarkers, and validation tools for multi-omics data. Finally, the book emphasizes practical applications and case studies, making it an essential resource for researchers and practitioners looking to apply machine learning to their multi-omics data sets. The book covers data preprocessing, feature selection, and model evaluation, providing readers with a practical guide to implementing machine learning techniques on various multi-omics data sets.

Revealing Translational and Fundamental Insights Via Computational Analysis of Single-cell Sequencing Data

Download Revealing Translational and Fundamental Insights Via Computational Analysis of Single-cell Sequencing Data PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : 0 pages
Book Rating : 4.:/5 (14 download)

DOWNLOAD NOW!


Book Synopsis Revealing Translational and Fundamental Insights Via Computational Analysis of Single-cell Sequencing Data by : Jessica Lu Zhou

Download or read book Revealing Translational and Fundamental Insights Via Computational Analysis of Single-cell Sequencing Data written by Jessica Lu Zhou and published by . This book was released on 2023 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: Single-cell sequencing has emerged as a powerful tool for dissecting cellular heterogeneity and providing cell type-specific biological insights. Single-cell sequencing technologies have rapidly proliferated over the last decade, leading to an explosion of data generated from such experiments. However, several challenges exist in the computational analysis of single-cell sequencing data due to its large and complex nature, including the need for sophisticated statistical methods to distinguish biologically meaningful signals from noise, the integration of single-cell sequencing data with other types of biological information, and the development of scalable and reproducible computational pipelines that can handle the large and complex nature of the data. In this dissertation, I present two distinct projects analyzing single-cell sequencing data. The first is of an analytical nature and tackles a translational question. In this project, I built computational pipelines for processing and analyzing single-nucleus RNA- and ATAC-sequencing datasets generated from the amygdalae of genetically diverse heterogenous stock rats, which were subjected to a behavioral protocol for studying addiction-like behaviors following cocaine self-administration. In doing so, I provide a standard reference for analyzing such data as well as reveal cell type-specific insights into the molecular underpinnings of cocaine addiction. The second project is oriented towards methods development and seeks to understand the fundamental biological question of transcriptional regulation. Here, I developed a statistical framework for simulating and modeling data from single-cell CRISPR regulatory screens and used it to perform a genome-wide interrogation of epistatic-like interactions between enhancer pairs. I found that multiple enhancers act together in a multiplicative fashion with little evidence for interactive effects between them. This work revealed novel insights into the collective behavior of multiple regulatory elements and provides a tool that can be applied to future datasets generated from such experiments. This dissertation exemplifies how computational methods can be applied in different contexts to extract meaning from a variety of single-cell sequencing modalities. By tackling both a translational and fundamental biological question, I have showcased the breadth of what can be revealed by studying single-cell sequencing data and the computational methods necessary to extract this information.

Next Steps for Functional Genomics

Download Next Steps for Functional Genomics PDF Online Free

Author :
Publisher : National Academies Press
ISBN 13 : 0309676738
Total Pages : 201 pages
Book Rating : 4.3/5 (96 download)

DOWNLOAD NOW!


Book Synopsis Next Steps for Functional Genomics by : National Academies of Sciences, Engineering, and Medicine

Download or read book Next Steps for Functional Genomics written by National Academies of Sciences, Engineering, and Medicine and published by National Academies Press. This book was released on 2020-12-18 with total page 201 pages. Available in PDF, EPUB and Kindle. Book excerpt: One of the holy grails in biology is the ability to predict functional characteristics from an organism's genetic sequence. Despite decades of research since the first sequencing of an organism in 1995, scientists still do not understand exactly how the information in genes is converted into an organism's phenotype, its physical characteristics. Functional genomics attempts to make use of the vast wealth of data from "-omics" screens and projects to describe gene and protein functions and interactions. A February 2020 workshop was held to determine research needs to advance the field of functional genomics over the next 10-20 years. Speakers and participants discussed goals, strategies, and technical needs to allow functional genomics to contribute to the advancement of basic knowledge and its applications that would benefit society. This publication summarizes the presentations and discussions from the workshop.

Computational Methods to Elucidate Post-transcriptional Gene Regulation Using High-throughput Sequencing Data

Download Computational Methods to Elucidate Post-transcriptional Gene Regulation Using High-throughput Sequencing Data PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : 151 pages
Book Rating : 4.:/5 (111 download)

DOWNLOAD NOW!


Book Synopsis Computational Methods to Elucidate Post-transcriptional Gene Regulation Using High-throughput Sequencing Data by : Zijun Zhang

Download or read book Computational Methods to Elucidate Post-transcriptional Gene Regulation Using High-throughput Sequencing Data written by Zijun Zhang and published by . This book was released on 2019 with total page 151 pages. Available in PDF, EPUB and Kindle. Book excerpt: Post-transcriptional regulation plays a central role in the flow of information from genotypes to phenotypes in the cellular machinery. Disruptions of post-transcriptional regulatory mechanisms underlie many human diseases. As high-throughput sequencing technology becomes the standard protocol in studying post-transcriptional regulation, large-scale data in public domain provides an unprecedented resource to understand the complex regulatory networks of gene regulation, while also presents challenges for the development of computational methods to analyze and interpret empirical data into biological knowledge. In this dissertation, novel statistical models and computational frameworks were developed to elucidate post-transcriptional gene regulation using high-throughput sequencing data. Utilizing these new tools, we demonstrated that we can robustly characterize the molecular signals and variations across diverse biological states, and more importantly, identify bona fide regulatory events that are inaccessible by conventional analyses. The first part of the dissertation describes CLIP-seq Analysis of Multi-mapped reads (CLAM), a comprehensive computational pipeline for analyzing Crosslinking or RNA immunoprecipitation followed by sequencing (CLIP/RIP-seq) data. As CLIP-seq/RIP-seq reads are short, existing computational tools focus on uniquely mapped reads, while reads mapped to multiple loci are discarded. CLAM uses an expectation-maximization algorithm to assign multi-mapped reads and calls peaks combining uniquely and multi-mapped reads. CLAM recovered a large number of novel RNA regulatory sites inaccessible by uniquely mapped reads in datasets with different regulatory features, providing a useful tool to discover novel protein-RNA interactions and RNA modification sites from CLIP-seq and RIP-seq data. The second part of the dissertation presents Deep-learning Augmented RNA-seq analysis of Transcript Splicing (DARTS), a novel computational framework that integrates deep learning-based predictions with empirical RNA-seq datasets to infer differential alternative splicing between biological conditions. A major limitation of RNA sequencing (RNA-seq) analysis of alternative splicing is its reliance on high sequencing coverage. DARTS employs a deep neural network (DNN) that predicts differential alternative splicing using cis RNA sequence features and trans RNA binding protein levels. DARTS DNN trained on public RNA-seq displays a high prediction accuracy and generalizability. Incorporating DARTS DNN prediction as an informative prior significantly improves the inference of differential alternative splicing. DARTS leverages public RNA-seq big data to provide a knowledge base of splicing regulation via deep learning, thereby helping researchers better characterize alternative splicing using RNA-seq datasets even with modest coverage.

Stitching and Sketching Large-scale Single-cell Transcriptomic Data

Download Stitching and Sketching Large-scale Single-cell Transcriptomic Data PDF Online Free

Author :
Publisher :
ISBN 13 :
Total Pages : 111 pages
Book Rating : 4.:/5 (11 download)

DOWNLOAD NOW!


Book Synopsis Stitching and Sketching Large-scale Single-cell Transcriptomic Data by : Brian Hie

Download or read book Stitching and Sketching Large-scale Single-cell Transcriptomic Data written by Brian Hie and published by . This book was released on 2019 with total page 111 pages. Available in PDF, EPUB and Kindle. Book excerpt: Researchers are generating single-cell RNA sequencing (scRNA-seq) profiles of diverse biological systems [1]-[7] and every cell type in the human body [8] at an unprecedented scale, with scRNA-seq experiments regularly profiling gene expression in hundreds of thousands or even millions of cells [9]. Leveraging this data to gain unprecedented insight into biology and disease requires algorithms that can scale to the tremendous amount of data being generated and can integrate information across multiple experiments, laboratories, and technologies. Here, we present two algorithms that aim to aid researchers in gaining better insight from scRNA-seq data sets. The first, Scanorama, inspired by algorithms for panorama stitching, achieves accurate integration of heterogeneous scRNA-seq data sets, which we use to integrate a number of large and complex collections of data sets. The second algorithm, geometric sketching, is a sampling approach that aims to evenly cover the low-dimensional manifold spanned by the cells to capture more of the rare transcriptional structure than would uniform subsampling with equal probability for each cell, obtaining sketches that better capture the transcriptional heterogeneity of the original data. Moreover, geometric sketching can be used to improve the computational efficiency of algorithms for single-cell integration, including Scanorama. We anticipate that both algorithms will play an important role in the analysis and interpretation of large-scale single-cell transcriptomic data sets.

Computational Methods for Precision Oncology

Download Computational Methods for Precision Oncology PDF Online Free

Author :
Publisher : Springer Nature
ISBN 13 : 303091836X
Total Pages : 341 pages
Book Rating : 4.0/5 (39 download)

DOWNLOAD NOW!


Book Synopsis Computational Methods for Precision Oncology by : Alessandro Laganà

Download or read book Computational Methods for Precision Oncology written by Alessandro Laganà and published by Springer Nature. This book was released on 2022-03-01 with total page 341 pages. Available in PDF, EPUB and Kindle. Book excerpt: Precision medicine holds great promise for the treatment of cancer and represents a unique opportunity for accelerated development and application of novel and repurposed therapeutic approaches. Current studies and clinical trials demonstrate the benefits of genomic profiling for patients whose cancer is driven by specific, targetable alterations. However, precision oncologists continue to be challenged by the widespread heterogeneity of cancer genomes and drug responses in designing personalized treatments. Chapters provide a comprehensive overview of the computational approaches, methods, and tools that enable precision oncology, as well as related biological concepts. Covered topics include genome sequencing, the architecture of a precision oncology workflow, and introduces cutting-edge research topics in the field of precision oncology. This book is intended for computational biologists, bioinformaticians, biostatisticians and computational pathologists working in precision oncology and related fields, including cancer genomics, systems biology, and immuno-oncology.