Data Entry: Please note that the research database will be replaced by UNIverse by the end of October 2023. Please enter your data into the system https://universe-intern.unibas.ch. Thanks

Login for users with Unibas email account...

Login for registered users without Unibas email account...

 
Automated incorporation of pairwise dependency in transcription factor binding site prediction using dinucleotide weight tensors
JournalArticle (Originalarbeit in einer wissenschaftlichen Zeitschrift)
 
ID 3931575
Author(s) Omidi, Saeed; Zavolan, Mihaela; Pachkov, Mikhail; Breda, Jeremie; Berger, Severin; van Nimwegen, Erik
Author(s) at UniBasel van Nimwegen, Erik
Omidi Klishami, Saeed
Pachkov, Mikhail
Breda, Jeremie
Berger, Severin
Zavolan, Mihaela
Year 2017
Title Automated incorporation of pairwise dependency in transcription factor binding site prediction using dinucleotide weight tensors
Journal PLoS Computational Biology
Volume 13
Number 7
Pages / Article-Number e1005176
Abstract Gene regulatory networks are ultimately encoded by the sequence-specific binding of (TFs) to short DNA segments. Although it is customary to represent the binding specificity of a TF by a position-specific weight matrix (PSWM), which assumes each position within a site contributes independently to the overall binding affinity, evidence has been accumulating that there can be significant dependencies between positions. Unfortunately, methodological challenges have so far hindered the development of a practical and generally-accepted extension of the PSWM model. On the one hand, simple models that only consider dependencies between nearest-neighbor positions are easy to use in practice, but fail to account for the distal dependencies that are observed in the data. On the other hand, models that allow for arbitrary dependencies are prone to overfitting, requiring regularization schemes that are difficult to use in practice for non-experts. Here we present a new regulatory motif model, called dinucleotide weight tensor (DWT), that incorporates arbitrary pairwise dependencies between positions in binding sites, rigorously from first principles, and free from tunable parameters. We demonstrate the power of the method on a large set of ChIP-seq data-sets, showing that DWTs outperform both PSWMs and motif models that only incorporate nearest-neighbor dependencies. We also demonstrate that DWTs outperform two previously proposed methods. Finally, we show that DWTs inferred from ChIP-seq data also outperform PSWMs on HT-SELEX data for the same TF, suggesting that DWTs capture inherent biophysical properties of the interactions between the DNA binding domains of TFs and their binding sites. We make a suite of DWT tools available at dwt.unibas.ch, that allow users to automatically perform 'motif finding', i.e. the inference of DWT motifs from a set of sequences, binding site prediction with DWTs, and visualization of DWT 'dilogo' motifs.
Publisher Library of Science
ISSN/ISBN 1553-734X ; 1553-7358
edoc-URL http://edoc.unibas.ch/56285/
Full Text on edoc No
Digital Object Identifier DOI 10.1371/journal.pcbi.1005176
PubMed ID http://www.ncbi.nlm.nih.gov/pubmed/28753602
ISI-Number WOS:000406619800001
Document type (ISI) Journal Article
 
   

MCSS v5.8 PRO. 0.447 sec, queries - 0.000 sec ©Universität Basel  |  Impressum   |    
26/04/2024