On splice site prediction using weight array models: a comparison of smoothing techniques

Leila Taher; Peter Meinicke; Burkhard Morgenstern

doi:10.1088/1742-6596/90/1/012004

Journal of Physics: Conference Series

The following article is Open access

Leila Taher¹, Peter Meinicke² and Burkhard Morgenstern²

Published under licence by IOP Publishing Ltd
Journal of Physics: Conference Series, Volume 90, 16th Argentine Bioengineering Congress (SABI 2007) and the 5th Conference of Clinical Engineering, 26–28 September 2007, Alkazar Hotel, San Juan, Argentina

Citation: Leila Taher et al 2007 J. Phys.: Conf. Ser. 90 012004

DOI: 10.1088/1742-6596/90/1/012004

Author affiliations

¹ Fundación Miguel Lillo – CONICET, Miguel Lillo 205, 4000 Tucumán, Argentina

² University of Göttingen, Institute of Microbiology and Genetics, Department of Bioinformatics, Goldschmidtstr. 1, 37077 Göttingen, Germany

³ To whom any correspondence should be addressed

Abstract

In most eukaryotic genes, protein-coding exons are separated by non-coding introns which are removed from the primary transcript by a process called "splicing". The positions where introns are cut and exons are spliced together are called "splice sites". Thus, computational prediction of splice sites is crucial for gene finding in eukaryotes. Weight array models are a powerful probabilistic approach to splice site detection. Parameters for these models are usually derived from m-tuple frequencies in trusted training data and subsequently smoothed to avoid zero probabilities. In this study we compare three different ways of parameter estimation for m-tuple frequencies, namely (a) non-smoothed probability estimation, (b) standard pseudo counts and (c) a Gaussian smoothing procedure that we recently developed.
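To make estimation options (a) and (b) concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a first-order weight array model (m = 2, so each position is conditioned on the preceding base), aligned training windows of equal length, and a uniform pseudocount added to every count. The names `estimate_wam`, `log_score`, and the four toy training sequences are hypothetical and chosen only for illustration. The Gaussian smoothing procedure (c) is not sketched, because the abstract does not describe its details.

```python
from collections import Counter
from math import inf, log

ALPHABET = "ACGT"


def estimate_wam(seqs, pseudocount=0.0):
    """Estimate a first-order weight array model from aligned sequences.

    Returns (first, transitions): `first` maps a base to its probability at
    position 0; `transitions[i - 1]` maps (previous_base, base) to the
    conditional probability of `base` at position i given `previous_base`.
    pseudocount=0.0 gives the non-smoothed (relative frequency) estimate.
    """
    n, k, length = len(seqs), len(ALPHABET), len(seqs[0])
    first_counts = Counter(s[0] for s in seqs)
    first = {b: (first_counts[b] + pseudocount) / (n + pseudocount * k)
             for b in ALPHABET}
    transitions = []
    for i in range(1, length):
        pair_counts = Counter((s[i - 1], s[i]) for s in seqs)
        table = {}
        for prev in ALPHABET:
            denom = sum(pair_counts[(prev, b)] for b in ALPHABET) + pseudocount * k
            for b in ALPHABET:
                num = pair_counts[(prev, b)] + pseudocount
                # Without smoothing, an unseen context or pair gets probability 0.
                table[(prev, b)] = num / denom if denom > 0 else 0.0
        transitions.append(table)
    return first, transitions


def log_score(seq, model):
    """Log-probability of `seq` under the model; -inf if any factor is zero."""
    first, transitions = model
    probs = [first[seq[0]]]
    probs += [transitions[i - 1][(seq[i - 1], seq[i])] for i in range(1, len(seq))]
    return -inf if any(p == 0.0 for p in probs) else sum(log(p) for p in probs)


# Hypothetical toy training windows (made up, not real splice site data).
TRAIN = ["AGGTAAGT", "AGGTGAGT", "CGGTAAGT", "AGGTAAGA"]

raw_model = estimate_wam(TRAIN)                       # (a) non-smoothed
smoothed_model = estimate_wam(TRAIN, pseudocount=1.0)  # (b) pseudocounts

# A window containing tuples never seen in training.
query = "TGGTCAGT"
print(log_score(query, raw_model))       # zero probability without smoothing
print(log_score(query, smoothed_model))  # finite once pseudocounts are added
```

In this sketch the non-smoothed model assigns zero probability to any window containing an unseen tuple, which is the problem that smoothing is meant to avoid; the pseudocount variant keeps every conditional distribution strictly positive.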
