Innovation

Polybayes Software

Washington University in St. Louis
posted on 12/11/2006

PolyBayes is a computer program for the automated analysis of single-nucleotide polymorphism (SNP) discovery in redundant DNA sequences. The primary motivation for its development is to provide a general and reliable tool for the discovery of genetic vari

Suggested Uses

Automated analysis of single-nucleotide polymorphism (SNP) discovery in redundant DNA sequences.


Innovation Details
 

Detailed Description

PolyBayes is a computer program for the automated analysis of single-nucleotide polymorphism (SNP) discovery in redundant DNA sequences. The primary motivation for its development is to provide a general and reliable tool for the discovery of genetic variations in what is an exponentially increasing volume of sequence data in public and private databases. The software integrates algorithmic solutions to three of the main challenges in sequence-based SNP discovery:

1. Multiple sequence alignment. We have developed an anchored approach enables computationally efficient creation of multiple sequence alignments provided that a reliable anchor sequence (e.g. genomic reference sequence) is available.

2. Paralog identification. We utilize a probabilistic discrimination algorithm to identify likely sequence paralogs (highly similar duplicated sequences from disparate genomic origins). If unidentified, sequence differences between paralogous sequences can lead to false SNP predictions, hence it adventageous to remove them from the analysis as early as possible.

3. SNP detection. We have derived and implemented a novel, fully probabilistic SNP detection algorithm that calculates the probability (SNP score) that discrepancies at a given location of a multiple aligment represent true sequence variations as opposed to sequencing errors. The calculation is based on a rigorous, Bayesian-statistical formulation that takes into account the alignment depth , the base calls in each of the sequences, the associated base quality values (such as generated by the Phred trace analysis program or the Phrap fragment assembler), the base composition in the region, and the expected a priori polymorphism rate . By accounting for the base quality values, it is possible to mine all available data in a statistically rigorous manner, without restrictions on data quality or a need for heuristic considerations.

As its main output, the PolyBayes program produces a list of candidate polymorphic sites, each site with an associated SNP probability score that has been demonstrated to accurately forecast the true positive rate in subsequent validation experiments. A selectable score threshold allows the user to strike a balance between highly accurate predictions and the recovery of additional, rare polymorphisms, or SNPs in low quality sequences.

File Number: 001839 


IP Protection


License Online

End User License Agreement for academic/non-profit entities

Item type: Software
view license

This agreement is solely for academic and non-profit organizations. You must provide a '.edu' email address and the name of your institution in the...

[Edit] [Delete] [Test]

$0.00

License Now

People

Case Manager:

Isabel Acevedo Isabel Acevedo

Innovations (7)


Download Technology Brief (PDF)


Followed By

Follow this innovation

Icon_avatar

bianco2007

Member since
Aug 2009

Icon_avatar

neo L

Member since
Aug 2009

Icon_avatar

zhongxu lin

Member since
Feb 2010

Icon_avatar

xiaozuxingdong

Member since
Apr 2010

Icon_avatar

adrywa

Member since
May 2010

Icon_avatar

chong

Member since
Mar 2011

Icon_avatar

Omar Andres

Member since
Jul 2011

Icon_avatar

Mirele Poleti

Member since
May 2011

Icon_avatar

lifagen2002

Member since
Sep 2011

Organization
Profile
Related Tags

Find more innovations


February 11, 2009

7,783 members 17,070 innovations 152 organizations

Browse

Linda L. Restifo, M.D., Ph.D. - University of Arizona

"I want to say again how happy I am about the iBridge Network mechanism. This seems ideal for NeuronMetrics and I'm very pleased we will be part of this venture."  read more...