Submitted by: Submitted by kaede328
Views: 349
Words: 3969
Pages: 16
Category: Science and Technology
Date Submitted: 08/19/2012 09:41 AM
A Support Vector Machine Based Prediction Model for Lysine Acetylation Site
Chua, Linda
See, Kevin C.
A Research Paper submitted to
Mr. Neil Bretaña
In Partial Fulfillment of the Requirements
for the course in Bioinformatics
July 11, 2012
Abstract
This proposal paper aims to plan the development of a support vector machine based predictor model for lysine acetylation sites in proteins. First, a research on post-translational modification (PTM) in proteins was conducted. Several published related literatures were read and cited. A published paper was chosen as the basis for the entirety of the research. The paper “Predicting post-translational lysine acetylation using support vector machines” by Gnad et. al. was chosen due to its quality and relevance to the research objectives. The dataset was extracted from another source, namely N-Ace (which in turn used the dbPTM database as their source). Six protein features were selected to be the basis for the predictor: (1) frequency, (2) presence, (3) solvent accessibility, (4) hydrophobicity, (5) composition, and (6) BLOSUM63. The datasets extracted were processed. Finally, to attain a set of preliminary results, Weka, a popular data mining and machine learning tool, was used to run the dataset through a support vector machine (SVM).
Introduction
Background of the Study
Proteins are present in every living organism. They are responsible for the many functions of a living organism such as digestion, production of anti-bodies, enzymes, movement, eyesight and more (Bio News Online). But before they become as such, they undergo a process known as protein biosynthesis which occurs inside cells. In a nutshell, this process takes 20 amino acids and forms them into a long chain called polypeptides (Walker, 2012). After the formation of this long chain, a protein is made. But before it can become fully functional, it undergoes another process known as post-translational modification or abbreviated as PTM....