<html>


<!-- Mirrored from www.jenner.ac.uk/YBF/shadforth.htm by HTTrack Website Copier/3.x [XR&CO'2003], Fri, 25 Jun 2004 08:24:45 GMT -->
<head>
<meta name="GENERATOR" content="Microsoft FrontPage 5.0">
<meta name="ProgId" content="FrontPage.Editor.Document">
<meta http-equiv="Content-Type" content="text/html; charset=windows-1252">
<meta http-equiv="Content-Language" content="en-us">
<title>New Page 1</title>
</head>

<body>

<div align="center">
  <center>
  <table border="0" cellpadding="0" cellspacing="8" width="98%">
    <tr>
      <td align="right" valign="top" width="20%">&nbsp; </td>
      <td width="15"></td>
      <td bgcolor="lightblue" valign="bottom" width="80%">
      <p align="center"><b><font size="5">Genome Annotation Using Proteomic MS</font></b></td>
    </tr>
    <tr>
      <td bgcolor="lightblue" valign="top" width="20%">
      <font FACE="Times New Roman">
<p align="center"><b><font size="4">Ian Shadforth</font></b></p>
<b>
<p>Supervisor:</b> Dr. Conrad Bessant</p>
<b>
<p>School:</b> Institute of BioScience &amp; Technology, CCBIT at Cranfield
University.</p>
</font>
      <p><font size="3"><br>
      <br>
      </font></td>
      <td width="15"></td>
      <td valign="top" width="80%"><font FACE="Times New Roman">
<p>&nbsp;</p>
<p>The focus of the EngD project, in collaboration with GlaxoSmithKline (GSK),
is to develop a system to enable high-throughput processing of proteomic
mass-spectra. Current production of proteomic MS and tandem MS data far exceeds
our analytical capacity and so individuals choose to analyse the mass-spectra of
interesting sets of peptide fragments – a tiny proportion of the available
data. GSK uses a commercially licensed program, Mascot, to match spectra against
a database of known and predicted proteins, however around 50% of peptide
fragments remain unassigned to proteins after Mascot processing. Hidden in this
information are likely to be as yet undiscovered proteins and variants of
currently known proteins.</p>
<p>A strategy and system for processing such information to yield the maximum
benefits in terms of generation and accessibility of knowledge is being
developed. The existing Mascot program alongside orthogonal algorithms, such as
ProbID, will form the core of the high-throughput processing system. Not only
must the system accurately identify peptide fragments or their origin, but also
must provide <i>chains of evidence</i> for the assignations made. These will
allow researchers to quickly form a view as to the veracity of an assignation
before performing further work based upon it.</p>
<p>Work modules currently being progressed are:</p>
<ol>
  <li>Pre-processing of data files to appropriate formats for analysis</li>
  <li>Batching of Mascot and ProbID processes</li>
  <li>Development of strategies and programs to meaningfully combine output data</li>
</ol>
<p>The output from this project will represent a significant increase in our 
knowledge of proteins produced by the genes of human, mouse and rat; although in 
principle data from any species may be considered.</p>
      </font></td>
    </tr>
  </table>
  </center>
</div>

</body>


<!-- Mirrored from www.jenner.ac.uk/YBF/shadforth.htm by HTTrack Website Copier/3.x [XR&CO'2003], Fri, 25 Jun 2004 08:24:45 GMT -->
</html>