Analysis of Large Data Sets
Data spotkania seminaryjnego:
czwartek, 5. Grudzień 2019 - 14:15
Mass spectrometry (MS) is one of the most important technologies for study of proteins. MS experiments generate massive amounts of complex data which require advanced pre-processing and careful statistical analysis. In bottom-up approach to MS, peptides - smaller segments of proteins - enter the mass spectrometer and thus measurements are made on a peptide level. Because of this, one of the problems in protein quantification based on MS is the presence of peptides that can be assigned to multiple proteins. Such peptides are referred to as shared or degenerate peptides. Since it is not obvious how to assign the abundance of shared peptides to proteins, they are often discarded from the analysis. This leads to a loss of a substantial amount of data. In this talk, I will first present the basics of Mass Spectrometry data analysis. Then, I will review existing methods for handling shared peptides. I will finish with a summary of our progress on improving methodology of protein quantification with shared peptides and related statistical challenges. The talk is based on an ongoing collaboration with Tomasz Burzykowski (Hasselt University) and Jurgen Claesen (Belgian Nuclear Research Centre).