Dataset of tandem mass spectra for non-halal pork-based food products

Published: 21 June 2020| Version 3 | DOI: 10.17632/6dmm8659rm.3
Mohd Hafis Yuswan,


Shotgun proteomics is a high-throughput proteomic studies that commonly used to establish the peptide markers. Massive amounts of tandem mass spectra will be produced after an LC-MS/MS acquisition, as a consequence of thousands of proteins from several biological samples is digested by an enzyme. Manual interpretation for these massive mass spectra is unfeasible; instead, database search engine is used to match the spectra with sequence information contained in the database. Each search engine has a unique algorithm to provide a list of the best matching peptide sequences, as well as a score that related to the confidence level for an individual tandem mass spectrum. In this dataset, three proteomic database search engines namely Comet, X! Tandem, and ProteinProspector have been used against 45 types of pork-based food products with triplicate each for halal analysis. The dataset can be used to compare and discuss the ability of each database search engine, in term of detected spectra, matched peptide-spectrum, and inferred proteins. A false-discovery rate can also be compared and discussed from this dataset, in respect of database searching. A result of combined database search engine can be generated from this dataset to leverage the information contained in the database during interpretation of massive amounts of tandem mass spectra.


Steps to reproduce

Acquisition mode for mass spectrometry proteomic data was a data-dependent acquisition (DDA) on tryptic peptides of different samples of pork-based food product. The mass spectra data were acquired from AB Sciex 4000 QTrap mass spectrometer using an Analyst 1.5.1 software. For each sample, both biological and technical replicate are in triplicate.


Universiti Putra Malaysia


Proteomics, Foodomics