FAMeS: Fidelity of Analysis of Metagenomic Samples

Gene Function Prediction Method Comparison

Genes identified on the simulated datasets were compared to the genes originally predicted on the corresponding reads of the isolate genomes (reference genes) using blastp. Genes were categorized into four groups. 

Number on top of columns indicate the total number of reference genes found in each sequence group. Percentages are calculated with respect to this number.

Reference genes for each simulated dataset, as well as predicted genes both on assembled sequences (contigs) and on singlets were compared to the COG database. For each combination of assembly/gene prediction method, the relative abundance of each cog group was calculated.