COG annotation




choosefile*  example





COG database introduction

COG database is Cluster of Orthologous Groups of proteins。The COG database contains 25 major categories of functionality, each with a specific functional description.


Input files

Fasta file of nucleic acid or amino acid query sequences.

Results

1. Mapping and annotation results

2. Statistics of alignment

3. Distribution of E-values.



1. Mapping and annotation results

Query_id :ID of query sequences

Subject_id :ID of mapped sequences in database

Organism Species(OS) :The species name of the matched sequence

Gene Name(GN) :The gene name of the matched sequence

Identity(%) :identity of alignment (percentity)


Align_length:the length of sequences covered by alignment

Gap:number of gaps

Query_start :the start position of query sequences covered by alignment


Query _end :the end position of query sequences covered by alignment

Subject_start:the start position of subject sequences covered by alignment

Subject_end:the end position of subject sequences covered by alignment

E_value :Expcet values of alignment, the lower the better

Score:Score of alignment, the higher the better

Protein-or-Domain: alignment of proterin or domain

COG -ID:The ID of COG

Function-Description:the description of function

Code:the code of function

Functional-Categories :the categories of functional

2. Statistics of alignment

showing the number of mapped and unmapped results.

3. Distribution of E-values.

E value is the expected value of the alignment. The smaller the E value, the more the reliability. We divided the E value into five ranges and show the number with a pie chart.