Protein

Protein accession
2XBoc [EnVhog]
Representative
5CeI1
Source
EnVhog (cluster: phalp2_8033)
Protein name
2XBoc
Lysin probability
77%
PhaLP type
VAL
Probability: 95% (predicted by ML model)
Protein sequence
MADTPDTGGGLLGFFGRGYDPAAAYGGLLADPATKDAMQARALAAMANAFAEGGMPVPYKGGIPLLSTIGRAGAAAYGGQDALIKARLEQAQAALAGTQGDVARQNLDFMKAYVKSRQQALGGGDAKPGGPLSLLQGAGGTAAPGDAGGPAISTEAVSTDLPNEARAFLDTVAGPESKGAYNIRYTPQGGATFDSFADHPRIYEPGPDGKSSAAGRYQIVASTFDPLAQKYGYKDFSPQTQDHAAWQLAQDTFKDKTRGDLLTALRAGKLDEVQKALHGQWKTLDLGSFGANLDKYTQAGAATVAALPGGPTAGLLAQVPNVAERMAAPDNPAPPRGGLLATAPPGPPAAVAAGGPPAPPTGAGAMRSAETAIPGAYYDKAQGLRVPGMPDYDPTRAPTPVTPVPSIRGGTNGIVPPGVPGPQGGLLDPPPGLGPPGATPVSTGAAAPPAAAPLMPGRLGLIHNSIAGLLDPRAAG
Physico‐chemical properties
protein length: 476 AA
molecular weight: 47558.9 Da
isoelectric point: 6.12
hydropathy: -0.22
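
The record does not say how the hydropathy value is derived. A common choice is the grand average of hydropathy (GRAVY): the mean of per-residue hydropathy indices over the whole sequence. The sketch below assumes the Kyte-Doolittle scale, which may or may not be what this database uses; the function name `gravy` and the short test fragment are illustrative only.

```python
# Kyte-Doolittle hydropathy index per amino acid.
# Assumption: the page does not name a scale; this one is used for illustration.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy over all residues."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue code.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# First ten residues of the protein sequence above, used as a small example.
fragment = "MADTPDTGGG"
print(len(fragment), round(gravy(fragment), 2))
```

Passing the full sequence from the record to `gravy` gives a value that can be compared with the hydropathy listed above; a mismatch would suggest the database uses a different scale or method.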
Representative Protein Details
Accession
5CeI1
Protein name
5CeI1
Sequence length
351 AA
Molecular weight
36637.72980 Da
Isoelectric point
8.72717
Sequence
MALGFFTDDNSGDSYESIQRKRKLADALLAQSQDGAPIQSWTQGGAKLIQALAGSLQNRSLDTKERESSKAFNETLMRALGGQSGGSMPSAAPSPMPTGGGLLPAMASGGNIPKMVMPDPVYGNLDATQKALLNAIAAPESAGAYNIRYTPKGGATFAGFDAHPGVFEPGPAGPSSAAGRYQFTKTTWDRMGGGEFTPENQDKRALALANQDYKARTGRDLMADIQANGFTPQIAQALGPTWRGLIDNPQKAIAAFQSTMQRNQQPPAQVAQAGMTPDGMPTTPVPSNAAPMSYAPGQQAIASAAPQQPAQGQRLAQAVTSPPIPAPQPTGNVQQAMMAVLSDPRFSPEQK
Other Proteins in cluster: phalp2_8033
Total (incl. this protein): 6
Avg length: 380.2 AA
Avg pI: 6.80

Protein ID   Length (AA)   pI
5CeI1        351           8.72717
6S5rt        485           8.54962
7LYGX        337           5.26945
bWKK         288           6.84946
qmEw         344           5.27911
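
The cluster summary counts this protein (2XBoc) together with the five members listed in the table. As a quick consistency check, the averages can be recomputed from the values in this record. This is only a sketch; the variable names are illustrative.

```python
from statistics import mean

# (protein ID, length in AA, pI), copied from this record.
members = [
    ("2XBoc", 476, 6.12),     # this protein (from the physico-chemical properties)
    ("5CeI1", 351, 8.72717),  # cluster representative
    ("6S5rt", 485, 8.54962),
    ("7LYGX", 337, 5.26945),
    ("bWKK", 288, 6.84946),
    ("qmEw", 344, 5.27911),
]

avg_length = mean(length for _, length, _ in members)
avg_pi = mean(pi for _, _, pi in members)

print(f"{len(members)} members, avg length {avg_length:.1f}, avg pI {avg_pi:.2f}")
# prints: 6 members, avg length 380.2, avg pI 6.80
```

The recomputed values agree with the summary line, which confirms that the reported averages include this protein and not only the five listed members.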
Similar Clusters (pHMM search)
#   Cluster (protein)      # Members   Identity (%)   Alignment Length   E-value
1   phalp2_12308 (7sbkI)   1           35.6%          373                2.611E-52
2   phalp2_40072 (41NhX)   13          49.7%          225                9.017E-52
3   phalp2_24726 (6AknS)   21          38.7%          222                4.263E-31
4   phalp2_17473 (4YIi6)   1           35.4%          254                2.955E-29
5   phalp2_18645 (dFjL)    1           26.6%          338                3.583E-05

Domains

No domain annotations available.

Taxonomy

        Name                             Taxonomy ID      Lineage
Phage   Unknown from Metagenome [NCBI]   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (5CeI1) rather than this protein.
PDB ID 5CeI1
Method AlphaFoldv2
Resolution 64.70
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
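
The confidence legend above maps a pLDDT score (0 to 100) to one of four bands. A minimal sketch of that mapping follows. The legend uses strict inequalities and does not say where the exact boundary values 90, 70, and 50 belong, so assigning them to the lower band is an assumption here. The function name and the sample scores are hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score to the confidence band named in the legend."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: a score of exactly 90, 70, or 50 falls into the lower band.
    return "Very low"


# Hypothetical scores, not taken from the structure in this record.
for score in (95.2, 81.0, 60.0, 42.0):
    print(score, plddt_band(score))
```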