Protein

Protein accession
4gcIJ [EnVhog]
Representative
2jvCh
Source
EnVhog (cluster: phalp2_7110)
Protein name
4gcIJ
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSQIPAILIAALIQVESGGNDRAIGDGGRSLGCLQISDVVVADVRRITGRWVSRTDAWSRQKSIRICVDYLSHYATRERLGYEPRMEDMARIWNGGPNGWRKDSTRDYWRKVQSVLTAGRRVQTGLDDRGPLVDPRRHHVQVAQFSGDRLVLGFEGFGPLLNRIHAVRQKDERAEHAAETRANSDDGPQSA
Physico‐chemical
properties
protein length:191 AA
molecular weight:21415,8 Da
isoelectric point:9,61
hydropathy:-0,60
Representative Protein Details
Accession
2jvCh
Protein name
2jvCh
Sequence length
184 AA
Molecular weight
21245,09760 Da
Isoelectric point
8,75637
Sequence
MMGSNNLHEIRDILKHVETDHRPSIIGDGGDSYGILQIQQGAITDVNRIYGTSYVHADAFDIECSEEIFELYIKHWTGKLEKREAREATEEDIVRIWNGGPQGWRRKSTLDYLFRYKKYKLDMSLNTRNCYVTGKVGTIVATYTHTVDIYVYKLKRVRYGVSRNHIKLMPLPVPPVNIQLALAL
Other Proteins in cluster: phalp2_7110
Total (incl. this protein): 6 Avg length: 191,2 Avg pI: 8,92

Protein ID Length (AA) pI
2jvCh 184 8,75637
29eT5 196 9,44231
2B0aR 204 8,76681
38x5n 194 9,26883
sVBb 178 7,64998
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_40097
85aZL
71 53,3% 118 1.515E-46
2 phalp2_24333
3Qsp3
6 41,5% 118 8.162E-26
3 phalp2_46
7Exor
250 31,0% 119 1.476E-22
4 phalp2_5196
3O9Fp
630 26,5% 143 2.755E-22
5 phalp2_23582
1f8D
174 33,9% 112 9.597E-22
6 phalp2_720
1ILsj
2 36,6% 112 6.232E-21
7 phalp2_13677
61fFV
187 35,0% 114 7.994E-18
8 phalp2_6768
7V7Ra
4 26,9% 126 9.950E-15
9 phalp2_11423
8xVfQ
883 29,5% 115 6.345E-14
10 phalp2_33050
4wJgh
2 28,4% 116 1.187E-11

Domains

Domains
Unannotated
Unannotated
Disordered region
Representative sequence (used for alignment): 2jvCh (184 AA)
Member sequence: 4gcIJ (191 AA)
1 184 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4gcIJ
Method AlphaFoldv2
Resolution 80.22
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (2jvCh) rather than this protein.
PDB ID
2jvCh
Method AlphaFoldv2
Resolution 90.63
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50