Protein

Protein accession
4KRz8 [EnVhog]
Representative
4KRz8 (this protein)
Source
EnVhog (cluster: phalp2_23170)
Protein name
4KRz8
Lysin probability
97%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MAQVPYGEGVPEVAPDTSVPDDYQHINATPASFGGLIAQGAEKAGQGITQASTNLFDIAQFRGKINSDDQTNHYITTNNNILYGDPAKSTVGPDGKQVPDLGYLGLQGRAASDARPDVLKQLETARMEGRKNLSSPQEQLEYDAQTKRLYADAVARTGQHADQQWKSWAGDVNGKGAEHSMTGFLNSLGDQEAMKHNAADYINFKIQESQIRFGDDPKINAQVTADAQRDLLKAQVQWVAAKDPAGAQRILEKNKDIAGPEYPQLSNELRTRVDQQVGHTGGAAIFARANQNATQQFAAGQPPVEGAKTLLRSFEGFKPQAYWDVNHWRVGYGSDTITKADGTIVPVTQGTVVSLDDAERDLTRRTADFTQKAQTQIGADAWGKMTPQAQSVMGSVAYNYGTLPASVVQAAQTGDPVQLSNAVMSLGGDNGGINAGRREAEAGIISGKHSYAIKGDAYRMAIENPDFTEEQRAIALRTISELSNAQEVAYNQNARARIDGVNKAVGDYTTQFWNMLHTPNPDWVSFMGKINADPALADAGPAKDGLMERVIKRSGEEQSLAFGPGYMTVKNNILSDPGAEGHIAMLADIYKLPLGQLTAAGEHELKEVFNDLKKGPDEYGIQKTRASLETYAKSVMAKEQLIPGFGTLSTNKKGEEIFNSQFIPQFNAAYANWVKKGKDPNEFLTKKNVDEMMDRIYPRDKRAADNMIAQGDGSVPQDANAPLPQPPDGVDQKVWTGLLGTPPLMATGKVATPQQYG
Physico‐chemical
properties
protein length:757 AA
molecular weight:81963,3 Da
isoelectric point:5,33
hydropathy:-0,59
Other Proteins in cluster: phalp2_23170
Total (incl. this protein): 5 Avg length: 866,4 Avg pI: 5,67

Protein ID Length (AA) pI
2XW7x 879 5,51477
3Takq 981 6,34962
4Iwcz 842 5,32891
873eM 873 5,84506
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_18987
86KhQ
23 32,8% 827 5.591E-193
2 phalp2_11309
6XHkz
6 30,0% 781 2.643E-128
3 phalp2_14506
4Rn2g
4 25,2% 709 3.696E-72
4 phalp2_19739
4RnFA
14 25,6% 764 5.902E-66
5 phalp2_18484
6Kgj0
4 28,3% 544 7.938E-66
6 phalp2_25968
6DAlo
1 18,4% 969 4.314E-54
7 phalp2_21995
4XBst
1 20,6% 885 2.590E-47
8 phalp2_32936
3Tbkq
3 19,0% 950 1.988E-37
9 phalp2_40570
4RodC
3 22,6% 789 4.633E-37
10 phalp2_33368
6IlJk
3 20,8% 647 3.329E-36

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
4KRz8
Method AlphaFoldv2
Resolution 83.32
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50