Protein

Protein accession
4RzCP [EnVhog]
Representative
4gI90
Source
EnVhog (cluster: phalp2_21813)
Protein name
4RzCP
Lysin probability
94%
PhaLP type
VAL
Probability: 98% (predicted by ML model)
Protein sequence
LPRIPTYQPNQVGPVESTGARFRAADNGGGIGAAIGQGLQGLGKAGADFITAQDQIQDQFDDTYARRFALDFDAATKPVMTQYSAQQGKNAIDAAMPTQEALTKLREETLAKASNPRMKRYLEERLAEPYSRYSSGIVSHSLGQQQVMAEETALSEKALAINDAAGSYANPQLAATKKAEALDAVRRVGALKGWDTKKLKAEELAATTAIHTTAFDLMRSGAKEDIDRDAAYLAAHKDEMTATAYANALGDLAEPLRKREDTADFWRAITPTADAAPAPGGTPASGAPVRQATAAEIKPALLGVFGKGTYVGDNAKHSKYTSSGKVSDHTVDRALDFVPPGGMGKYTTSQAEALITSELARRGLRIRRNANGTPQFFGPGRHAKNPGDHDDHYHVAWEVDPKATGGQGVDNERPQQFDKDQVYNRIDALAAKEKWSPEKIERVKAMGDQEIRRNEELDSRQKRAADEAAADIMLGKGSGFTSKNLIPAETWSKMSVGARASAEAAIERNLAPVAAKPNGPAAMLLNQMKTLMPQEFANQDLSKIVGQVSQAELDTFLTEQTKIKRDNGDIVKRRTEINAVVNSQSTFGGLKLNDAERASVAQMMEAQEVAAISGGKTPDRAASYRAAIGIVMMSRAVTPQGN
Physico‐chemical
properties
protein length:642 AA
molecular weight:68968,5 Da
isoelectric point:7,30
hydropathy:-0,55
Representative Protein Details
Accession
4gI90
Protein name
4gI90
Sequence length
704 AA
Molecular weight
76206,73840 Da
Isoelectric point
5,63720
Sequence
MPRAPSYNPNQVGPAETTGARFRTPDFGPPAVAQGLAGFGKAVNDFAIEEDQRDAEFDDTQSRKLATNALTELQGITNEYTSLQGGNATAARKSIDERLAKARDAYVGQAANERQKRMLGERLDGLFGSAMGTIDKHYRSEEYTERLGTFATESAQLADSAAMTDDADAAAAYLAQGKATFQDGLRLKGVTDPEAVAFETLKFTDGAHTAKIDRWFASPNPDIDLIASYVEAHSDEFTAVGRVKAMERLQQPLQERADYSLYKSLPTLAAPPSSGQSGTGWTKVATDVASRFGLSPVEVASVISYETGGTFSPTVMGGKNGQYMGLIQFGPSERAKYGITAQSTPEQWSTAVGNFLEDRGFKKGMGVLDLYSTINAGSPGRYNASDGNGTVRSHTEKILSEHRGGAEKWLAGGGVTDNSPRQYDKAAIYDSIENGVDENGKPLSPERVERLKKIADREIARDEQLLGRERAAADEAAFSTVNSLGEGFTSISQLPASVRAGLSPADVAKYDNIAKANAKPKPVPANGPAAFALDALEMADPNAFASADLSKFFADLTPAERSSYMLRQQKVKDGLKGWTPYTGINGAVTRATNFGGFKLDDGQKLAVRQIMQAEADRRFKATGKPLNDVEFDDIFRTATRDVPTHKWYGAKGSVKWYDQTVANIGDETRKRIVDAYKKVNGGAEPDNETIMTWYRRNFIPARAQ
Other Proteins in cluster: phalp2_21813
Total (incl. this protein): 16 Avg length: 695,9 Avg pI: 6,13

Protein ID Length (AA) pI
4gI90 704 5,63720
1Xh7m 708 6,52042
35fqm 706 5,93651
4eJB2 698 6,20820
4ftVO 708 5,81727
4nY6A 676 5,91855
6DCT5 662 6,31478
7Rdc8 708 5,66482
7dzuF 709 7,05181
7oi8y 709 6,02189
7pbHw 706 6,16842
7qy0x 693 5,58224
8iRSa 698 6,16734
OVQs 705 5,93725
lHtH 703 5,89155
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_25119
1qReM
1 41,2% 480 3.339E-198
2 phalp2_17450
7J67M
21 34,7% 716 7.291E-149
3 phalp2_23936
1XB6k
19 30,8% 720 2.730E-134
4 phalp2_30291
4nSpn
35 30,0% 748 2.894E-114
5 phalp2_32241
7qzoI
3 24,8% 757 2.213E-110
6 phalp2_13041
8epAw
7 21,8% 683 1.086E-37
7 phalp2_27844
fq3d
11 22,0% 765 1.575E-32
8 phalp2_39056
7osxH
14 21,9% 633 1.263E-28
9 phalp2_39703
btgC
13 21,0% 774 1.187E-27
10 phalp2_9608
1zzxV
9 20,6% 624 1.264E-20

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4gI90) rather than this protein.
PDB ID
4gI90
Method AlphaFoldv2
Resolution 82.58
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50