Protein

Protein accession
3ajZf [EnVhog]
Representative
4SQC9
Source
EnVhog (cluster: phalp2_10980)
Protein name
3ajZf
Lysin probability
96%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MPETIVAVVVAVVLQILPASKVDKGEEPSVRQARVERVVRAAVAAAQTRYALSLLLADADAETHMASHVVEGHCERMPDDDCDHGRARGVWQVHPWCRTAYAFPAGSDESMAEEARCAMRQMWHGMHRCREHAATPFHGAFAALAARPCSWSGAEKRVRLAFRIDAALERAGVPEPAAER
Physico‐chemical
properties
protein length:180 AA
molecular weight:19731,3 Da
isoelectric point:6,50
hydropathy:-0,21
Representative Protein Details
Accession
4SQC9
Protein name
4SQC9
Sequence length
166 AA
Molecular weight
17751,21120 Da
Isoelectric point
10,27041
Sequence
VTVPAEAVLALLVSMPQYRADIHADPTIEDRRRLLRPVALAISAVAKNRTEAAALVALAQHETHLARYVLDGYCEQGPVGARCDGGRARGAWQVHSWCRPLWDVSATDPGRHLAGARCAIGLLRRGRAKCGSLAGAFGAYAGVGCRWRGGPARARTTAHLEARLGR
Other Proteins in cluster: phalp2_10980
Total (incl. this protein): 24 Avg length: 171,4 Avg pI: 8,90

Protein ID Length (AA) pI
4SQC9 166 10,27041
1gq8p 162 9,61793
2zJOD 162 9,56822
36opk 127 7,83550
3NQfl 169 8,34996
3O8Sk 167 9,09625
3QNe1 167 9,24220
3XmJN 162 9,07536
3j8Cc 201 9,37565
429Ql 165 6,69657
4AElO 165 8,45015
4HqqM 162 8,60410
4Ma9G 168 10,88802
4TFSh 173 6,65303
4hRPn 182 11,00851
4jXBS 187 8,94243
4vnzi 169 9,49666
4wVVP 168 9,26406
5nBlc 177 7,84814
6TGhP 177 10,03355
7Vtt5 202 9,82958
e9Db 176 8,59230
tsrJ 180 8,39580
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7701
6Kf9J
5 31,5% 168 1.423E-37
2 phalp2_17689
6XyCu
13 36,9% 168 9.725E-25
3 phalp2_16466
6IGgW
27 32,9% 164 1.195E-23
4 phalp2_19726
4MXZw
1 33,7% 181 1.073E-22
5 phalp2_28141
8J9W
2 28,1% 181 2.007E-22
6 phalp2_917
8hAuw
31 28,5% 175 7.028E-22
7 phalp2_26
4Qxw9
2 34,7% 121 6.568E-16
8 phalp2_22052
5zEJP
28 33,3% 150 2.117E-12
9 phalp2_20289
Ypn7
4 25,0% 164 1.190E-08
10 phalp2_13141
2Rtef
18 27,8% 140 4.670E-07

Domains

Domains
Unannotated
Representative sequence (used for alignment): 4SQC9 (166 AA)
Member sequence: 3ajZf (180 AA)
1 166 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4SQC9) rather than this protein.
PDB ID
4SQC9
Method AlphaFoldv2
Resolution 95.54
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50