Protein

Protein accession
1R19F [EnVhog]
Representative
4vh6Y
Source
EnVhog (cluster: phalp2_28678)
Protein name
1R19F
Lysin probability
94%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VTHRFVQGIDYGPRRGTLGFAIHMAEGGDGTLPYLARRSGETDLAWRSRIRGVSANFVILSTGEVVQMVGWDRASGSMNPSDRGGTTGFYQTKHIRAVLGSHYGDPNAYSLSVEVTGYRAAGPNRAQVNALVALVAESRDRFPGMVGAYGHADQTDTKGCPGTAPLMLEAWSRIGHGKFDPQEDIVKVYSSPGSVSAVWPAGTAIYPDPTGAATGKTTVTRRFLIVGQDAPIPTRYLVDGGASNPSGLMAWLAAAGMGGTRDETYNAGVAAAAQRASGAKRDD
Physico‐chemical
properties
protein length:283 AA
molecular weight:29761,9 Da
isoelectric point:8,99
hydropathy:-0,27
Representative Protein Details
Accession
4vh6Y
Protein name
4vh6Y
Sequence length
279 AA
Molecular weight
29665,30840 Da
Isoelectric point
5,81442
Sequence
MVYPFVQAKYDYGVRTAPVRAFLVHMAEGGGTVGYLAHDPARGVSVHYVIEYSGRIVQMLAESHASGSVDPTQIRTTDDVDHFYGVTAAKAVMGAYWNDPNSAVVSLEMEGFATDGPNAAQAISLIELVDDVRSRFPDIGLLGHRDFASYKACPGKHIDWPALGGHGPATKETIVKTVLTIPPTTVILPKVGVPVLDQPAGSSIYVTKSGDKLPQWGDTEPPSYHIVRLPGNVAGYLANGQVASTQPLPPPPPVVDCTDVVKAELDKAATRAADAVRAR
Other Proteins in cluster: phalp2_28678
Total (incl. this protein): 16 Avg length: 287,7 Avg pI: 7,23

Protein ID Length (AA) pI
4vh6Y 279 5,81442
5GOBC 303 5,55433
5GXcy 319 5,23893
5nJ5y 297 5,51221
5zCQb 261 6,12289
6D8oU 278 5,54728
6DJjk 306 6,15023
6Rlar 294 6,54526
6U1DA 288 6,62597
6U8qP 260 6,23969
6UXZq 287 6,16762
Itvr 371 9,58524
bZYq 260 10,64310
c0aQ 257 10,14160
c0ve 260 10,78248
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_9930
2SirC
5 51,4% 208 3.009E-73
2 phalp2_9682
21Ube
1 51,2% 203 6.941E-72

Domains

Domains
Ami2
Disordered region
Representative sequence (used for alignment): 4vh6Y (279 AA)
Member sequence: 1R19F (283 AA)
1 279 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
1R19F
Method AlphaFoldv2
Resolution 80.50
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4vh6Y) rather than this protein.
PDB ID
4vh6Y
Method AlphaFoldv2
Resolution 71.53
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50