Protein

Protein accession
2UDff [EnVhog]
Representative
46ZA1
Source
EnVhog (cluster: phalp2_31763)
Protein name
2UDff
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSQSYFETEGYTKWEENLNYRTAERLPEVWPTHFTMDQSNTEKAYAPDYINNPEKLANMIYANRYGNGNVASGDGWKYRGQGAFNLTFADNYKAASQALFSDYLLYTNPALVGTDFETRWLTAGWFWNTHHFNAVADADQFTAMTTTINGSPSTVPERLEVLHKAQNIFKE
Physico‐chemical
properties
protein length:171 AA
molecular weight:19611,3 Da
isoelectric point:5,06
hydropathy:-0,67
Representative Protein Details
Accession
46ZA1
Protein name
46ZA1
Sequence length
121 AA
Molecular weight
13923,38100 Da
Isoelectric point
6,92551
Sequence
KQPEKIANRIYSSRMGNGDEHSGDGYKYRGRGPIQLTGRSNYTQFAKDMFDDWQNVVDNPDWVTADRDFALMSAIWFWNKNGLNVQADNGDIKLMTKKINGGYIGLDDRIKHYNECINLLT
Other Proteins in cluster: phalp2_31763
Total (incl. this protein): 10 Avg length: 145,8 Avg pI: 5,84

Protein ID Length (AA) pI
46ZA1 121 6,92551
1lTbC 136 4,54754
1yCCg 117 6,71896
33Fsu 111 5,62759
5jwg2 107 5,03528
8orA3 166 6,74579
GRNl 117 6,13613
A0A6J5SCG0 206 6,21400
A0A6J7WRE9 206 5,37654
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_10814
4695M
103 54,6% 119 4.983E-41
2 phalp2_29848
3MAIs
98 49,1% 120 5.706E-39
3 phalp2_20710
58Xni
163 55,3% 112 1.229E-36
4 phalp2_1654
8rD3A
4011 41,8% 110 1.344E-30
5 phalp2_431
dfz
1 44,4% 99 5.681E-25
6 phalp2_28972
62TOB
63 42,7% 117 4.730E-23
7 phalp2_7673
6ARhi
2 46,9% 83 2.172E-18
8 phalp2_11263
6KGnJ
1 46,7% 77 2.172E-18
9 phalp2_33649
PKMa
1 36,4% 96 7.673E-18
10 phalp2_2991
1bUPz
10 35,5% 104 1.799E-16

Domains

Domains
Representative sequence (used for alignment): 46ZA1 (121 AA)
Member sequence: 2UDff (171 AA)
1 121 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF00182

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (46ZA1) rather than this protein.
PDB ID
46ZA1
Method AlphaFoldv2
Resolution 96.37
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50