Protein

Protein accession
4JjDG [EnVhog]
Representative
4T1qD
Source
EnVhog (cluster: phalp2_7472)
Protein name
4JjDG
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTQTSITTRVANNLRDQGVTVYHRGQWGSLYPEVYQQRRTTRPAQQPADTVFQHITVTRPTEDFLHDVRVVEQIGYERFGSGVSYNWLINMETGEVAVGQPLDAKGTHTVNIKRQPRFSYDQNLVARAIAGIGMPETDFSKRAMKSTAKLLVAMIEEGAITEGFDYKPHSFVAYKDCPCDAMRNAMPAIQKVALSRLARRRDDR
Physico-chemical properties
protein length: 204 AA
molecular weight: 23108.9 Da
isoelectric point: 9.30
hydropathy: -0.53
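The length and hydropathy values above can in principle be recomputed from the protein sequence. The following is a minimal sketch, assuming that "hydropathy" means the grand average of hydropathy (GRAVY) on the Kyte-Doolittle scale; the page does not name the scale it uses, so this is an assumption, and the function names `sequence_length` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy index per amino acid.
# Assumption: the page's "hydropathy" is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def sequence_length(seq: str) -> int:
    """Number of residues, ignoring surrounding whitespace."""
    return len(seq.strip())


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle value over all residues.

    Raises KeyError for residues outside the 20 standard amino acids.
    """
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    # First ten residues of the protein sequence listed above.
    fragment = "MTQTSITTRV"
    print(sequence_length(fragment), round(gravy(fragment), 2))
```

Passing the full protein sequence from this record to both functions gives values to compare with the listed length and hydropathy; a mismatch in hydropathy would suggest the page uses a different scale.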
Representative Protein Details
Accession
4T1qD
Protein name
4T1qD
Sequence length
211 AA
Molecular weight
23572.71620 Da
Isoelectric point
10.16017
Sequence
MTITQRVVRRARQRGVTVLTHRQWGSTEMRTYAERRKLTREGHWPGFRQLVDTVAQHITVTAATDDFAHDCRVVEQIGMQRFGSGVSYNFLVHIRSGEAAVGQPLDSKGTHTVNDKGVEGYSYDQNLAARAIAVVGMPGDKLSRKAKRTIVHLLAAMVEEGAVTRGFDYVPHSLFAWKDCPCDSTRSQMPKIRAAVDRKLAKPVRFLGGPR
Other Proteins in cluster: phalp2_7472
Total (incl. this protein): 9; Avg length: 205.7 AA; Avg pI: 9.39

Protein ID  Length (AA)  pI
4T1qD       211          10.16017
1a9Nh       204          6.07748
3NHGm       212          10.09093
401WF       197          9.77787
5kjwf       205          9.09573
85DTa       205          10.12329
8rmYq       212          10.16365
c0Wk        201          9.75099
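The cluster summary (9 members, average length 205.7 AA, average pI 9.39) can be checked against the table. The sketch below transcribes the eight listed members and adds this protein (4JjDG, 204 AA, pI 9.30) from the physico-chemical section; the variable names are illustrative.

```python
# (protein ID, length in AA, isoelectric point), transcribed from this record.
members = [
    ("4JjDG", 204, 9.30),      # this protein, not repeated in the table
    ("4T1qD", 211, 10.16017),  # cluster representative
    ("1a9Nh", 204, 6.07748),
    ("3NHGm", 212, 10.09093),
    ("401WF", 197, 9.77787),
    ("5kjwf", 205, 9.09573),
    ("85DTa", 205, 10.12329),
    ("8rmYq", 212, 10.16365),
    ("c0Wk", 201, 9.75099),
]

total = len(members)
avg_length = sum(length for _, length, _ in members) / total
avg_pi = sum(pi for _, _, pi in members) / total

print(total, round(avg_length, 1), round(avg_pi, 2))
```

The rounded results agree with the summary line, which confirms that the stated averages include this protein along with the eight listed members.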
Similar Clusters (pHMM search)
#  Cluster               # Members  Identity (%)  Alignment Length  E-value
1  phalp2_16674 (nLZD)   1          42.7          192               5.515E-54
2  phalp2_11605 (1hk9s)  6          34.8          201               8.042E-26
3  phalp2_3598 (4JlBw)   7          28.9          145               7.460E-11
4  phalp2_3739 (5tm9a)   5          25.4          185               1.820E-07
5  phalp2_20510 (4c5NL)  19         24.5          159               5.975E-07
6  phalp2_23216 (7DAyW)  82         23.3          193               6.373E-06
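The E-values in the similar-cluster table are written in scientific notation that Python's `float` parses directly, which makes it easy to keep only the stronger hits. The following sketch assumes a cutoff of 1e-10; that threshold is hypothetical and is not stated anywhere on this page, and `significant_hits` is an illustrative helper name.

```python
# (cluster, members, identity %, alignment length, E-value as printed)
hits = [
    ("phalp2_16674", 1, 42.7, 192, "5.515E-54"),
    ("phalp2_11605", 6, 34.8, 201, "8.042E-26"),
    ("phalp2_3598", 7, 28.9, 145, "7.460E-11"),
    ("phalp2_3739", 5, 25.4, 185, "1.820E-07"),
    ("phalp2_20510", 19, 24.5, 159, "5.975E-07"),
    ("phalp2_23216", 82, 23.3, 193, "6.373E-06"),
]

E_VALUE_CUTOFF = 1e-10  # hypothetical threshold, chosen for illustration


def significant_hits(rows, cutoff):
    """Return cluster names whose E-value is at or below the cutoff."""
    return [row[0] for row in rows if float(row[4]) <= cutoff]


print(significant_hits(hits, E_VALUE_CUTOFF))
```

With this cutoff the first three clusters are retained; a looser or stricter threshold changes the selection accordingly.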

Domains

Unannotated
Representative sequence (used for alignment): 4T1qD (211 AA)
Member sequence: 4JjDG (204 AA)
Axis: positions 1 to 211 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

       Name                            Taxonomy ID     Lineage
Phage  Unknown from Metagenome [NCBI]  UNKNOWN_ENVHOG  No lineage information
Host   No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4T1qD) rather than this protein.
PDB ID 4T1qD
Method AlphaFoldv2
Resolution 95.04
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
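The confidence legend above maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows; the legend uses strict inequalities and does not say where scores of exactly 90, 70, or 50 belong, so assigning them to the lower band is an assumption, and `plddt_band` is an illustrative name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band,
    because the legend only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for value in (96.2, 81.0, 63.5, 42.0):
        print(value, plddt_band(value))
```

Applied per residue, this gives the same colour classes that the structure viewer's legend describes.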