Protein

Protein accession
A0A2L0V0Z1 [UniProt]
Representative
4G9Kg
Source
UniProt (cluster: phalp2_33090)
Protein name
PE-PGRS virulence associated protein
Lysin probability
99%
PhaLP type
endolysin
Probability: 97% (predicted by ML model)
Protein sequence
MSCDPKMTGAGKSSGNSSGSGGSGGSESNGSGGNNGSGEGGSNEKLANDTTEANIKFNKQRVDEITAEMRKKGYNDIQIAAALGHWKNESGFRLDALNRGDGRDGSDSIGLAQWNSSRAAGLKNYAAQNGLPVNSVAAQVGWFDYEMKNTEKGAGYAFKNATTVAEASWAMNKYERFQGYNSTTSSQTISRLQNSNTFYSYITKK
Physico-chemical properties
protein length: 205 AA
molecular weight: 21764.3 Da
isoelectric point: 8.87
hydropathy: -0.86
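
The hydropathy value above is presumably a grand average of hydropathy (GRAVY), but the page does not say which scale or tool produced it. The following is a minimal sketch, assuming the Kyte-Doolittle scale and a plain mean over all residues; the function name `gravy` is illustrative and not part of any PhaLP tooling.

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumption: the source page does not state which scale it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    # A KeyError here means the sequence contains a non-standard residue.
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


if __name__ == "__main__":
    # First 12 residues of the protein sequence listed above.
    print(round(gravy("MSCDPKMTGAGK"), 2))
```

Passing the full 205-residue sequence to `gravy` gives a value that can be compared with the reported figure; a mismatch would suggest a different scale or averaging scheme.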
Representative Protein Details
Accession
4G9Kg
Protein name
4G9Kg
Sequence length
613 AA
Molecular weight
66362.87200 Da
Isoelectric point
8.88498
Sequence
MPPIPVYDFTTKDYGVADMIASMPDAFFSGYDRAQKRLKENQEADAKDRSFQSVFGLGGGEQDQNIVAQPSSPVAALGLGAPQQAQTSPRIDQAFADAQPKMPSFQAMQGGDMGDVKSKVYNSLIKNGVSPVAAIGLTGNLAQESGFRTDARNRGDGRDGSDSIGLAQWNQDRAKNLLGFAASKGLDWRDPDVQGAFIAHELKTTEGRAGQALAQAQTPEEAARAAIGYFRPAGFTWNNPMMAHGAENRIAQARRAAQEFGVSGQQTASAPAQGSTEAQGFVAPQQGQQPQQNSYAQQLMARAQALARQAQATGNRTLKDRAVEMHDKAIEAQQKETYGFQAFGDQLLRTDPRTGKTEVIMNKPSENKKPVLVQEYEYAKNNGFQGSIFDYQKAVEEAKRGGKGTGSELDKVAEREQAADKLGLQGEDRRLYLANGKVPAGSEKLTNDQANAGLYADRMRKSNAILEKPEIEGEALSLKQKAYSSIPVIGNYAVSNKFQLLDQARRDFINATLRRESGAVIQPVEFDNANKQYFPQPGDSAAVLAQKKANRQTAIDGISRAGGVSYSKENPAGTKANNAQQPQPSQITSEQQYQTLPSGAKYIDPQGQIRTKR
Other Proteins in cluster: phalp2_33090
Total (incl. this protein): 2
Avg length: 409.0 AA
Avg pI: 8.88

Protein ID   Length (AA)   pI
4G9Kg        613           8.88498
Similar Clusters (pHMM search)
#   Cluster                 # Members   Identity (%)   Alignment Length   E-value
1   phalp2_25174 (1QqYT)    1           31.7%          479                1.981E-30
2   phalp2_11262 (6Ke74)    2           27.1%          582                1.713E-26
3   phalp2_23154 (4IjBV)    1           27.1%          438                6.592E-20
4   phalp2_10136 (8CLKY)    3           23.7%          489                9.890E-14

Domains

Domains [InterPro]

No domain annotations available.

Taxonomy

        Name                                  Taxonomy ID   Lineage
Phage   Agrobacterium phage Atu_ph07 [NCBI]   2024264       Polybotosvirus > Polybotosvirus Atuph07
Host    No host information

Coding sequence (CDS)

CDS Source ID
CDS Source
MF403008 [NCBI]
CDS location
range 482509 -> 483126
strand +
CDS
ATGAGTTGCGATCCAAAAATGACGGGTGCTGGAAAATCATCCGGCAACTCATCTGGCTCTGGTGGTTCAGGTGGTTCAGAATCGAACGGAAGTGGCGGAAATAATGGATCAGGTGAAGGCGGTTCTAATGAGAAGTTAGCTAATGATACAACTGAGGCTAATATTAAGTTTAATAAACAACGCGTAGACGAAATTACAGCAGAGATGCGAAAAAAAGGTTATAACGATATACAAATTGCGGCTGCACTTGGTCATTGGAAAAATGAATCTGGATTTAGACTTGATGCTCTTAATAGAGGCGACGGTCGTGATGGTTCAGATTCCATTGGTCTTGCACAATGGAATAGTTCACGCGCTGCTGGACTTAAAAATTACGCTGCTCAAAATGGATTACCCGTTAATTCTGTTGCAGCACAAGTTGGATGGTTCGATTATGAAATGAAAAATACTGAAAAAGGCGCTGGATATGCTTTTAAAAATGCGACTACAGTTGCTGAAGCAAGTTGGGCAATGAACAAGTATGAAAGATTTCAAGGTTATAACTCAACCACAAGTTCACAAACGATTAGTCGCTTGCAAAATTCGAATACCTTCTACTCGTATATAACCAAAAAATAG
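
The CDS coordinates and the protein length can be cross-checked with a little arithmetic and a codon-by-codon translation. The sketch below assumes the range is 1-based and inclusive (the usual GenBank convention) and that the standard codon assignments apply to this phage; the helper names `cds_length` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Assumption: this phage CDS uses the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def cds_length(start: int, end: int) -> int:
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.strip().upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    n = cds_length(482509, 483126)
    print(n)            # 618 nucleotides
    print(n // 3 - 1)   # 205 residues once the stop codon is excluded
    # First eight codons of the CDS above, plus its stop codon.
    print(translate("ATGAGTTGCGATCCAAAAATGACGTAG"))  # MSCDPKMT
```

The 618-nucleotide range corresponds to 206 codons, which matches the reported 205 AA protein plus the terminal TAG stop codon.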

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
upi000cdc1f98_model
Method AlphaFold3 (non-commercial)
Resolution -
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
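
The confidence bands above can be applied to per-residue pLDDT scores with a simple threshold function. This is a sketch only: the legend uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so placing them in the lower band is an assumption, and `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band from the legend."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```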

The structures below correspond to the cluster representative (4G9Kg) rather than this protein.
PDB ID
4G9Kg
Method AlphaFoldv2
Resolution 65.50
Chain position -