A Bioinformatics Approach To The Structural And Functional Analysis Of .

1y ago
8 Views
2 Downloads
2.76 MB
31 Pages
Last View : 15d ago
Last Download : 3m ago
Upload by : Jewel Payne
Transcription

A bioinformatics approach tothe structural and functionalanalysis of the glycogenphosphorylase protein familyJieming Shen1,2 and Hugh B. Nicholas, Jr.31Bioengineeringand Bioinformatics Summer Institute, Department of Computational Biology,University of Pittsburgh, Pittsburgh, PA 152612Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 088543Biomedical Initiative Group, Pittsburgh Supercomputing Center, Pittsburgh, PA 15213

Objectives Build a phylogenetic profile for the glycogenphosphorylase protein family from existingsequences Identify orthologues/paralogues; orthologues have samebiochemical/physiological roleProteins involved in the same pathway usually havesimilar phylogenetic profilesAids in identification of pathways and physiologicalprocesses in which uncharacterized protein appearsConduct a validation study of the ProbConsalignment algorithm Prepare multiple sequence alignments in parallel usingProbCons as well as more established methods forobtaining alignment (ClustalW)Compare .MeinelATmolgen.mpg.de 323.html

Background Information Glycogen phosphorylase plays a major role incarbohydrate metabolism by catalyzing thebreakdown of glycogen into glucose subunits Acts on linear chains of glycogenGlycogen shorted by one residueDimer of 2 identical ib/media portfolio/13.html

Methodology Overview Sequence Retrieval(iProclass)PatternIdentification andMatching (MEME)Multiple SequenceAlignment identificationRefined Multiple Sequence Alignmentand matchingMultiplePrinciple CompBootstrap Phylosequenceonent Analysisgenetic Analysisalignment(SeqSpace)(PHYLIP suite)PrinciplecomponentIdentified SubgroupsanalysisGroup EntropyPhylogeneticAnalysis (GEnt)analysisGroup entropyDistinctive Residuesanalysis3D visualizationPhylogenetic Profile

Sequence Retrieval iProclass databaseObtained 355 glycogen phosphorylasesequences of about 800 residues each Eliminated duplicates or nearduplicates—left with 282 sequences Final dataset included top mostsequenced genomes: honeybee,chicken, sea squirt, cow, dog,drosophila, Japanese puffer fish, human,mosquito, mouse, and C. elegans

Pattern Identification andMatching MEME (Multiple EM for MotifElicitation)Discovers motifs (highly conservedsequence patterns) in group of relatedprotein sequences Sorts through sequences and finds andreports motif alignments regardless oftheir placement along the protein Obtained 20 highly conserved motifs Conserved residues and motifs essentialto protein structure and function

Multiple Sequence Alignment ClustalW ProbCons Consistency-based method 10 hoursT-COFFEE, another consistency-basedmethod, was attempted but did notcomplete the MSA within a reasonableamount of time (2 weeks) Progressive method 1 hourmay have had to do with large size of datasetand long sequence lengthGeneDoc program used to highlight MEMEpatterns in the MSAs

Multiple Sequence Alignment GeneDoc: ProbCons conserved residues

Multiple Sequence Alignment GeneDoc: ClustalW conserved residues

Multiple Sequence Alignment GeneDoc: ProbCons with highlighted MEME motifs

Multiple Sequence Alignment GeneDoc: ClustalW with highlighted MEME motifs

Refined MSA Incorporated information from boththe ClustalW and ProbCons outputsRefined alignment by hand usingMEME motifs as guide

Principle Component Analysis SeqSpace One method to identify groups of subfamilies“Top-down” analysisTakes distance measures among set ofsequences and converts into self-consistent setof coordinates in some arbitrary number ofdimensions Coordinates plotted and examined for clusters ofsequencesClustered protein sequences share similar pattern ofsubstitutions—presumed to reflect some commonbiochemical/physiological property or function

SeqSpace: Dimensions 2 x 3VariousbacteriaandarchaeaMetazoa

Bootstrap PhylogeneticAnalysis with PHYLIP suite ClustalW alignment with gap columnseliminatedGblocks: eliminate poorly aligned positionsand divergent regions of sequence ofprotein alignmentSeqboot: generate multiple data sets(1000) that are resampled versions of theinput data set, randomly sample columnswith replacementProtdist: analyzes the multiple data setsand computed a distance measure forprotein sequences, using maximumlikelihood estimates based on the DayhoffPAM matrix in this case

Bootstrap PhylogeneticAnalysis with PHYLIP suiteNeighbor: constructs a tree bysuccessive clustering of lineages,setting branch lengths as the lineagesjoin Consense: computes consensus treesby majority-rule method Phylogenetic consensus tree viewedin ATV viewer and TreeView Groups of subfamilies compared tothose obtained from Seqspace forfurther refinement

TreeView Unrooted tree forMetazoa subfamilyrecalculated fromdistinct subset ofoverall treeReveals enzymeisozyme groups forbrain, muscle, andliver tissue withinthe moremedi.htmhttp://en.wikipedia.org/wiki/Biceps brachii

GEnt Group Entropy Analysis Identify distinctive features of mutuallyexclusive subsets determined byphylogenetic analysis through crossentropy analysis of the columns in MSAWithin single column, contrast: Amino acid composition within defined group ofsequencesAmino acid composition in all sequencesoutside of groupAlignment positions where residuecomposition inside groups is very differentfrom that outside group are expected toindicate positions associate with distinctiveproperties of sequences of that subgroup

Group Entropy Equations pi Family Entropy Distance ( pi ) log 2 qi pi Group Entropy Distance ( pi qi ) log 2 qi pi foreground residue frequencyqi background residue frequency

GEnt Results for MetazoaSubfamily5.05.5Important to family3.53.02.52.0Family y-w360k-f0.5479f-fNot enoughinformation02468Group Entropy1012Importanttosubfamily1416

2w-w1.0322w-w1304d-a1141k-l249d-q1223l-t0.5Family Entropy4.04.55.05.5GEnt Results for FungiSubfamily0123456Group Entropy7891011

GEnt Distinctive subfamily residues highlighted

Visualization3D structures for human liver isoformand yeast obtained from Protein DataBank Residues of interest highlighted inVMD

Visualization: Human LiverIsoformGEnt: distinctive residuesUniProt:binding/association sites

Visualization: Human LiverIsoform Globally conserved MEME motifs

Visualization: YeastGEnt: distinctive residuesUniProt: binding sites

Visualization: Yeast Globally conserved MEME motifs

Conclusions GEnt In metazoans: identified residue (155) knownto be important in allosteric controlOther predicted residues have high probabilityof reflecting important adaptations of enzymeto environmentDetermined possible evolutionary path ofglycogen phosphorylaseMSA methodology ProbCons and T-COFFEE did not perform welldue to the large number of sequences and longsequence length of the datasetIn this case, the ClustalW alignment methodgave the best results

Future WorkExperimentally test GEnt-predictedresidues (site-directed mutagenesis) Additional computation work withquantum mechanics and molecularmechanics to identify roles ofconserved residues, motifs, and GEntpredicted residues

AcknowledgementsHugh B. Nicholas, Jr. Alexander J. Ropelewski Ricardo G. Mendez Pittsburgh Supercomputing Center Bioengineering & BioinformaticsSummer Institute

References PSC website ials/)Michael M. Crerar, lm.htm)Buchbinder et al. (2001) Structuralrelationships among regulated andunregulated phosphorylases. Annu. Rev.Biophys. Biomol. Struct. 30: 191-209.Nicholas Jr., H.B. Glutathione S-transferasesubfamily differences: remodeling thesubunit and domain interfaces.

A bioinformatics approach to the structural and functional analysis of the glycogen phosphorylase protein family Jieming Shen1,2 and Hugh B. Nicholas, Jr.3 1Bioengineering and Bioinformatics Summer Institute, Department of Computational Biology, University of Pittsburgh, Pittsburgh, PA 15261

Related Documents:

May 02, 2018 · D. Program Evaluation ͟The organization has provided a description of the framework for how each program will be evaluated. The framework should include all the elements below: ͟The evaluation methods are cost-effective for the organization ͟Quantitative and qualitative data is being collected (at Basics tier, data collection must have begun)

Silat is a combative art of self-defense and survival rooted from Matay archipelago. It was traced at thé early of Langkasuka Kingdom (2nd century CE) till thé reign of Melaka (Malaysia) Sultanate era (13th century). Silat has now evolved to become part of social culture and tradition with thé appearance of a fine physical and spiritual .

On an exceptional basis, Member States may request UNESCO to provide thé candidates with access to thé platform so they can complète thé form by themselves. Thèse requests must be addressed to esd rize unesco. or by 15 A ril 2021 UNESCO will provide thé nomineewith accessto thé platform via their émail address.

̶The leading indicator of employee engagement is based on the quality of the relationship between employee and supervisor Empower your managers! ̶Help them understand the impact on the organization ̶Share important changes, plan options, tasks, and deadlines ̶Provide key messages and talking points ̶Prepare them to answer employee questions

Dr. Sunita Bharatwal** Dr. Pawan Garga*** Abstract Customer satisfaction is derived from thè functionalities and values, a product or Service can provide. The current study aims to segregate thè dimensions of ordine Service quality and gather insights on its impact on web shopping. The trends of purchases have

Chính Văn.- Còn đức Thế tôn thì tuệ giác cực kỳ trong sạch 8: hiện hành bất nhị 9, đạt đến vô tướng 10, đứng vào chỗ đứng của các đức Thế tôn 11, thể hiện tính bình đẳng của các Ngài, đến chỗ không còn chướng ngại 12, giáo pháp không thể khuynh đảo, tâm thức không bị cản trở, cái được

Bioinformatics Crash Course Ian Misner Ph.D. Bioinformatics Coordinator UMD Bioinformatics Core . Bioinformatics!Core The Plan Monday – Introductions – Linux and Python Hands-on Training Tuesday – NGS Introduction – RNAseq with Sailfish (Dr. Steve Mount, CBCB) – RNAse

Le genou de Lucy. Odile Jacob. 1999. Coppens Y. Pré-textes. L’homme préhistorique en morceaux. Eds Odile Jacob. 2011. Costentin J., Delaveau P. Café, thé, chocolat, les bons effets sur le cerveau et pour le corps. Editions Odile Jacob. 2010. Crawford M., Marsh D. The driving force : food in human evolution and the future.