Multiscale protein-protein interactions
Disorder & aggregation
Travis Hoppe
CSULA Seminar: February 3, 2015
National Institutes of Health (NIH)National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)Laboratory of Chemical Physics (LCP), Theoretical Biophysical Chemistry (TBC)
Biophysical question #1
How do we predict phase separations of protein solutions?
Biophysical question #2
How do we make predictions about intrinsically disordered proteinsgiven their large conformational landscape?
Acknowledgments:
Laboratory of Chemical Physics
Robert Best
Wenwei Zheng
Laboratory of Biochemistry and Genetics
Allen Minton
Di Wu
*Support provided by the Intramural Research Division of the NIDDK, NIH.
Protein Structure
Primary structure (sequence)
GSIGAASMEF CFDVFKELKV HHANENIFYC PIAIMSALAM VYLGAKDSTR TQINKVVRFD KLPGFGDEIE AQCGTSVNVH
SSLRDILNQI TKPNDVYSFS LASRLYAEER YPILPEYLQC VKELYRGGLE PINFQTAADQ ARELINSWVE SQTNGIIRNV
LQPSSVDSQT AMVLVNAIVF KGLWEKAFKD EDTQAMPFRV TEQESKPVQM MYQIGLFRVA SMASEKMKIL ELPFASGTMS
MLVLLPDEVS GLEQLESIIN FEKLTEWTSS NVMEERKIKV YLPRMKMEEK YNLTSVLMAM GITDVFSSSA NLSGISSAES
LKISQAVHAA HAEINEAGRE VVGGAEAGVD AASVSEEFRA DHPFLFCIKH IATNAVLFFG RCVSP
Secondary structurehelices [red], sheets [blue
Tertiary structure3D structure
Higher-order structurecomplexes, aggregation
Primary Structure
Twenty residue "alphabet" forms polypeptide chain
Protein folding problem
Predict structure from sequence
Sequence Structure Function
Native structure, folding pathways, ...
Scientific Philosophy
Theoreticians need to keep in close contact with experimentalists.Imagination must be constrained by reality.
Models must as simple as possible (but no simpler).
Part 1: Aggregation
How do we predict phase separations of protein solutions?
Higher order structure
Phase separations lead to sudden changes in liquid structure.
Leibler, Nature 2004
Tanaka, Phys. Rev. E 2005
How do we model many protein-protein interactions?Can we predict aggregates from experimental structure?
Human serum albuminPDB:1AO6
OvalbuminPDB:1OVA
LysozymePDB:1W6Z
Bovine Serum AlbuminPDB:3V03
Protein-Protein interactions
Important terms:
Volume exclusion, Electrostatics, solvent effects,Non-specific interactions (London/dispersion forces)
Second-order effects?
Non spherical geometries, polarization,internal conformational energies, ...
Need a way of validating model.
Experimental Measurements
Second virial coefficient, , measurementusing light scattering at different pH.
Dotted-line: Hard sphere potential. Good enough for sickle cell hemoglobin!
Virial Coefficients
An equation of state expanded in powers of density
is the pairwise interaction of two molecules is the interaction of three molecules, ...Negative values of often correlate with aggregation.
For rotationally invariant molecules*
Goal: Develop a realistic pair potential for virial calculation.
The Process
Start with the crystallized PDB Structuree.g. Human Serum Albumin PDB:1A06
Solve for with the Adaptive Poisson-Boltzmann Solver (APBS),
Typically (in the absence of ions), and .
Macrocharge fitting
Best fit macrocharges to approximate the field.
Decompose the field, determine a region of excluded volume +spherical harmonic decomposition for large distances.
Matching experiments
Theoretical predictions of the second virial coefficient considering only excluded volume and reduced electrostatics.
Matching experiments
Theoretical predictions of the second virial coefficient considering only excluded volume and reduced electrostatics.
A Simplified Representation of Anisotropic Charge Distributions within Proteins, Hoppe,
J. Chem. Phys. (2013)
Phase separations summary
Calculate the non-ideality of a protein molecule after includingboth the excluded volume and electrostatics.
Predict the second-virial coefficient as a function of pH values, protein concentrations, binary mixtures, and salt concentrations.
Ongoing research: Use the model in higher-order simulationsto predict phase behavior via Gibb's ensembles.
Part 2: Disorder
How do we make predictions about intrinsically disordered proteinsgiven their large conformational landscape?
Paradigm shift
Proteins were thought to adopt stable, folded conformations.Solving the structure was paramount for understanding the function.
Unexpected: disorder is abundant!
Grouping proteins in the yeast proteome, Gsponer,
Science (2008)
Intrinsically disordered proteins
Structure
- Lack tertiary structure (disorder!)
- Still may form secondary structure
- Different primary structure (residue propensity)
- More charged, less hydrophobic and aromatic residues
Binding
Not disordered, Lock and KeyBarnase-Barstar complex
Disorder-to-orderHif-1 α/CBP
Always disorderedSIC1 binding to CDC4
Theory
- What advantages do IDPs have over traditional proteins?
- Recognition that the cellular environment is a crowded place.
Function
- Often found in signaling pathways, centers of protein hubs
- Linkers (entropic chains), Chaperones, HIV transcription (TAT)
- Binding specificity, with lower affinity
Modeling
IDPs: Folding Sampling
Goal: Develop a model for IDP interactions.
Statistical Potentials
Residue-residue interactions, quasi-chemical lattice-gas
Residue-residue interaction matrix, MJ
MJ matrix reveals biophysical structure
H (hydrophobic), P (polar), C (charged)
MJ Contact energy, from structure
Mean-field (MF) energy, from sequence
MJ contact energy reproduces MF energy
MF Energy distributions: Physically reasonable
Protein Networks
- Target protein interacts with a range of possible surfaces.
- Measure average binding affinity of protein to surfaces.
- Measure binding specificity of protein to surfaces.
Example network: Protein-protein interactions in yeast, S. cerevisiaeSchwikowski & Fields et al., Nature 2000.
Protein-complex energy
Pairwise decomposition of protein complex energy; Binding affinity
Contact matrix is not symmetric
Specificity score: Define "decoys" as weakly boundstructures in protein network.
MF IDP Summary:
- MF models reproduce MJ contact energies. MF IDP's bound to native structures show increased specificity with lower affinity.
PDB:1B8A
1B0B
1BQ8
1DQP
1DOI
1C4Q
1ARB
1BXU
1CC8
1CCJ
1DFU
1DMG
What's next? Add structure to mean field calculations.Lattices may be optimal for IDP's, they can reproduce native-energies but quickly sample extended conformational space.
Active research projects & collaborations
Crowding, surface adsorption and protein fibrillation, Biophys (in press).Programmable Nanoscaffolds and Multivalent Effects, JACS.Integer sequence discovery from small graphs, Discrete Math. (submitted).Dependence of Internal Friction on Folding Mechanism, JACS (submitted).Quantification of plasma HIV RNA, Nature Comm.
Allen Minton
Andrew Dix, Daniel Appella, et. al
Anna Petrone
Robert Best, Wenwei Zheng
Zhao, Daniel Appella, et. al.
Future Research Projects
Phase separation calculations, aggregation.
Quantitative IDP models, disorder.
Benchmarks in sampling algorithms, BiSA.
Graph fingerprint and invariant database, EoFG.
Dependence of topology on sampling, WL topology.
Theoretical liquid state calculations for simple potentials.
Entropic microscopes: free chain calculations of PNA.
0
Multiscale protein-protein interactionsDisorder & aggregation
Travis Hoppehttps://github.com/thoppe/Presentation_Research_IDP
CSULA Seminar: February 3, 2015
National Institutes of Health (NIH)National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK)Laboratory of Chemical Physics (LCP), Theoretical Biophysical Chemistry (TBC)