MAYA-MOL

GRAPHICS TOOLS FOR MOLECULAR NEUROSCIENCE

By Ian Patrick Coyle

UW-Madison Department of Genetics

Dr. Barry Ganetzky Laboratory

A Computer Science Project for the Laboratory of Michael Gleicher

 

Abstract

As a trainee in the UW-Madison Biotechnology Training Program, I am developing software tools to visualize the molecular dynamics of the neuron. My goal is to generate high quality animations using scale models of molecules to illustrate the complex events in neuron function, such as synaptic transmission, axon pathfinding, and learning. This will be an open-ended project conducted under the supervision of Michael Gleicher during the course of my Ph.D. research with Dr. Barry Ganetzky in molecular neurogenetics. My primary subject will be synaptic transmission, a process in which electrical and macromolecular activity allow one neuron to perceive a stimulus and, within milliseconds, output a response (typically neurotransmitters). My hypotheses are that animations like mine will: 1. Provide lucid and engrossing explanations of neuroscience concepts; and, 2. Promote fine-tuning of existing conceptual models, most of which are still quite incomplete and hotly debated. The high quality animations will be the product of combining detailed molecular structure information from the National Center for Biotechnology Information (NCBI) with highly powerful 3D modeling software that can reconstruct the cellular milieu of these molecules and approximate the physical forces influencing them. Scientists build conceptual models of molecular systems by interpreting data collected during their research. My hope is that after that data is gathered in this way into visible, connected images containing a greater amount of detail than can be stored in the mind’s eye, discrepancies between models will be able to be evaluated, and new insights and directions pursued. However, I do not expect these animations to have much predictive value; they will be biased towards explanation rather than prediction, and will produce representations of molecules that are more abstracted than those from other more specialized molecular modeling programs. In Phase I of this project, I developed MAYA-Mol, a Perl script that translates Protein Data Bank files describing the crystal structures of proteins into a ribbon diagram of that protein in the MAYA Embedded language (MEL). MAYA is a leading 3D modeling suite from Alias-WavefrontTM. Now in Phase II, I will test the above hypotheses by assembling animated scenes of these proteins in MAYA. Soon, in Phase III, I will specify physical and chemical forces on the proteins in the animations to make them accurate representations of the events occurring in your brain right now.

Introduction

Human intelligence is an emergent property of the collective activity of the approximately one trillion neurons in the brain. It is too complex for science to currently explain. Nevertheless, the foundation for an understanding of this phenomenon is being assembled as the properties of the neuron are being discovered. I am involved in the search for genes expressed in or impinging on the nervous system of the fruit fly, Drosophila melanogaster, which is the most intelligent organism on the planet for which its entire genome has been sequenced (The Drosophila Genome Project, with a big boost from Celera Genomics, Inc., finished sequencing all of the DNA in the Drosophila genome at the very end of 1999; the mouse and human genomes will be completely sequenced in just a few more years). Since genes encode proteins, and neurons in all organisms are similar, the fruit fly gives us the first system for which every component of a neuron can be identified and studied. The next step in the marathon to understanding intelligence will be to coordinate them all into an explanation of neuron structure and function. Keep in mind, human neurons contain some extra components but still rely on the same basic mechanisms to work, and almost every gene expressed in fly brains has a relative in human brains.

Methods

To help me assign functions to the as yet, un-characterized genes I study in the fly brain, I decided to embark on this project. Phase I involves the transfer of molecules into 3D modeling software so they can be manipulated in silico. MAYA is sexy and powerful, so I decided to use it as my software for phases I – III. I started by writing MAYA-Mol, a short Perl script that reads files from the Protein Data Bank at NCBI and extracts the x, y, z coordinates of the atoms in crystallized proteins. Then it draws a second-degree curve through those points in MAYA’s language, MEL. When imported into MAYA and extruded with an oval, the output of MAYA-Mol produces a ribbon diagram of the protein.

 

 

Figure 1. Crystal structure of the vesicle fusion protein, Sec17 (Rice and Brunger, 1999), as interpreted by MAYA-Mol.

 

Not only proteins, but also lipids, sugars and nucleotides play roles in neurons, and eventually I will create ways of modeling these macromolecules in MAYA. Once a structure is represented three dimensionally in MAYA, a fourth dimension, time, can be added. Other molecules can be inserted into the scene and animated, the numbers and actions of which are limited only by what is known to science and published in databases. I have not yet developed a technique for explaining atomic bonds or chemical attraction in MAYA, but MAYA’s "Dynamics" menu set and the ability to write programs in MEL will allow me to do this in the future. MAYA does have the inherent ability to model simpler physical forces like gravity, magnetism, and diffusion.

 

Figure 2. Diffusion of the neurotransmitter, dopamine, into the synaptic cleft. Dopamine is represented by its chemical structure, and calcium channels are shown as purple rings protruding through the membrane of the presynaptic nerve terminal, marking the sites of vesicle fusion.

 

Phase I will benefit when I update MAYA-Mol to mine more information from PDB files, specifically the side groups on the amino acids which make up the protein chains. Then I can assign charges to the atoms protruding from the side groups and use MAYA’s ability to model attractive and repulsive forces to create a van der Waals surface around each protein. Then, if informed which surfaces were supposed to stick together, MAYA could snap them into the lowest energy state and, therefore, the correct alignment on an atomic scale. But, as it stands right now, when I show proteins interacting they are just "eyeball" approximations of the appropriate binding activities, which do not demonstrate the chemical forces that make the actual proteins interact. And even MAYA-Mol learns science, it will only be able to recreate what real experiments have already determined about which proteins interact and with what surfaces – no program in the world can tell you that beforehand. Nevertheless, MAYA-Mol will be able to show the user what our data mean in four-dimensional space.

Results

The movie below is an example of what this set of tools can do so far. The background is as follows. At the nerve terminal, vesicles filled with neurotransmitters wait for the electrical signal that means it is time to send an output to the next neuron(s). When that electrical signal arrives (animated as blue electricity), voltage-gated calcium channels (red) open briefly to allow calcium ions to diffuse into the nerve terminal where they are in low concentration from the outside, where they are in high concentration. Proteins on the vesicle detect this calcium and respond by changing shape, setting in motion a series of protein-protein interactions which cause the vesicle to fuse with the cell membrane. Fusion exposes the inside of the vesicle to the outside of the neuron, thus neurotransmitters are exported in 0.5 milliseconds. The proteins that interacted to effect this fusion are then transported to another site to be recycled. Only two of these proteins are shown. The light blue is syntaxin, a transmembrane receptor protein that positions the fusion machinery near calcium channels. The dark blue is synaptotagmin, which may be the primary "trigger" for fusion upon calcium binding and which recognizes syntaxin as a binding partner. According to X-ray crystallography, synaptotagmin binds 3 calcium ions as shown in the animation (Sutton et al, 1995).

There are three cautionary notes to be made. First, the precise three-dimensional structure of a calcium channel is unknown, so it is represented abstractly. Second, the published structures of synaptotagmin and syntaxin are incomplete, so they are only partially correct in the animation. And third, I did not animate the binding of these two proteins accurately for the reasons I mentioned in the methods section. On an optimistic note, the uncertainty whether synaptotagmin could be the primary calcium sensor is exactly the type of question I hope 3D models can ultimately help resolve, since we can actually watch it in action..

 

 

 

 

Figure 3. Synaptic vesicle fusion during synaptic transmission. Click on the movie icon to view it. The file size is 4.18 MB.

 

Discussion and Conclusions

One problem with three-dimensional modeling is the massive amount of random access memory needed to model complex surfaces. Molecular surfaces are especially onerous, and when more than a few are animated, RAM becomes a limiting factor. MAYA is going to have an awful time calculating the electrical forces on a protein when I eventually depict them with side-chains, and may not be able to handle as many as I want even on a muscular computer. Another limitation I encountered was the lack of structural information for many of the key players in this system, like the calcium channel. Channel proteins cannot be visualized with current techniques like X-ray Crystallography and Nuclear Magnetic Resonance (NMR) because they are too big and live within the membranes of cells, so I may never have these structures to animate.

On the other hand, the simple test images I created this semester turned out to be more attention getting than I had anticipated. The explanatory value of this project will be unprecedented among free software available for molecular modeling -well, MAYA is far from free, but I will publish MAYA-Mol next year.

In conclusion, I believe these results support my first hypothesis, and this project is going to be valuable to me for organizing conceptual models and explaining where the novel proteins I investigate in the next several years fit into the molecular mechanics of neurobiology. Also, the resulting animations will add clarity, content, and character to my presentations. But I encountered limitations with MAYA that precluded any strong conclusions in favor of my second hypothesis. I do not foresee this project modeling these systems in the level of detail I initially expected. I plan to try other 3D software and investigate the potential for hybridizing MAYA-Mol to more accurate, specialized molecular modeling programs in the near future.

References

Fernandez I, Ubach J, Dulubova I, Zhang X, Sudhof TC, Rizo J. Three-dimensional structure of an evolutionarily conserved N-terminal domain of syntaxin1A. Cell. 1998 Sep18;94(6):841-9.

Rice LM, Brunger AT. Crystal structure of the vesicular transport protein Sec17: implications for SNAP function in SNARE complex disassembly. Mol Cell. 1999 Jul; 4(1):85-95.

Sutton, R.B. et al. Structure of the first C2 domain of synaptotagmin I: a novel Ca2+/phospholipid-binding fold. Cell 1995 Mar 24; 80(6):929-38.

Sutton RB, Fasshauer D, Jahn R, Brunger AT. Crystal structure of a SNARE complex involved in synaptic exocytosis at 2.4 A resolution. Nature. 1998 Sep 24; 395(6700):347-53.