CA2548882A1 - Methods and systems for analyzing and determining ligand-residue interaction - Google Patents

Methods and systems for analyzing and determining ligand-residue interaction Download PDF

Info

Publication number
CA2548882A1
CA2548882A1 CA002548882A CA2548882A CA2548882A1 CA 2548882 A1 CA2548882 A1 CA 2548882A1 CA 002548882 A CA002548882 A CA 002548882A CA 2548882 A CA2548882 A CA 2548882A CA 2548882 A1 CA2548882 A1 CA 2548882A1
Authority
CA
Canada
Prior art keywords
fragment
residue
binding
affinity
molecular
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002548882A
Other languages
French (fr)
Inventor
Stephan Brunner
David Mosenkis
Frank P. Hollinger
William Chiang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Locus Pharmaceuticals Inc
Original Assignee
Locus Pharmaceuticals, Inc.
Stephan Brunner
David Mosenkis
Frank P. Hollinger
William Chiang
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/730,267 external-priority patent/US20050123993A1/en
Application filed by Locus Pharmaceuticals, Inc., Stephan Brunner, David Mosenkis, Frank P. Hollinger, William Chiang filed Critical Locus Pharmaceuticals, Inc.
Publication of CA2548882A1 publication Critical patent/CA2548882A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/569Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
    • G01N33/56911Bacteria
    • G01N33/5695Mycobacteria
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • G16B15/30Drug targeting using structural data; Docking or binding prediction
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/50Molecular design, e.g. of drugs
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C10/00Computational theoretical chemistry, i.e. ICT specially adapted for theoretical aspects of quantum chemistry, molecular mechanics, molecular dynamics or the like

Abstract

A method implemented in the form of a computer simulation code for evaluating the free energy of binding between polypeptide amino acid residues and one or more molecular fragment types is presented. The basis of the method is a novel weighted Metropolis Monte Carlo approach for sampling the grand canonical ensemble. By making use of the properties of the grand canonical ensemble, the affinity of fragments for binding in the vicinity of each protein residue can be efficiently computed. The binding volume associated to each fragment-residue pair is estimated on the basis of a simple proximity criteria, and a useful affinity mapping of the protein surface can be obtained in this way.
The analysis of such data for various fragment types provides valuable information to help identify protein binding sites, as well as to identify key fragments used for building potential drug leads

Description

METHODS AND SYSTEMS FOR ANALYZING AND
DETERMINING LIGAND-RESIDUE INTERACTION
BACKGROUND OF THE INVENTION
Field of the Invention [0001] The present invention relates to computer-implemented methods and systems for analyzing the interaction between polypeptide amino acid residues and one or more molecular fragments. The invention further provides methods and systems for using the information regarding the fragment-polypeptide interaction to aid in drug design.
Related Art [0002] The action of a particular drug is believed to result from the interaction of that drug with a particular molecular target, such as a protein, nucleic acid, or other molecule found in the biological system. Typical protein drug targets include enzymes and receptors (Thomas G., "Medicinal Chemistry - An Introduction" (John Wiley & Sons, Ltd., New York, 2001)).
[0003] In the case of an enzyme, its binding with a drug molecule usually has the effect of interfering with the normal operation of that enzyme. The drug molecule may bind directly within the active site of the enzyme or act indirectly by binding to a so-called allosteric site. Similarly, drugs may act on a receptor by binding to, or near, its surface. This may either activate the receptor, or prevent the binding of its normal substrate to that receptor.
Ultimately, such drug actions can result in a physiological response with the purpose of providing a therapeutic effect.
[0004] The drug's effectiveness will depend on the stability of the drug-enzyme or drug-receptor complex, as well as the number of binding sites occupied by the drug. Other targets for drug action include nucleic acids and other naturally occurring molecules.
[0005] To rationally develop a drug lead, it is therefore desirable to have accurate lenowledge of the binding sites) on the target molecule (e.g., enzyme, _2_ receptor or nucleic acid). One method used for determining protein binding sites is so-called protein mapping, where different molecular probes, typically small organic molecules representing various functional groups, are placed around the protein surface to determine the most favorable binding positions (Dennis et al., PNAS 99:4290-4295 (2002)). Experimental approaches to protein mapping include x-ray crystallography and NMR methods. Both of these approaches have shown that probes, even those generally unrelated to any natural substrate of the protein, bind only to a limited number of positions.
Generally, a pocket of the active site tends to form a consensus site that binds many ligands, regardless of their sizes and polarities.
(0006] Because of major difficulties associated in many cases with co-crystallizing proteins and probes, or using NMR for determining binding sites, a number of methods have been developed to perform mapping computationally rather than experimentally. Examples of such computer codes are the drug design program GRm (Goodford, P.J., J. Med. Chem. 2:849-875 (1985)), or the Multiple Copy Simultaneous Search (MCSS) strategy (Miranker, A. & Karplus, M., Proteiras Struct. Funct. Genet. 11:29-34 (1991);
Caflish, A., et al., J. Med. Chem. 36:2142-2167 (1993); Joseph-McCarthy, D., et al., J. Am. Chem. Soc. 123:12758-12769 (2001)).
[0007] The main problem with the computational approaches referenced above is that they are usually limited to identifying the many local minima along the protein surface of the potential energy field representing the fragment-protein interaction. This data lacks the essential information required for determining which of these minima represents a biologically relevant binding site (Demiis et al., PNAS 99:4290-4295 (2002)). W deed, although computationally more expeditious, energy minimization approaches are unable to correctly estimate free energies of binding, which, as presented further on, is the basic biologically relevant quantity for characterizing the binding affinity of a ligand. To estimate a free energy of binding, information on the actual thermodynamic fragment distributions around the protein, i.e., distributions consistent with thermal fluctuations at physiological temperatures, is required. Such thermodynamic distributions provide information on entropic effects, necessary for free energy calculations.
[0008] Accordingly, improved computational methods are necessary to provide accurate and efficient estimates of the free energy of binding of molecular fragments to protein binding sites, so that high affinity ligands can be designed for these sites.
SUMMARY OF THE INVENTION
[0009] Recognizing the essential need for relevant characterization of the interaction between fragments and polypeptide molecules, the computational method of the present invention estimates the affinity of particular fragment-residue pairs, which enables the identification of key fragment interactions with the protein based on an analysis of computed fragment-residue interactions. When analyzed appropriately as described below, potential binding sites can be identified and the identification of the important fragments, which can be viewed as key pharmacophore elements, are assembled into potential drug leads. These same affinity values also provide a useful numerical convergence criteria, i.e., an assessment of the statistical validity of a given simulation, as well as a quantitative diagnostic to compare the results from different simulations.
[0010] The present invention includes conducting a computer simulation of the interaction between (i) a polypeptide and (ii) at least one type of molecular fragment, wherein a sampling from a thermodynamic ensemble of states of the polypeptide-fragment system is collected; and an affinity value is then computed and assigned to at least one fragment-residue pair when the fragment has a finite probability of being in the vicinity of the residue, wherein the affinity value is a measure of the free energy of interaction betyveen the polypeptide and the fragment; wherein the above calculations are conducted for each type of molecular fragment considered.
[0011] Alteniatively, the invention provides methods and systems for analyzing one or more samplings from a thermodynamically relevant ensemble for a ligand, or fragment, interacting with a residue of a polypeptide or a protein.
[0012] The present invention further provides methods and systems for using the affinity values of the present invention to identify protein binding sites, and help determine the key fragments to be used in constructing ligands for a given polypeptide molecule. Finally, the affinity values can also be used as a numerical convergence criterion for a given simulation, as a diagnostic to compare the results from multiple computer-implemented simulations, and to identify protein binding sites and help determine the key fragments to use in constructing ligands for a given polypeptide.
DETAILED DESCRIPTION OF THE INVENTION
I. Overview [0013] Terms are used herein as generally used in the art, unless otherwise defined herein.
[0014] In one aspect, the present invention provides methods and systems for analyzing the affinity between polypeptide amino acid residues and one or more molecular fragment types. In one embodiment, the present invention includes conducting a computer simulation of the interaction between (i) a polypeptide, and (ii) at least one type of molecular fragment, wherein a sampling from a thermodynamic ensemble of states of the polypeptide-fragment system is collected; and an affinity value is then computed and assigned to at least one fragment-residue pair when the fragment has a finite probability of being in the vicinity of the residue, wherein the affinity value is a measure of the free energy of interaction between the polypeptide and the fragment; wherein the above calculations are conducted for each type of molecular fragment considered.
[0015] Alternatively, the invention provides methods and systems for analyzing one or more samplings from a thermodynamically relevant ensemble for a ligand, or fragment, interacting with a residue of a polypeptide or a protein.
[0016] As used herein, the term "polypeptide" encompasses a molecule comprised of amino acid molecules linked by peptide bonds, and includes all such molecules, regardless of the number of amino acids in the molecule. The term polypeptide, as used herein, also includes molecules which include other moieties in addition to amino acids, an example being glycosylated polypeptides such as antibodies. The term polypeptide, as used herein, also includes protein molecules which consist of more than one chain of amino acids linked by peptide bonds; the multiple chains may be covalently bonded to each other by means of disulfide side-chain bonds.
[0017] "Fragments," as the term is used herein, includes molecules or molecular fragments (e.g. radicals) that can be used to model one or more types of interaction with a macromolecule, such as the interactions of carbonyls, hydroxyls, amides, hydrocarbons, and the like. Examples of useful fragments include, but are not limited to:
Name Structure Acetone __ CH3(C=O)CH3 Aldehyde H(C=O)-CH3 Amide H(C=O)NHZ
Ammonia NH3 Benzene Carboxylic Acid CH3COOH
1,4-Diazine N
N
Ester CH3-O-(C=O)-CH3 Ether CH3-O-CH3 Name Structure Formaldehyde HzC=O
Furan O
Imidazole N
NH
Methane CH4 Methanol CH30H
Phospho-Acid O
HO P OH
OH
Pyridine N
Pyrimidine N
N
Thiol CH3SH
Thiophene S
[0018] The fragments are preferably selected to represent chemical features that have proven useful in the design of pharmaceuticals or other bioactive chemicals. Additional possible fragment types of interest will be readily apparent to one skilled in the art.
[0019] A database of organic fragments relevant for drug discovery has been compiled by extracting organic fragments from molecules published in 1) the Jou~yaal of Medicinal Chefnistry from 1991-2001, 2) Jou~~raal of Heterocyclic ChemistYy from 1981-2001, 3) Medicinal Research Reviews from 1991-2001 and 4) heterocyclic chemistry text boolcs (for example, Eicher, T.;

_7_ Hauptmann, S. The Chemistry of Heterocycles; Th ieme Organic Claenaistry Monograph Series: 1995) and other journals and texts covering biologically active molecules. The compiled database is regularly augmented with new fragments from the literature, as well as new fragments tailored in an iterative process for a specific target macromolecule thanks to information obtained from previous simulations of the type described herein, as well as new fragments resulting from modifications that a chemist would consider for issues such as synthetic tractability.
[0020] In one aspect of the methods of the present invention, a computer simulation of the interaction between a polypeptide and at least one type of molecular fragment is conducted, wherein at least one sampling of states from a thermodynamic ensemble representing the polypeptide-fragment system is collected. In one aspect of the present invention the ensemble sampling of the protein-fragment system is obtained through a Metropolis Monte Carlo-type method. (Metropolis, N., et al., J. Chem. Physics 21:1087-1092 (1953), U.S.
Patent No. 6,735,530). Such a computation is repeated for a large collection of different organic fragment types with diverse physico-chemical properties.
The number of fragment types can be in the hundreds to thousands.
[0021] For each sampled state of the rigid fragment a set of attributes is saved, including the relative position and orientation with respect to the protein, as well as the potential energy of interaction between the fragment and the protein. The fragment's position can be characterized by the coordinates (x,y,z) of its center of mass, while its orientation is conveniently represented by a unit quaternion q.
[0022] This Monte Carlo data for the different fragment types is analyzed for identifying potential binding sites using the methods of the present invention.
These tools are based on the postulate that a binding site must be a localized high affinity region for a diverse collection of fragment types, i.e., fragments with different physico-chemical properties. In one aspect the binding site may also be determined by the ability of a diverse collection of fragment types to be coincident in a region of the protein where bound water molecules can _g_ freely exchange with bulk water. In one aspect, additional experimental binding site data, such as co-crystal X-ray data and/or residue mutational analysis, if available, is used to help in determining the final site within which the leads are designed.
[0023] The actual relevant thermodynamic fragment distributions around the protein, i.e., distributions consistent with thermal fluctuations at physiological temperatures, can be computed numerically using a Metropolis Monte Carlo approach (Metropolis, N., et al., J. Chem. Physics 21:1087-1092 (1953)).
Information on the thermodynamic distribution is essential for computing free energies of binding, which is the basic biologically relevant quantity for quantifying the binding affinity of a ligand. By contrast, the MOSS approach (Miranker, A. & I~arplus, M., Proteins Struct. Funct. Genet. 11:29-34 (1991);
Caflish, A., et al., J. Med. Chem. 36:2142-2167 (1993); Joseph-McCarthy, D., et al., J. Am. Chem. Soc. 123:12758-12769 (2001)), for example, is essentially based on an energy minimization approach, providing fragment states corresponding to various local minima of the fragment-protein interaction potential energy field. Such a procedure is computationally more expeditious than computing the actual physical distributions, but is unable to provide information on entropic effects, essential for free energy estimates.
II. Process A. Computer Simulation Methods [0024] In aspects of the methods of the present invention, a computer simulation of the interaction between a polypeptide and at least one molecular fragment is conducted, wherein at least one sampling of states from a thermodynamic ensemble representing the polypeptide-fragment system is collected. In aspects of the present invention, the ensemble sampling of the protein-fragment system is obtained through a Metropolis Monte Carlo-type method.
[0025] The computer simulation methods of the present invention provide a measure of the of the free energy of binding between the polypeptide and the molecular fragment. One such method is described in U.S. Patent No.
6,735,530, which is hereby incorporated by reference in its entirety.
Modifications to such method that would be readily apparent to one of skill in the art can also be used in the methods of the present invention.
[0026] In embodiments, the computer simulation methods of the present invention determine a measure of the chemical potential, defined as B~,;t,~al ("B~"). B~ is defined as the minimum chemical potential value (referred to as "B" in U.S. Patent No. 6,735,530) for which a particular fragment is persistently observed in the vicinity of a residue, wherein B is related to the excess chemical potential of the system according to the relation B = ~,'/kT +
In<N>, where ~,' is the excess chemical potential, k is Boltzmann's constant, T
is the absolute temperature, and <N> is the average number of molecular fragments in the simulation.
[0027] In further embodiments of the present invention, a particular type of fragment is then considered to be persistently observed in the vicinity of a residue when the average number of fragments in the vicinity of the residue is greater than or equal to 0.8. In a particular aspect of the present invention, a given type of fragment is considered to be persistently observed in the vicinity of a residue when the average number of fragments in the vicinity is greater than or equal to 0.9.
[0028] The B~ value that is assigned to any particular fragment-residue pair is an estimate of the fragment's free energy of binding for a binding site on the polypeptide in the vicinity of the considered residue. These affinity values thus attempt to account for both enthalpic and entropic contributions.
[0029] Comparing sets of B~ values for different fragment types is valuable to help identify protein binding sites as follows: a binding site is identified as a set of neighboring residues with low B~ values (high affinity) for multiple fragments with diverse physico-chemical properties. This approach is based on the assumption that diverse interactions in a localized region are the necessary condition for ensuring the specificity of a binding site. This numerical localization of binding sites is preferably, but not necessarily, complemented by experimental binding information, such as co-crystal X-ray data, mutational analysis or other approaches known to one skilled in the art.
[0030] In embodiments of the present invention, a computer simulation using water as the fragment is conducted and sites that tightly bind the water fragments are eliminated as potential binding sites. Thus, the organic fragments must demonstrate the ability to out-compete water in a particular site in order for that site to be identified as a potential ligand binding site.
[0031] Compared to the above described residue-based proximity criteria, more detailed calculations of the binding mode volumes OVb can be used to provide more accurate estimates of the free energy of binding. Such improved binding mode volume estimates are determined by identifying "clumps" in the fragment distribution. This can be achieved by clustering sampled fragment states belonging to the same potential energy well. For this purpose one makes use of the potential energies saved for the sampled fragment states.
B. Binding Analysis [0032] A first estimate of the binding affinity of a given fragment for different regions on the protein surface can be obtained by assigning a critical chemical potential (B~) to each fragment-residue pair. Such a chemical potential can be calculated from the thermodynamic ensemble data by using the method described in U.S. Patent No. 6,735,530 in the case of a binding volume OVb defined for each residue according to the following proximity criteria:
a fragment state is considered to be in proximity of a given residue if at least one fragment-protein atom pair (a, b) is such that ~"aa ~ a ~Ryaw,a '~- Rydw>a ~~ (33) where r~b is the distance between the two atoms, RvaW is the Van der Waals radius and a is a numerical parameter. In an embodiment, a is between 0.5 and 2.0, and typically chosen to be 1.2. In one aspect the Van der Waals radius is half the Lennard-Jones parameter 6 from the considered molecular-mechanics force-field used for the Monte Carlo simulation. In an aspect of the present invention, the molecular mechanics force field is selected from one of the group consisting of MM2, MM3, MM4, OPLS, OPLS-AA, AMBER, GROMOS, CHARMM, Xplor, Discover, MMFF and Tripos and others known by those skilled in the art. AMBER is a particularly preferred force field.
(Reviews ih Co~iputatiohal Chemistry, hol 16, Lipkowitz and Boyd, eds., John Wiley & Sons, New York, New York, 2000).
[0033] In another aspect of the present invention, following the computer simulation of the interaction between the polypeptide and at least one fragment type, and the assignment of affinity values to each fragment-residue pair, a binding analysis profile is outputted that comprises a matrix of B
values for each fragment-residue pair.
[0034] In an embodiment of the present invention, numerous separate computer simulations are conducted on a particular polypeptide, wherein in each simulation a different fragment type interacts with the protein. For example, a simulation of polypeptide A is conducted with fragment X, wherein interaction energies are calculated, and affinity values B~ assigned to each fragment-residue pair as described above. A computer simulation of polypeptide A is then conducted with fragment Y, wherein interaction energies are calculated, and affinity values assigned to each fragment-residue pair as described above, etc.
[0035] When multiple simulations are conducted for a given polypeptide, a separate affinity value matrix can be generated for each fragment type. In this way the output enables a ranking of the residues with respect to average fragment-binding affinity for a given residue. For example, a matrix of affinity values can be generated which represent averages over various fragment families (for example, polar, aliphatic, heterocyclic, etc.), and the polypeptide surface can then be coded according to these average fragment-residue binding affinities. For visualization purposes, residues with higher and lower fragment binding affinity values can be color-coded accordingly. For example, residues with high average fragment affinity can be displayed in various degrees of red, while the residues with low average fragment affinity are represented in light to dark blue. Other related coloring schemes can be used which are known by those skilled in the art. Such a color-coding of the three-dimensional rendering of the protein provides an efficient way to highlight the high affinity regions, i.e., the potential binding sites, of the protein.
[0036] The residue-fragment affinity can also be used to identify key fragments which can be used to design ligands, i.e., potential drug candidates.
For one or more selected residues, molecular fragments can be ranked according to their affinity value. For example, for a selected residue, the molecular fragments can be listed in ascending or descending order of their B
values. Similarly, in an embodiment, the invention allows the display of, for each fragment, a table of residues that highlights the residues on the protein surface for which a particular fragment has the highest affinity. The results presented in such a table can again be visualized by appropriate color-coding of the three-dimensional rendering of the protein.
[0037] The fragment-residue affinity values can also be used as a numerical convergence criteria of the Monte Carlo simulation. For example, a matrix of B-critical values derived from all sampling data collected can be saved for each fragment-residue pair (an "affinity profile") at successive intervals along the Monte Carlo simulation. Convergence is considered to be achieved when the affinity profile remains invariant within a consistent range of statistical variation.
[0038] The fragment-residue affinity can also be used to measure the extent to which different simulation implementations of the same physical system give statistically the same or different results.
[0039] The residue-fragment affinity values can also be used to identify key fragments that can be used to design ligands (i.e. drug candidates). For one or more selected residues, molecular fragments can be ranked according to affinity value. For example, for a selected residue, the molecular fragments can be listed in ascending or descending order of residue-fragment affinity.

Similarly, in an embodiment, the invention enables the display of a table of residues for each fragment that highlights the regions on the protein for which the fragment has the highest affinity.
[0040] The present invention is described in fiuther detail in the following non-limiting examples.
EXAMPLES
[0041] The following data in Table 1 was generated from a simulation conducted according to the methods of the present invention on the protein Caspase-3. Amino acid residues are listed on the left hand side, while different fragment types are listed at the top. The binding affinities B
associated with the fragment-residue pairs are listed.
Table 1 - Fragment Binding Affinity for Caspase-3 acet-acetoneben-carbo-dimethylethanolimida-iso- pyrimi-tetra-ureaHz0 amide zenexylicsulfoxide zole butanedinehydro-acid furan -AO

ASN 0 -14.5300 0 -19.1710 -27.3070 0 -12.976-22.5280 A

A

TYR -21.0980 -6.8080 0 0 -21.3070 0 0 -22.5280 A

LYS 0 0 -6.8080 0 0 0 0 0 0 0 0 A

A

A

TYR 0 0 0 0 0 0 0 0 -12.4720 0 0 A
GLU 0 0 0 -17.5930 -18.2330 0 0 0 -19.528-19 A
A
A
A
A
A
A -A

acet-acetoneben-carbo-dimelhylethanolimida-iso- pyrimi-tetra-ureaHZO

amide zenexylicsulfoxide zoie butanedine hydro-acid furan LYS0 0 0 0 0 0 0 _ 0 0 0 0 A

A

A

A

A

A

A

MET-31.098-14.5300 -28.593-29.171-23.233-22.3070 0 -13.976-30.5280 THR0 0 0 0 0 0 0 0 -11.4720 0 0 A

SER0 0 0 -12.5930 0 0 0 -11.4720 0 0 A

ARG-31.098-30.530-5.808-28.593-29.171-26.233-35.307-6.330-17.472-23.976-34.528-A

A

A

A

A

A

A

A

A

A

A

-A

A

A

A

A

A

A

GLU0 0 0 -12.5930 0 0 0 0 0 0 -18 A

A

ARG0 0 0 0 0 0 0 0 -10.4720 0 0 A

A

LYS-27.098-16.5300 0 -21.1710 -30.3070 -12.472-12.976-26.5280 A

ASN0 0 0 0 -16.1710 0 0 -12.4720 0 0 A

A

LEU-27.098-15.5300 0 -17.1710 -30.3070 0 -11,976-26.5280 A

ARG0 0 0 0 0 0 0 0 -9.4720 0 0 A

I D 0 -15.593-16.171-13.2330 0 D -A ~ ( ~ ~ ~
94 ~ ~

GLU-27.098-18.5300 0 -21.1710 -30.3070 0 -12.976-26.5280 acet-acetoneben- carbo-dimethylethanolimida-iso- pyrimi-tetra-ureaHz0 amide zene xylicsulfoxide zole butanedinehydro-add furan A -' VAL -18.098-17.5300 0 -24.1710 0 0 0 0 -17.5280 A

GLU -22.098-13.5300 -21.593-17.171-23.233-30.3070 0 0 -25.528-27 A

LEU 0 0 0 0 -16.1710 -23.3070 -10.4720 0 0 A

A

ARG -18.098-17.5300 -15.593-24.171-23.2330 0 0 -9,976-25.528-27 A

ASP -21.0980 0 -21.5930 -23.233-30.3070 0 0 -25.528-27 A

A

SER 0 -16.5300 0 -18.1710 -24.3070 -10.472-14.9760 0 A

LYS -22.098-16.5300 -21.593-18.171-23.233-30.3070 -10.472-14.976-25.528-A

GLU -21.098-14.5300 0 -18.171-12.233-24.3070 -10.472-14.9760 0 A

ASP 0 -16.530-5.808-16.593-18.171-18.233-24.3070 0 -14.976-19.5280 A

A

A

LYS 0 0 0 -17.5930 -18.2330 0 0 0 -19.528-17 A

ARG 0 0 0 -17.5930 -18.2330 0 0 0 -19.528-19 A

A

A

A

A

A

A

A

A

SER -26.098-14.530-5.808-28.593-27.171-28.233-35.307-6.330-17.472-23.976-34.528-23 A

HIP -31.098-20.530-5.808-28.593-29.171-23.233-35.307-6.330-17.472-23.976-30.528-26 A

GLY -31.098-20.5300 -28.593-29.171-23.233-22.3070 -14.472-14.976-30.528-A

GLU 0 -19.5300 0 -23.1710 0 0 0 0 0 0 A

GLU 0 0 0 0 0 0 0 0 -13.4720 0 0 A

A

PHE 0 -19.5300 0 -29.1710 0 0 0 0 0 0 A

A

A

A

A

PRO 0 0 0 0 0 0 0 0 -9.4720 0 0 A

VAL 0 0 0 0 0 0 0 0 -9.4720 0 0 A

ASP 0 0 0 0 0 0 0 0 -9.4720 0 0 A

A

I 0 0 -7.808-13.5930 1.233-25.3070 0 0 -23.528-21 ~ ~ ~

A ~ ~

~YS -18.098-17.530-5.8080 -24.171~0 0 0 0 -9 -17 0 ~ ~ ~ ~ ~ - 976 528 ~

acet-acetoneben- carbo-dimethylethanolimida-iso- pyrimi-tetra-ureaH20 amide zene xylicsulFoxide zole butanedinehydro-acid furan A

A

ASN 0 0 -5.8080 -'16.1710 -25.3070 0 0 -23.5280 A

PHE 0 0 -5.8080 0 0 0 0 0 0 0 0 A

A

ARG -24.098-13.530-5.8080 -16.171-15.233-21.3070 0 -9.976-18.528-18 A

A

ASP 0 0 0 -14.5930 -13.2330 0 0 0 0 0 A

ARG 0 -13.530-5.808-14.593-16.171-13.2330 0 0 0 0 0 A

CYS 0 0 0 0 -18.171-12.233-24.3070 0 -14.9760 0 A

ARG -21.098-16.5300 0 -18.171-12.233-24.3070 -10.472-14.9760 0 A

SER -21.098-16.5300 0 -18.171-12.233-24.3070 -10.472-14.9760 0 A

A

A

GLY -21.0980 -6.8080 0 -12.233-22.3070 -13.472-11.976-22.528-19 A

A

A

A

A

A

GLN -26.098-13.530-5.8080 -16.1710 -29.307-6.3300 -23.976-26.528-15 A

ALA -31.098-30.530-5.808-28.593-29.171-26.233-29.307-6.3300 -23.976-34.5280 A

CYM -31.098-30.530-5.808-28.593-29.171-23.233-35.307-6.330-17.472-23.976-34.528-26 A

ARG 0 0 0 0 0 0 0 0 -14.4720 0 0 A

A

A

GLU -26.098-16.5300 0 -25.1710 -30.3070 -10.472-10.976-25.5280 A

LEU 0 0 0 0 -25.1710 0 0 0 -10.9760 0 A

A

A

A

A

A

EO

HID 0 -15.5300 -19.593-18.1710 -19.3070 0 0 -18.528-16 E

LYS 0 0 0 -15.593-18.171-12.2330 0 0 0 0 -16 E

ILE -21.098-15.5300 0 -20.1710 -22.3070 -13.472-11.976-23.5280 E

E

VAL 0 0 -5.8080 0 ~ 0 0 0 0 0 ~ ~ ~ ~

E ~ ~ I ~

ASP 0 0 -7.8080 0 / -11.233- 0 0 0 0 0 I ( I ~ ~ ~ 0 acet-acetoneben-carbo-dimethylethanol~ iso- pyrimi-tetra-ureaHZO
imida-amide zenexylicsulfoxide sole butanedinehydro-acid furan E

E

ASP-21.0980 0 0 0 0 -21.3070 -13.4720 -22.528-19 E

E

E

TYR0 0 -7.8080 0 -11.2330 0 0 0 0 0 E

E

E

E

E

-E

PRO0 0 0 0 0 0 0 0 -14.4720 0 0 E

E

TYR0 0 0 0 0 0 0 0 0 -9.9760 0 -E

TYR0 0 0 -12.593-29.1710 0 0 -11.472-12.9760 0 E

SER-31.098-30.5300 -28.593-29.171-26.233-35.307-6.330-17.472-23.976-34.528-E

TRP-26.0980 0 0 0 0 0 -6.330-11.4720 -26.5280 E

ARG-31.098-30.530-5.808-28.593-29.171-26.233-35.307-6.330-17.472-23.976-34.528-E

ASN0 0 0 0 0 0 0 0 0 -9.9760 0 E

E

LYS0 0 0 -13.5930 0 0 0 0 0 0 0 E

ASP0 0 0 -13.5930 0 0 0 0 0 0 0 E

E

E

TRP0 0 0 0 0 0 0 0 0 -9.9760 0 E

E

E

E

E

E

E

E

E

E

E

GLN0 0 0 0 0 0 -29.3070 -11.4720 0 -18 E

TYR0 0 0 0 0 0 -29.3070 -11.4720 0 -18 E

E

ASP0 0 0 0 0 0 -29.3070 -11.4720 0 -18 E

LYS0 0 0 0 0 0 -29.3070 -12.4720 0 -18 E

LEU-18.098-13.5300 -16.593-17.171-15.233-21.3070 -12.4720 -20.528-16 E

E

PHE0 0 0 0 0 ~ 0 0 0 0 0 0 0 ~ ( ~ ~

E

I I I I I

acel-acetoneben- carbo-dimelhylethanolimida-iso- pyrimi-tetra-urea HZO

amide zene xylicsulfoxide zolebutanedinehydro-acid furan E

HIE 0 0 0 -16.5930 0 0 0 0 0 0 -26 E

E

E

E

ARG -18.098-13.5300 -16.593-17.171-15.233-22.3070 -12.472-9.976-20.528-E

E

E

ARG -22.098-16.530-5.8080 -20.1710 -24.307-4.330-12.472-15.976-19.528-23 E

LYS -18.0980 0 0 0 0 -24.3070 -9.4720 -19.5280 E

E

E

THR 0 -15.5300 -16.593-20.1710 0 -4.330-11.472-15.9760 0 E

GLU -22.098-16.5300 0 -19.1710 -24.3070 -12.4720 -19.5280 E

E

GLU 0 0 0 -16.5930 0 0 0 0 -9.9760 0 E

SER 0 0 0 0 0 0 0 0 0 -9.9760 0 E

E

E

E

E

E

E

E

E

E

E

LYS 0 0 0 -16.5930 0 0 0 0 0 0 0 E

E

E

E

E

E

E

E

MET 0 0 -7.8080 0 -11.2330 0 0 0 0 0 E

E

THR -21.0980 -6.8080 0 0 -21.3070 -13.4720 -22.5280 E

LYS -21.098-15.530-6.8080 -20.171-12.233-22.3070 -13.472-11.976-22.5280 E

GLU -25.098-18.5300 -19.593-23,171-22.233-27.3070 -13.472-12.976-22.528-E

LEU 0 0 0 0 0 '0 0 0 0 0 0 0 E

E

E

~ ~ ~ ~ 330 acet-acetoneben- carbo-dimethylethanolirnida-iso- pyrimi-tetra-urea Ha0 amide zene xylicsulfoxide zolebutanedinehydro-acid furan E

HIE 0 0 0 0 0 0 0 -4.3300 0 0 0 E

E

ACE -18.098-13.5300 -16.593-17.171-15.233-21.3070 -12.4720 -20.528-16 BO

B

B

TYR -23.0980 0 0 0 0 -27.3070 0 0 0 0 B

B

B

B

B

GLU 0 0 0 0 0 0 -22.3070 0 0 0 -17 B

B

B

B

B

B

B

B

LYS -22.0980 0 -15.593-20.171-14.2830 0 0 -11.976-17.5280 B

ASN 0 0 0 0 0 -11.2330 0 0 0 0 0 B

B

B

B

B

B

B

MET -27.0980 0 0 -29.171-22.233-31.307-4.330-16.472-19.9760 0 THR 0 0 0 0 0 0 0 0 -12.4720 0 0 B

SER 0 0 0 0 0 -11.2330 0 0 0 0 0 B

ARG -27.098-16.5300 -21.593-29.171-18.233-35.307-4.330-29.472-19.976-34.528-B

SER -22.0980 0 -15.593-20.171-14.2330 0 0 -11.976-17.5280 B

GLY -22.0980 0 -15.593-20.171-14.2330 0 0 -11.976-17.5280 B

B

ASP -22.0980 0 -15.593-20.171-14.2330 0 0 -11.976-17.5280 B

B

B

B

ASN~O~ 0 0~ 0 0 0 0 0 ~0 0 0 0 ~

B

LEU 0 0 0 0 ~ 0 0 0 0 0 0 0 ~ ~ ~ ~ 0 ~
~

acet-acetoneben-carbo-dimethylethanolimida-iso- pyrimi-tetra-urea Ha0 amide zenexylicacidsulfoxide zoiebutanedine hydrc-furan B

B

B

B

B

B

B

B

B

B

B

ARG0 -15.5300 0 -21.1710 0 0 0 -9.9760 0 B

B

LYS0 -15.5300 0 -21.1710 -19.3070 0 -9.9760 0 B

B

B

LEU0 0 0 0 0 0 -19.3070 0 0 0 0 B

ARG-22.0980 0 0 -20.1710 -29.3070 0 0 -18.528-17 B

GLU-22.098-19.530-5.808-13.593-25.1710 -29.3070 -9.472-13.976-23.528-16 B

GLU0 0 0 0 0 - -19.3070 0 0 0 0 U
--B

B

VAL-23.098-19.5300 -13.593-25.1710 -30.307- -16.472-13.976-19.528-16 B

GLU-23.098-19.5300 -15.593-25.1710 -30.3070 -16.4720 -19.5280 B

LEU0 -13.5300 0 -20.1710 0 0 0 -9,9760 0 B

B

ARG-22.0980 0 -15.5930 0 -30.3070 -16.4720 -19.5280 B

ASP-22.0980 0 -15.5930 0 0 0 0 0 -18.5280 B

B

B

LYS-22.0980 0 -15.5930 0 0 0 0 0 -18.5280 B

B

ASP0 0 0 0 0 0 -21.3070 0 0 0 0 -B

B

B

LYS0 0 0 0 0 0 -22.3070 0 0 0 0 B

ARG0 0 0 0 0 0 -21.3070 0 0 0 -17 B

B

B

B

B

I 0 0 0 0 0 0 -~ O 0 0 0 -B ~ ~ ~ ~
1 ~

acet-acetoneben-carbo-dimethylethanolimida-iso- pyrimi-tetra-ureaHa0 amide zenexylicsuifoxide zolebutanedinehydro-acid furan B

B

B

SER -26.098-16.5300 -21.593-27.171-26.233-35.307-4.330-29.472-19.976-30.528-B

HIP -31.098-25.5300 -27,593-29.171-26.233-35.307-6.330-29.472-19.976-35.528-B

GLY -31.098-25.5300 -17.593-29.171-22.233-31.307- -4.330-16.472-19.976-35.528-21 B

GLU 0 -25.5300 0 -29.1710 0 0 0 0 0 0 B

GLU 0 0 0 0 0 0 0 0 -13.4720 0 0 B

B

B

B

PHE 0 -16.5300 0 0 0 0 0 0 0 0 0 B

B

B

B

B

B

B

ASP 0 0 0 0 -20.1710 0 0 0 0 0 0 B

B

LYS -18.0980 0 0 0 0 -24.3070 -9.4720 -20.5280 B

LYS -24.098-19.530-5.8080 -25.1710 -30.3070 -16.472-13.976-23.528-16 B

B

B

ASN -18.0980 0 0 0 0 -24.3070 -10.4720 -20.5280 B

B

B

ARG -26.098-16.5300 0 -25.1710 -30.3070 -10.472-10.976-25.5280 B

B

B

B

B

B

B

B

B

GLY -23.098-15.530-7.8080 -20.1710 -27.307-4.330-11.472-15.9760 -20 B

B

B

B

B

B

I

I 0 0 I 0 0 I 0 0 0 [ 0 0 0 0 ILE ~ 0 I I I
B I
~

acet-acetoneben-carbo-dimethylethanolimida-iso- pyrimi-tetra-urea H20 amide zenexylicsulfoxide zolebutanedinehydro-acid furan GLN -27.098-16.5300 0 -29.171-18.233-19.307-4.330-17.472-19.976-26.5280 B

ALA -31.098-25.5300 -21.593-29.171-26.233-35.307-4.330-17.472-19.976-34.528-B

CYM -31.098-25.5300 -27.593-29.171-26.233-35.307-6.330-29.472-19.976-34.528-B

ARG 0 0 0 0 0 0 0 0 -14.4720 O 0 B

B

B

GLU -24.098-13.5300 -13.593-16.171-15.233-25.3070 0 -9.976-23.528-21 B

LEU -24.0980 -5.8080 -16.171-15.233-21.3070 0 -9.976O 0 B

8.169 B

B

B

GLU 0 0 0 0 0 0 -19.3070 0 0 -22.5280 B

B

FO

HID 0 -14.530-7.8080 -20.1710 -27.307-4.330-11.472-15.976D 0 F

LYS 0 0 0 -16.5930 0 0 0 0 0 D 0 F

ILE -23.098-14.5300 0 -20.1710 -27.307-4,330-11.472-15.976O -20 F

F

F

F

F

ASP -23.098-14.5300 0 -20.1710 -27.3070 -11.472-15.976O -20 F

F

F

F

F

F

F

F

F

PRO 0 0 -7,808-13.5930 -11,2330 -4.330-12,4720 0 -21 F

GLY 0 0 0 -13.5930 0 0 0 0 0 0 0 F

TYR -24.0980 -7.8080 0 -15.2330 0 0 0 -18.528-21 F

TYR 0 0 0 0 -29.171-11.2330 -4.330-17.472-15.9760 0 F

SER -31.098-25.5300 -21.593-29.171-26.233-35.307-4.330-29.472-19.976-34.528-F

TRP 0 0 0 0 0 0 0 -4.330-17.4720 0 0 F

ARG -31.098-25.530-5.808-21.593-29.171-26.233-35.307-4.330-29.472-19.976-34.5280 F

ASN 0 0 0 0 0 0 0 0 -10.4720 0 0 F

F

F

F

GLY 0 0 0 ~ 0 0 0 0 0 0 0 0 ~ / ~ 0 acet-acetoneben-carbo-dimelhylethanolimida-iso- pyrimi-tetra-urea HZO

amide zenexyiicsulfoxide zciebutanedine hydro-acid furan F

F

TRP 0 0 0 0 0 0 0 0 -10.4720 0 0 F

F

F

F

F

F

F

F

MET 0 0 -6.8080 0 0 0 0 -13.472-13.9760 0 F

F

F

GLN 0 0 0 0 -16.1710 0 0 0 0 0 0 F

F

F

ASP 0 0 0 0 -16.1710 0 0 0 0 -17.5280 F

LYS 0 0 0 0 -16.1710 0 0 0 0 -17.5280 F

LEU -25.098-18.5300 -19.593-19.1710 0 0 -12.4720 -22.5280 F

F

F

F

F

F

F

F

ARG -25.098-16,5300 -19.593-23.171-22.233-27.3070 -13.472-12.976-22.528-F

F

F

ARG -24.098-15.530-8.8080 -20.1710 -26.3070 -13.472-13.976-23.5280 F

LYS -24.098-13.530-5.8080 0 0 -26.3070 -13.472-13.976-20.5280 F

F

F

THR -24.098-15.530-6.808-19.593-18.1710 -19.3070 -13.472-9.976-23.5280 F

GLU -24.098-15.5300 0 -20.1710 -26.3070 -13.472-13.976-23.5280 F

PHE 0 -15.5300 0 0 0 0 0 0 0 0 0 F

GLU 0 -15.5300 -19.593-18.171-12.2330 0 -10.4720 -18.528-16 F

SER 0 0 0 0 0 0 0 0 -10.4720 0 0 F

PHE 0 0 0 0 0 0 0 0 -10.4720 0 0 F

F

F

F

ALA 0 0 0 0 ~ - 0 0 0 0 0 0 0 ~ ~ ~ 0 ~

F

THR 0 0 0 0 ~ 0 0 0 0 0 0 0 0 / ~ ~ ~

acet-acetoneben- carbo-dimethylethanolimida-iso- pyrimi-tetra-ureaHZO

amide zene xylicsulfoxide zolebutanedinehydro-acid furan F

F

F

ALA 0 0 0 -12.5930 -12.2330 0 0 0 0 0 F

F

LYS 0 -15.5300 -19.593-18.1710 0 0 0 0 -18.528-16 F

F

F

F

F

F

F

F

F

F

THR -23.098-14.5300 0 -20.1710 -27.307-4.330-11.472-15.9760 0 F

LYS -23.098-15.530-7.8080 -20.1710 -27.307-4.330-11.472-15.9760 0 F

GLU -18.098-13.5300 -16.593-17.171-15.233-21.3070 0 0 -20.528-26 F

F

TYR 0 -13.5300 0 0 0 0 0 0 0 0 0 F

F

F

F

NME 0 0 0 0 0 ~ 0 0 0 0 0 ~ ~ ~ ~ ~ ~

F

[0042] The following data in Table 2 was generated from a simulation conducted according to the methods of the present invention on the protein Caspase-8. Amino acid residue are listed on the left hand side, while different fragment types are listed at the top. The binding affinities B~ associated with the fragment-residue pairs are listed.
Table 2 - Fragment Binding Affinity for Caspase-8 acet-acetoneben-carbo-dimethylethanolimidazoliso-pyrimi-tetra-urea HZO

amide zenexylicsulfoxide a butanedinehydro-acid furan ACE -27.098-8.5300 0 -11.1710 -14.3070 D -6.976-22.528-17 AO

ASP -27.098-20.530-3.808-12.593-22.171-13.233-30.3070 -14.472-11.976-22.528-A

LYS -27.098-20.530-3.808-12.593-22.171-14.233-30.3070 -14.472-11.976-22.528-A

VAL -13.098-11.530-4.808-12.593-15.171-10.233 2.330-9.472-9.976-19.5280 ~ ~ ~ ~ ~ ~

A 0 ~

acet-acetoneben- carbo-dimethylethanolimidazoliso-pyrimi-tetra-ureaH20 amide zene xylicsulfoxide a butanedine hydro-acid furan TYR -26.098-21.530-4.808-10.593-26.171-16.233-20.307-2.330-11.472-11.976-20-A

GLN -26.098-21.530-4.808-10.593-26.171-11.233-20.307-2.330-11.472-11.976-20-A

MET -13.098-12.5300 -10.593-18.1710 -13.3070 -9.472-8.976-22_5280 A

LYS -22.098-12.530-4.808-13.593-18.171-15.233-25.307-2.330-11.472-7.976-30-A

A

LYS 0 0 0 -13.5930 -15.2330 0 -7.4720 O -12 A

A

ARG 0 0 0 -13.5930 -15.2330 0 0 0 O -12 A

A

TYR 0 0 0 -8.5930 0 0 0 0 0 O 0 A

A

A

A

A

A

A

HID 0 -11.5300 0 0 0 -14.3070 0 0 O 0 A

ASN 0 0 0 0 -- -9.2330 0 0 0 O 0 A

A

ALA 0 0 -3.8080 0 0 0 0 0 0 O 0 A

LYS -15.098-11.530-4.808-8.593-15.171-9.233-17.3070 -15.472-6.976-12_528-11 A

A

A

A

LYS -28.098-12.530-3.808-15.593-11.171-13.233-23.3070 -15.472-8.976-32-528-A

A

A

LYS -14.098-8.5300 -11.593-10.171-12.233-14.307-2.330-9.4720 -17-528-14 A

LEU 0 0 0 0 0 0 0 -2.3300 0 O 0 A

A

A

ILE 0 0 -3.8080 0 0 0 -3.3300 0 O 0 A

ARG 0 0 -8.808-7.5930 -12.233-15.307-2.330-11.4726.976O 0 A

ASP -14.0980 -8.808-7.5930 -12.233-15.3070 -11.4720 -12_5280 A

ARG -30.098-25.530-8.808-22.593-28.171-18.233-32.3070 0 -16.976-31-528-10 A

ASN -14.098-11.5300 -7.5930 -12.233-15.3070 -11.4720 -12-6280 A

GLY 0 -11.5300 0 0 0 0 0 0 0 O 0 A

THR 0 -11.5300 0 0 0 0 0 0 0 O 0 A

HIE 0 -11.5300 0 0 0 -14.3070 0 0 O 0 A

A

A

A

acet-acetoneben- carbo-dimethylethanolimidazoliso-pyrimi-tetra-urea H20 amide zene xylicsulfoxide a butanedine hydro-acid furan A

ALA 0 0 0 0 0 0 0 -2.3300 0 0 0 A

A

A

A

A

A

A

A

A

A

GLU 0 0 0 -9.5930 0 0 0 0 0 0 0 A

A

LYS 0 0 0 -9.5930 0 0 0 0 0 0 0 A

A

A

ASP 0 0 0 0 0 0 -14.307-2,3300 0 0 0 A

ASP -13.098-11.530-3.808-8.593-14.171-9.233-16.3070 -7.4720 -12.528-10 A

A

THR -15.098-11.5300 -15.593-12.1710 -13.3070 -15.4720 0 0 A

A

GLU -28.098-12.5300 -15.593-11.171-13.233-23.3070 0 0 -32.528-10 A

GLN -15.098-11.5300 -8.593-14.171-9.233-18.3070 0 -6.976-12.5280 A

A

GLU -15,098-9.5300 0 -13.1710 -21.3070 0 -6.976-15.5280 A

A

A

LYS -15.098-9.5303.8080 -13.1710 -21.3070 0 -6.976-15.5280 A

ILE 0 0 0 0 0 0 -13.3070 0 -6.976-15.5280 A

A

GLN -13.098-11.5300 -11.593-12.1710 -13.3070 -8.472-8.9760 0 A

LEU -13.098-11.5300 -11.593-12.1710 -13.3070 -8.472-8.9760 0 A

MET -13.098-11.5300 -11.593-12.1710 -13.3070 -8.472-6.9760 0 A

ASP -13.098-11.5300 0 -12.1710 0 0 0 0 0 0 A

A

A

A

A

ASP 0 0 0 0 0 0 -20.3070 -7.4720 -17.528-11 A

A

A

acet-acetoneben- carbo-dimethylethanolimidazoliso-pyrimi-tetra-urea HZO

amide zene xylicsulfoxide a butanedine hydro-acid furan CYS 0 D .0 0 0 0 0 0 0 0 0 0 A

A

A

A

SER -27.098-25.530-8.8080 0 -18.233-25.3070 0 -16.9760 0 A

HIE -26.098-25.530-8.808-22.593-28.171-18.233-32.307-3.3300 -16.9760 0 A

GLY 0 0 0 0 0 0 0 -2.3300 0 0 0 A

ASP -14.098-10.530-3.808-11.593-15.171-11.233-20.3070 -9.472-7.976-17.528-14 A

LYS -23.098-15.530-3.808-8.593-18.171-9.233-26.3070 -10.472-7.976-24.5280 A

GLY 0 0 0 0 0 0 0 -2.3300 0 0 0 A

ILE 0 0 0 -12.5930 0 0 0 0 0 0 0 A

A

TYR 0 0 0 0 0 0 0 -2.3300 0 0 0 A

A

A

ASP -28.098-12.5300 -15.593-11.171-13.233-23.3070 -15.472-8.976-32.528-14 A

A

GLN 0 0 0 -7.593-10.1710 0 0 0 0 0 -11 A

GLU 0 -8.5300 -11.593-10.171-12.233-14.3070 -9.4720 -17.528-11 A

A

A

A

TYR 0 0 0 0 0 0 0 -2.3300 0 0 0 A

GLU -14.0980 0 0 0 -7.233-17.3070 0 0 -16.5280 A

A

A

SER -14.0980 0 0 0 -7.2330 0 0 0 0 0 A

GLN -14.0980 0 0 0 0 -17.3070 0 0 -16.5280 A

A

THR 0 0 0 0 0 -8.2330 0 0 0 0 0 A

GLY -16.0980 0 0 0 -7.2330 -2.3300 0 -13.5280 A

LEU 0 0 0 0 0 0 0 -2.3300 0 0 0 A

LYS -14.098-8.530-3.8080 -11.171-8.233-17.3070 0 0 -16.5280 A

CYS -13.098-11.5300 -11.593-10.1710 0 0 0 -6.9760 0 A

PRO -13.098-11.5300 -11.593-12.1710 -13.3070 -8.472-6.9760 0 A

SER -13.098-11.5300 -11.593-12.1710 -13.3070 -8.472-6.9760 0 A

A

ALA -16.0980 0 0 0 -7.2330 -3.3300 0 -13.528-14 A

GLY -26.098-21.5300 -11.593-26.171-16.233-20.307-2.330-10.472-12.976-20.528-A

LYS -26.098-21.5300 -11.593-26.1710 -20.307-2.330-9.472-9.976-17.528-11 A

A

LYS -16.0980 0 0 0 -7.2330 0 0 0 -13.528-14 A

VAL 0 0 0 0 0 0 0 0 0 0 D n ~ ~ ~

acel-acetoneben- carbo-dimethylethanolimidazoliso- pyrimi-tetra-urea H20 amide zene xylicsulfoxide a butanedinehydro-acid furan A -A

A

GLN -30.0980 -8.808-22.593-28.171-18.233-82.3070 0 -16.976-30.5280 A

ALA -30.098-25.530-8.808-7.593-28.171-18.233-25.3070 0 -16.976-22.5280 A

CYS -30.098-25.530-8.808-22.593-28.171-18.233-32.307-3.3300 -16.976-31.5280 A

A

GLY 0 0 -3.8080 0 0 -21.3070 0 -6.9760 0 A

ASP -23.098-15.530-3.808-8.593-18.171-9.233-26.307-3.330-8.472-7.976-24.5280 A

ASN -13.098-8.530-4.8080 -11.171-7.233-21.3070 -8.472-9.976-11.5280 A

TYR -13.0980 -3.8080 0 -7.233-14.3070 -7.472-6.976-11.5280 A

GLN -13.0980 0 0 0 -7.233-15.307-3.330-10.472-6.976-11.5280 A

LYS -13.098-8.530-4.8080 -10.171-7.233-15.307-2.330-10.472-6.976-11.5280 -A

GLY 0 0 0 0 0 0 0 -3.3300 0 0 0 A

ILE 0 0 -4.8080 0 0 0 -3.3300 -6.9760 0 A

A

VAL -13.098-10.5300 0 0 0 0 -2.3300 0 -11.5280 A

GLU -13.0980 0 0 0 0 0 0 0 0 -11.5280 A

THR -13.0980 0 0 0 0 0 0 0 0 0 0 A

ASP -13.0980 0 0 0 0 0 0 0 0 -11.5280 A

A

ACE 0 0 0 0 -10.1710 0 0 0 0 0 0 BO

B

ARG 0 0 0 0 -11.1710 0 0 0 -6.9760 0 B

TYR 0 0 0 0 0 0 0 -2.3300 0 0 0 B

ILE -16.0980 0 0 -23.171-7.2330 -3.3300 -9.976-13.5280 B

PRO -16.0980 0 0 0 -7.2330 0 0 0 -13.5280 B

ASP -16.0980 0 0 0 -8.2330 -2.3300 0 -13.5280 B

B

ALA -16.0980 0 0 0 -7.2330 0 0 0 -13.528-14 B

ASP -16.0980 0 0 0 -7.2330 0 0 0 -13.528-14 -B

PHE 0 0 0 0 0 0 0 -4.3300 0 0 0 B

B

LEU 0 0 -3.8080 0 0 0 -3.3300 0 0 0 B

B

B

B

B

B

B

ASN 0 0 0 0 0 0 -21.3070 0 0 0 0 CYS -13.0980 0 0 0 ~ 0 0 0 0 0 -11.5280 ~ ~ ( ~ ~ ~

B ~

VAL 0 0 0 0 0 / 0 0 -2.3300 0 0 0 ~ ~ ~ ( ~ ( ~

acet-acetoneben-carbo-dimethylethanolimidazoliso- pyrimi-tetra-urea H20 amide zenexylicsulfoxide a butanedinehydro-acid furan B

SER-30.098-25.530-8.808-22.593-28.171-18.233-32.3070 0 -16.976-31.5280 B

TYR-30.098D -8.8080 0 0 0 -3.3300 -7.9760 0 B

ARG-30.098-25.530-8.808-22.593-28.171-18.233-32.307-3.330-11.472-16.976-31.5280 B

ASN0 -9.5300 0 -10.1710 0 -3.3300 0 -13.5280 B

PRO-14.0980 -8.8080 0 -9.233-15.3070 0 0 -12.528-B

B

GLU0 -9.5300 0 -15.1710 0 0 -15.472-8.976-13.5280 B

GLY0 0 0 0 0 0 0 0 0 0 -13.5280 B

THR0 0 -8.8080 -10.1710 0 0 0 0 0 0 --B

TRP0 -9.530-3.8080 -10.1710 0 -3.330-15.472-8.976-13.5280 B

B

B

GLN-20.098-14.530-4.8080 -18.1710 -22.307-3.330-15.472-9.976-13.5280 B

SER-21.098-19.5300 0 -18.1710 -22.3070 0 0 -20.5280 B

B

GLN-21.098-19.530-3.808-9.593-18.171-12.233-22.307-2.330-12.472-8.976-22.5280 B

B

B

ARG0 0 0 0 0 0 0 -2.3300 0 0 0 B

GLU-19.098-14.5300 0 -21.171-7.233-22.3070 -10.472-10.976-17.528-11 B

ARG0 0 0 0 -18.171-7.2330 0 0 0 -17.528-11 B

B

B

ARG-19.098-14.5300 0 -21.1710 -22.3070 -10.472-10.976-17.5280 B

GLY0 0 0 0 -12.1710 D 0 0 0 0 0 B

ASP-19.098-11.5300 -20.593-13.1710 0 0 -12.472-6.976-14.5280 B

ASP-19.098-11.5300 -20.593-13.171-14.233-14.3070 -12.472-6.976--14.528-15 B

B

B

THR0 0 0 0 -13.1710 0 0 0 0 0 0 B

B

B

GLU0 0 0 0 0 -7.2330 0 0 0 -17.528-11 B

-B

B

TYR-13.0980 0 0 0 -7,2330 -2.3300 0 -11.5280 B

GLU-24.098-19.5300 -14.593-18.171-17.233-26.3070 -12.472-7.976-27.528-17 B

B

SER-13.0980 0 0 -10.1710 -13.307-3.330-10.472-6.976-11.5280 B

ASN-24.098-10.530-4.808-11.593-10.171-9.2330 -2.330-10.472-7.976-27.5280 B

LYS~ -19.530-5.8D8~ -18.171-17.233-26.307-3.330-15.472-11.976-27.528-17 -24.098~ -14.593~ ~ ~
~ ~

acet-acetoneben-carbo-dimethylethanol~ iso- pyrimi-tetra-urea H20 imidazol -amide zenexylicsulfoxide a butanedinehydro-acid furan B

ASP -16.0980 -4.808-12.5930 -7-2330 -2.3300 0 0 0 B

ASP 0 0 0 -7.5930 O 0 0 0 0 0 0 B

LYS -16.098-9.5300 -12.593-11.171-7-2330 0 0 -6.9760 0 B

LYS 0 0 0 -7.593-11.171-7.2330 0 0 0 0 0 B

B

B

LYS -13.098-10.530-0.808-10.593-10.171-7-233-13.307-4.330-10.472-6.976-11.5280 B

B

MET 0 0 0 0 0 O -13.307-3.330-10.4720 0 0 B

B

B

B

B

PHE 0 0 0 0 0 O 0 -2.3300 0 0 0 B

THR 0 0 0 0 0 O 0 -3.3300 0 0 0 B

B

ARG -20.098-14.530-3.808-8.593-23.171-7.233-23.307-2.330-11.472-12.976-16.5280 B

LYS -26.098-21.530-4.808-11.593-26.171-16-233-21.307-3.330-14.472-13.976-20.528-15 B

LYS -21.098-13.530-3.808-20.593-13.171-14-233-14.307-2.330-12.472-8.976-14.528-B

B

VAL 0 0 0 -10.5930 O 0 0 0 0 -19.5280 B

B

PRO -22.098-12.5300 -10.593-16.171-7-233-22.307-2.330-9.4720 -30.5280 B

B

ASP -22.098-10.5300 -13.593-15.171-15-233-25.3070 -9.4720 -30.528-14 ( ~ ~ ( [0043] While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example, and not limitation. It will be apparent to persons skilled in the relevant art that various changes in detail can be made therein without departing from the spirit and scope of the invention. Thus the present invention should not be limited by any of the above-described exemplary embodiments.
[0044] All references and publications referred to herein are hereby incorporated by reference in their entirety_

Claims (18)

1. A method for analyzing the binding affinity between polypeptide amino acid residues and one or more molecular fragments, comprising:
(a) conducting a computer simulation of (i) a polypeptide, and (ii) at least one molecular fragment, wherein a sampling from a thermodynamically relevant ensemble of states between the polypeptide and each molecular fragment is collected; and (b) assigning a binding affinity value to at least one fragment-residue pair when said fragment has a finite probability to be in the vicinity of the residue, wherein said affinity value is an estimate of the free energy of binding between the polypeptide and the fragment;
wherein (a) and (b) are conducted for each molecular fragment considered in the computer simulation.
2. The method of claim 1, wherein said at least one fragment is defined as having a finite probability to be in the vicinity of a residue when at least one pair of fragment-residue atoms is within a predetermined threshold distance, wherein said threshold distance is based on the sum of the Van der Waals radii of said fragment-residue atoms.
3. The method of claim 2, wherein said predetermined threshold distance is defined as:
r ab < .alpha. (R VdW,a + R VdW,b), wherein r ab is the distance between the two atoms, R VdW is the Van der Waals radius and .alpha. is a numerical parameter.
4. The method of claim 3, wherein said .alpha. is between about 0.5 and about 2Ø
5. The method of claim 4, wherein said .alpha. is about 1.2.
6. The method of claim 3, wherein said Van der Waals radius is about half the Lennard-Jones parameter .sigma. from the molecular-mechanics force-field considered for modeling the interaction between the polypeptide and the fragment.
7. The method of claim 6, wherein said molecular mechanics force field is selected from the group consisting of MM2, MM3, MM4, AMBER, OPLS, OPLS-AA, GROMOS, CHARMM, Xplor, Discover, MMFF and Tripos.
8. The method of claim 7, wherein said molecular mechanics force field is the AMBER force field.
9. The method of claim 1, wherein in said computer simulation fragment-fragment interactions are retained, and multiple simulations are performed at multiple values of B, and said affinity value is estimated by B-critical, wherein B-critical is defined as the minimum B value for which a particular fragment is persistently observed in the vicinity of a residue, wherein B is related to the excess chemical potential of the system according to the relation B = µ'/kT
+
ln<N>, where µ' is the excess chemical potential, k is the Boltzmann's constant, T is the absolute temperature, and <N> is the average number of molecular fragments in the simulation.
10. The method of claim 9, wherein a fragment is persistently observed in the vicinity of a residue when the average number of fragments in the vicinity of the residue is greater than or equal to a fixed value between 0.8 and 1Ø
11. The method of claim 10, wherein a particular fragment is persistently observed in the vicinity of a residue when the average number of fragments in the vicinity is greater than or equal to 0.9.
12. The method of claim 1, further comprising outputting a binding analysis profile, wherein said binding analysis profile comprises a matrix of affinity values for each fragment-residue pair.
13. The method of claim 1, wherein (a) and (b) are repeated for a plurality of fragment types.
14. The method of claim 13, wherein a matrix of affinity values are averaged over a plurality of fragment families and the polypeptide surface is coded according to these binding affinity values.
15. The method of claim 14, wherein residues with highest fragment binding affinity values are displayed with a different color from the residues with the lowest affinity value.
16. The method of claim 15, wherein sets of neighboring residues displaying high fragment binding affinities are identified as potential binding sites of the protein.
17. The method of claim 16, wherein sets of neighboring residues that tightly bind water are eliminated as potential binding sites of the protein.
18. The method of claim 14, wherein residue-fragment affinity values are used to identify key fragments, wherein key fragments are assembled into larger ligand molecules.
CA002548882A 2003-12-09 2004-12-08 Methods and systems for analyzing and determining ligand-residue interaction Abandoned CA2548882A1 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US10/730,267 2003-12-09
US10/730,267 US20050123993A1 (en) 2003-12-09 2003-12-09 Methods of determining ligand residue binding affinity
US10/920,234 US7415361B2 (en) 2003-12-09 2004-08-18 Methods and systems for analyzing and determining ligand-residue interaction
US10/920,234 2004-08-18
PCT/US2004/041016 WO2005056814A2 (en) 2003-12-09 2004-12-08 Methods and systems for analyzing and determining ligand-residue interaction

Publications (1)

Publication Number Publication Date
CA2548882A1 true CA2548882A1 (en) 2005-06-23

Family

ID=34681738

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002548882A Abandoned CA2548882A1 (en) 2003-12-09 2004-12-08 Methods and systems for analyzing and determining ligand-residue interaction

Country Status (4)

Country Link
US (1) US7415361B2 (en)
EP (1) EP1704493A4 (en)
CA (1) CA2548882A1 (en)
WO (1) WO2005056814A2 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040267456A1 (en) * 2003-06-27 2004-12-30 Stephan Brunner Method and computer program product for drug discovery using weighted grand canonical metropolis Monte Carlo sampling
US20050222776A1 (en) * 2004-03-31 2005-10-06 Locus Pharmaceuticals, Inc. Method for fragment preparation
EP1763814A4 (en) * 2004-06-07 2008-12-10 Locus Pharmaceuticals Inc Identification of ligands for macromolecules
US20110130968A1 (en) * 2009-11-29 2011-06-02 Matthew Clark Method for computing ligand - host binding free energies
US20160306947A1 (en) * 2013-10-23 2016-10-20 Dow Global Technologies Llc Methods, systems, and devices for designing molecules

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5453937A (en) * 1993-04-28 1995-09-26 Immunex Corporation Method and system for protein modeling
US5884230A (en) * 1993-04-28 1999-03-16 Immunex Corporation Method and system for protein modeling
US5600571A (en) * 1994-01-18 1997-02-04 The Trustees Of Columbia University In The City Of New York Method for determining protein tertiary structure
US6341256B1 (en) * 1995-03-31 2002-01-22 Curagen Corporation Consensus configurational bias Monte Carlo method and system for pharmacophore structure determination
AU5721596A (en) 1995-04-27 1996-11-18 Mount Sinai School Of Medicine Of The City University Of New York, The Method for identifying structurally active compounds using c onformational memories
US6251620B1 (en) * 1995-08-30 2001-06-26 Ariad Pharmaceuticals, Inc. Three dimensional structure of a ZAP tyrosine protein kinase fragment and modeling methods
EA199700087A1 (en) 1995-10-30 1998-04-30 Смитклайн Бичам Корпорейшн METHOD OF INHIBITING KATEPSIN K
US6622094B2 (en) * 1996-02-15 2003-09-16 The Trustees Of Columbia University In The City Of New York Method for determining relative energies of two or more different molecules
US6083711A (en) 1996-05-15 2000-07-04 Smithkline Beecham Corporation Proteases compositions capable of binding to said site, and methods of use thereof
GB9616105D0 (en) * 1996-07-31 1996-09-11 Univ Kingston TrkA binding site of NGF
US20020055536A1 (en) * 1996-09-26 2002-05-09 Dewitte Robert S. System and method for structure-based drug design that includes accurate prediction of binding free energy
US5854992A (en) * 1996-09-26 1998-12-29 President And Fellows Of Harvard College System and method for structure-based drug design that includes accurate prediction of binding free energy
US6178384B1 (en) * 1997-09-29 2001-01-23 The Trustees Of Columbia University In The City Of New York Method and apparatus for selecting a molecule based on conformational free energy
US6426205B1 (en) * 1997-10-24 2002-07-30 Mount Sinai Hospital Corporation Methods and compositions for modulating ubiquitin dependent proteolysis
US6735530B1 (en) 1998-09-23 2004-05-11 Sarnoff Corporation Computational protein probing to identify binding sites
US6489608B1 (en) * 1999-04-06 2002-12-03 Micromass Limited Method of determining peptide sequences by mass spectrometry
US6716614B1 (en) * 1999-09-02 2004-04-06 Lexicon Genetics Incorporated Human calcium dependent proteases, polynucleotides encoding the same, and uses thereof
US6640191B1 (en) * 1999-12-30 2003-10-28 The Regents Of The University Of California Library design in combinatorial chemistry by Monte Carlo methods
EP1272839A4 (en) * 2000-03-23 2006-03-01 California Inst Of Techn Method and apparatus for predicting ligand binding interactions
WO2004078932A2 (en) 2003-03-03 2004-09-16 Locus Pharmaceuticals, Inc. Methods and systems for preparing virtual representations of molecules
US20040267456A1 (en) * 2003-06-27 2004-12-30 Stephan Brunner Method and computer program product for drug discovery using weighted grand canonical metropolis Monte Carlo sampling
WO2005001645A2 (en) * 2003-06-27 2005-01-06 Locus Pharmaceuticals, Inc. Method and computer program product for drug discovery using weighted grand canonical metropolis monte carlo sampling

Also Published As

Publication number Publication date
EP1704493A4 (en) 2008-09-10
US20050123995A1 (en) 2005-06-09
EP1704493A2 (en) 2006-09-27
WO2005056814A2 (en) 2005-06-23
US7415361B2 (en) 2008-08-19
WO2005056814A3 (en) 2005-10-06

Similar Documents

Publication Publication Date Title
Roy et al. Recognizing protein-ligand binding sites by global structural alignment and local geometry refinement
An et al. Pocketome via comprehensive identification and classification of ligand binding envelopes
Koehl Protein structure similarities
Davis et al. Application and limitations of X‐ray crystallographic data in structure‐based ligand and drug design
Verma et al. 3D-QSAR in drug design-a review
Laurie et al. Q-SiteFinder: an energy-based method for the prediction of protein–ligand binding sites
Tripathi et al. A novel and efficient tool for locating and characterizing protein cavities and binding sites
Gerstel et al. Identifying and quantifying radiation damage at the atomic level
Chruszcz et al. Unmet challenges of structural genomics
Wieman et al. Homology-based modelling of targets for rational drug design
US20180300447A1 (en) Method for binding site identification by molecular dynamics simulation (silcs: site identification by ligand competitive saturation)
KR20070045141A (en) Method and device for partitioning a molecule
WO2005038618A2 (en) Lead molecule cross-reaction prediction and optimization system
Aprahamian et al. Utility of covalent labeling mass spectrometry data in protein structure prediction with Rosetta
Mehmood et al. Bringing structural implications and deep learning-based drug identification for KRAS mutants
Trisciuzzi et al. Analysis of solvent-exposed and buried co-crystallized ligands: A case study to support the design of novel protein–protein interaction inhibitors
Campagna-Slater et al. Pharmacophore screening of the protein data bank for specific binding site chemistry
Gilberg et al. X-ray-structure-based identification of compounds with activity against targets from different families and generation of templates for multitarget ligand design
Tarcsay et al. Homology modeling and binding site assessment of the human P-glycoprotein
US20050123993A1 (en) Methods of determining ligand residue binding affinity
Fujimoto et al. Machine-Learning-and Knowledge-Based scoring functions incorporating ligand and protein fingerprints
CA2548882A1 (en) Methods and systems for analyzing and determining ligand-residue interaction
Brewerton et al. Structural analysis of DNA–PKcs: modelling of the repeat units and insights into the detailed molecular architecture
Zhou et al. Mechanistic insights into peptide and ligand binding of the ATAD2-bromodomain via atomistic simulations disclosing a role of induced fit and conformational selection
Ramatenki et al. Identification of new lead molecules against UBE2NL enzyme for cancer therapy

Legal Events

Date Code Title Description
FZDE Discontinued