Go to
Google
Go to PubMed Go to
Inserm 
Visit Paris5
University
Visit our Building
Research Publications People Programs
Databases
Utilities
Images Presentation
Teaching
Links Collaborations
Contact
Acknowledgments
Latest News Back to first page

Programs/Databases and utilities
Oct 21, 2008

1) FAF-Drugs: ADME/tox Filtering: simple rules to remove some undesirable molecules

2) Compound Collection-Databases: small molecules in 3D with 3 levels of ADME/tox filtering and DiversitySet ChemBridge - Validation Set 

3) Protein Electrostatics: electrostatic potentials and pKas

4) Python scripts, Awk, Unix, Shell: utilities to help managing some files

5) JME editor from P. Ertl (to draw and get SMILES)

6) JChemPaint vs 2.2.1 (to draw and get SMILES and draw from SMILES)

7) JMol (look at some molecules in 3D)

8) Similarity search for small molecules via eMolecules: SMILES input, search in several databases

9) Compute logP

10) AMMOS: refine compounds and docked molecules (get the software)

11) FAF-Drugs2: ADME/Tox (get the software)


12) MS-DOCK: accurate multiple conformation generator and rigid docking protocol for multi-step virtual ligand screening


1) ADME/tox prediction: we have developed several filters, the code is written in Python
Abstract
In silico screening based on the structures of the ligands or of the receptors has become an essential tool to facilitate the drug discovery process but compound collections are needed to carry out such in silico experiments. It has been recognized that absorption, distribution, metabolism, excretion and toxicity (ADME/tox) are key properties that need to be considered early on, even during the database preparation stage. FAF-Drugs is an online service based on Frowns (a chemoinformatics toolkit) that allows users to process their own compound collections via simple ADME/Tox filtering rules such as molecular weight, polar surface area, logP or number of rotatable bonds. SMILES (Simplified Molecular Input Line Entry System), CANSMILES (canonical smiles) or SDF (structure data file) files are required as input and molecules that pass or do not pass the filters are sent back in CANSMILES format. This service should thus help scientists engaging in drug discovery campaigns. Other utilities and several compound collections suitable for in silico screening are available at our site. FAF-Drugs can be accessed at http://bioserv.rpbs.jussieu.fr/FAFDrugs.html.
The paper is published in Nucleic Acids Res. 2006, 34(Web Server issue):W738-44, by Miteva, Violas, Montes, Gomez, Tuffery and Villoutreix
Title: FAF-Drugs: free ADME/tox filtering of compound collections (Go to PubMed)
Go to FAF-Drugs
2) Compound Collections-Databases
Montes and Villoutreix have generated several compound collections. These molecules from different Chemical vendors (Ambinter Asinex, ChemBridge, NCI, Specs) went through several ADME/tox filters and were transformed in 3D with Omega (OpenEye Scientific Software). Each molecule can have up to 50 structures. These collections are available at RPBS, they were put online by Dr. P. Tuffery. This project is done via a special agreement with OpenEye.



Diversity set ChemBridge 2007 - Validation set                                                                                       



collections are available at RPBS

.


.Check here
3) PCE: Electrostatic computations for Proteins and pKa predictions  
Dr. Miteva and Dr. Villoutreix have been working with the MEAD package (MEAD: Macroscopic Electrostatics with Atomic Detail, by Dr. D. Bashford). The program is now available online via a collaboration with Dr. P. Tuffery.
Abstract
PCE (protein continuum electrostatics) is an online service for protein electrostatic computations presently based on the MEAD (macroscopic electrostatics with atomic detail) package initially developed by D. Bashford [(2004) Front Biosci., 9, 1082-1099]. This computer method uses a macroscopic electrostatic model for the calculation of protein electrostatic properties, such as pK(a) values of titratable groups and electrostatic potentials. The MEAD package generates electrostatic energies via finite difference solution to the Poisson-Boltzmann equation. Users submit a PDB file and PCE returns potentials and pK(a) values as well as color (static or animated) figures displaying electrostatic potentials mapped on the molecular surface. This service is intended to facilitate electrostatics analyses of proteins and thereby broaden the accessibility to continuum electrostatics to the biological community. PCE can be accessed at http://bioserv.rpbs.jussieu.fr/PCE.
The paper is published in Nucleic Acids Res. 2005, 33(Web Server issue):W372-5, by Miteva, Tuffery, Villoutreix
Title: PCE: web tools to compute protein continuum electrostatics (Go to PubMed)
     Go to PCE
4) Python Scripts to process Surflex or FRED output and other utilities
These scripts are written by D. Lagorce, they will be provided soon.
For now:
            Script 1: extract molecules from a large .mol2 file 
(Get the script here)
5) Java Molecular Editor or JME Molecular Editor
This Java program allows to draw, edit and display small molecules. You can draw a compound and generate the corresponding SMILES. JME has been written by Dr. Peter Ertl 
(Novartis). Message from Bruno: Thanks Peter for allowing non-commercial users to work with your package. WARNING THIS MAY NOT WORK WITH ALL BROWSERS.
USE JME NOW
6) JChemPaint
This Java program was originally written by Christoph Steinbeck and is now developed by his group.
This is a free editor for 2D chemical structures (Go here for more explanations)
WARNING THIS MAY NOT WORK WITH ALL BROWSERS
USE JChemPaint NOW
7) JMol
Jmol is a Java applet to view molecules in 3D
This is a free viewer (Go here for more explanations)
WARNING THIS MAY NOT WORK WITH ALL BROWSERS

USE JMol to look at Butane NOW
USE JMol to look at a small Protein (this one is called Crambin) NOW
8) eMolecules
This is a chemical search engine, searching all the world's chemistry (more info here)
Use eMolecules NOW
9) Compute logP with jlogP [You can get information about jlogP (here)]

Lipophilicity is the most often used physicochemical property in quantitative structure-activity relationships (QSAR) studies because it is related to the distribution of a bioactive substance between body fluids and lipid-rich phases. Its quantitative descriptor is the octanol-water partition coefficient (POW; usually expressed as log POW), which is the ratio of the solute concentration in the octanol to that in the aqueous phase under equilibrium conditions at a given temperature.
People also talk about logP as the 1-octanol/water partition coefficient, please check some articles about that. If a molecule contains basic or acidic groups, it can be ionized and its distribution in 1-octanol/water mixture becomes pH-dependent. The pH-dependent distribution coefficient, logD, was shown to correlate with a number of biological parameters, such as the effective permeability in human jejunum, blood-brain barrier permeability values and volume of distribution. If log P and pKa are known, log D may be derived using the Henderson-Hasselbalch equation. Thus both coefficients are important parameters for drug development.
Hydrophobicity can be expressed as logP: this is the calucalted logarithm of the octanol/water partition coefficient of the molecule. The higher the logP, the more hydrophobic the molecule. Note that the  logP values calculated by jlogP are very approximate.
The important features and limitations of jlogP are listed below:
-Allowed structures: Structures using C, N, H, O, P, S, and the halogens are allowed  with the following limitations (jlogP will issue a warning if you attempt to calculate  the properties of an 'illegal' structure):
          -Carbon atoms can make only 4 covalent bonds and cannot be charged.
          -Nitrogen atoms can make 3 or 4 covalent bonds (except in the case of NO2  where they can make 5 covalent bonds); nitrogens can be (+)-charged;  (-)-charged N's are not allowed.
          -Oxygen atoms can make 1 or 2 covalent bonds; oxygens can be (-)-charged but not (+)-charged.
          -Sulfur atoms can make 2 covalent bonds (or 6 as in a SO2 group); they  cannot be charged.
          -Phosphorus atoms can make 5 covalent bonds only (as in a PO4 group).
logP calculation method: jlogP uses a modified version of the atom-additive  methods used by Wang et al. (J. Chem. Inf. Comput. Sci. 1997,  37, 615-21 and Perspectives in Drug Discovery and Design 2000:  19, 47-66) in their xlogp software. Briefly, both programs categorize each  atom in the molecule into one of 90+ types based on the element, its hybridization,  and its neighbors. Each type has a characteristic contribution to the overall logP  of the molecule. xlogP sums the individual atom contributions and applies 10 correction  factors that take larger scale effects into account. jlogp 1.1 does not apply these correction factors at present.
Charged groups: The only charged groups allowed by jlogP are N+ and O-. Because  the logP of charged groups is highly dependent on pH, jlopP's logP estimates for  charged molecules are very approximate.
In General Biology, the key concepts that facilitate a basic understanding of structure-hydrophobicity relationships are:
• Concept 1: Polar (-OH, -NH2, -C O, etc.) or charged (-O– or -N ) groups are hydrophilic.
• Concept 2: Charged groups are more hydrophilic than uncharged groups.
• Concept 3: Non-polar groups (-CH2-, -CH3, -SH, etc.) are hydrophobic.


Most programs computing logP are not able to deal with charged structures, this is because the logP of a charged molecule is more complex to calculate and is highly dependent on pH. To compute logP of charged molecules, the Marvin package is among the best one. Yet, jlogP can also deal with charged structures according to the limitations described above.

Overall, a computed logP of about -6 indicates that the molecule is very hydrophilic (for example, glycine ClogP = -3.9) while the ClogP of decane = 5.9.

One small comment about logP from Brian White from the Biology Department, University of Massachusetts, Boston, Massachusetts.
When trying to design a drug, one would like to measure its hydrophobicity by adding it to a flask containing water and octanol. Since water and octanol don’t mix well, you get two layers. If the drug is very hydrophilic, you will find it all in the water layer and none in the octanol. If the drug is very hydrophobic, you will find it all in the octanol layer and none in the water layer. If the drug is in between, you will find some in the water and some in the octanol. The ratio of the amount
found in the octanol divided by the amount found in the water is called the octanol-water partition coefficient; this is abbreviated POW and is higher the more hydrophobic a molecule is. Since POW varies over a large range, it is convenient to take the base-10 logarithm of POW or log(POW).
For example, if we consider a drug that is moderately hydrophobic. Suppose that if you put 10 grams of the drug into the octanol/water flask, shake it up, and let it come to equilibrium, you find 9.09 grams of the drug in the octanol and 0.909 grams of the drug in the water. The POW = 9.09/0.909 or 10 (10 times more of the drug goes into the octanol than the water). The log(POW) would be log(10) or 1.
More information (with many examplex) about logP computations can be found for example in the
ClogP user guide
Use jlogP NOW
10) AMMOS

This is package to refine the 3D structures of compounds present in a file and a tool that allows energy minimization of pre-docked ligands in the context of the protein binding site.
Pencheva T, Lagorce D, Pajeva I, Villoutreix BO, Miteva MA. BMC Bioinformatics. 2008 Oct 16;9(1):438.


Get AMMOS
 NOW
11) FAF-Drugs2

This is package for filtering a compound collection (ADME/Tox using many rules).
Lagorce D, Sperandio O, Galons H, Miteva MA, Villoutreix BO. FAF-Drugs2: free ADME/tox filtering tool to assist drug discovery and chemical biology projects. BMC Bioinformatics. 2008 Sep 24;9(1):396

Get FAF-Drugs2
 NOW
12) MS-DOCK

Sauton N, Lagorce D, Villoutreix BO, Miteva MA. MS-DOCK: accurate multiple conformation generator and rigid
docking protocol for multi-step virtual ligand screening. BMC Bioinformatics. 2008 Apr 10;9:184.

Get MS-DOCK