Simplified molecular input line entry specification facts for kids
The simplified molecular input line entry specification or SMILES is a line notation for the chemical structure of molecules. It uses a short series ASCII characters to represent structures. Most molecule editor computer programs can draw a two-dimensional diagram or a three-dimensional model of a molecule based on its SMILES code.
Arthur Weininger and David Weininger wrote the original SMILES specification in the late 1980s. It has since been modified and extended by others, including by Daylight Chemical Information Systems Inc. In 2007, the Blue Obelisk open-source chemistry community developed an open standard called "OpenSMILES".
In July 2006, the IUPAC introduced the InChI as a standard line notation for representing chemical structures. People think that it is easier for humans to understand SMILES than InChI. Some chemists argue that SMILES is better than InChI because many different software programs support it. SMILES also has theoretical (for example, graph theory) backing.
Conversion
SMILES can be converted back to 2-dimensional representations using Structure Diagram Generation algorithms. This conversion can sometimes be ambiguous. Because SMILES just says which atoms are connected without any information about the angles between chemical bonds, computer programs use energy minimization to convert SMILES to 3-dimensional representations. There are many downloadable and web-based conversion utilities.
Examples
Here are some examples of the SMILES for molecules:
Molecule | Structure | SMILES Formula |
---|---|---|
Dinitrogen | N≡N | N#N |
Methyl isocyanate (MIC) | CH3–N=C=O | CN=C=O |
Copper(II) sulfate | Cu2+ SO42- | [Cu+2].[O-]S(=O)(=O)[O-] |
Œnanthotoxin (C17H22O2) | CCC[C@@H](O)CC\C=C\C=C\C#CC#C\C=C\CO | |
Pyrethrin II (C22H28O5) | COC(=O)C(\C)=C\C1C(C)(C)[C@H]1C(=O)O[C@@H]2C(C)=C(C(=O)C2)CC=CC=C | |
Aflatoxin B1 (C17H12O6) | O1C=C[C@H]([C@H]1O2)c3c2cc(OC)c4c3OC(=O)C5=C4CCC(=O)5 | |
Glucose (glucopyranose) (C6H12O6) | OC[C@@H](O1)[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O)1 | |
Bergenin (cuscutin) (a resin) (C14H16O9) | OC[C@@H](O1)[C@@H](O)[C@H](O)[C@@H]2[C@@H]1c3c(O)c(OC)c(O)cc3C(=O)O2 | |
A pheromone of the Californian scale insect | CC(=O)OCCC(/C)=C\C[C@H](C(C)=C)CCC=C | |
2S,5R-Chalcogran: a pheromone of the bark beetle Pityogenes chalcographus | CC[C@H](O1)CC[C@@]12CCCO2 | |
Vanillin | O=Cc1ccc(O)c(OC)c1 | |
Melatonin (C13H16N2O2) | CC(=O)NCCC1=CNc2c1cc(OC)cc2 | |
Flavopereirin (C17H15N2) | CCc(c1)ccc2[n+]1ccc3c2Nc4c3cccc4 | |
Nicotine (C10H14N2) | CN1CCC[C@H]1c2cccnc2 | |
Alpha-thujone (C10H16O) | CC(C)[C@@]12C[C@@H]1[C@@H](C)C(=O)C2 | |
Thiamin (C12H17N4OS+) (vitamine B1) |
OCCc1c(C)[n+](=cs1)Cc2cnc(C)nc(N)2 |
Specifications
See also
In Spanish: SMILES para niños