IU MolViz User Guide

Cambridge Structural Database



Introduction

The Cambridge Structural Database (CSD) contains over 355,000 (Version 5.27) organic and metal organic structures determined by X-ray and neutron diffraction studies. The database is fully retrospective with the earliest entry dating to 1923. The primary source of coordinate data is the literature and the CSD acts as a depository for many major journals. Each CSD entry contains bibliographic information as well as 2-D and 3-D representations of structures. The database can be searched by molecules or fragments of molecules drawn in the search program; or searches can be conducted on textual items such as authors, publication year, journal, compound name, etc.

This guide is intended to be reference for first-time CSD users (some contents in this page came from the CCDC's CSD documents). This guide is NOT necessarily updated or complete. Consult the online CSD manuals on the CCDC web site for a more detailed description.

For more information, see the Cambridge Crystallographic Data Centre (CCDC) web site.

1D information

2D information

3D information


Performing a CSD Search

Two programs distributed with the CSD are used for searching and retrieving information from the CSD. QUEST is the basic search engine for the CSD. ConQuest is a new program, which contains almost all of the functionality of QUEST and is being onwardly developed by CCDC. We highly recommend ConQuest because of its better graphic interface and many new features. The following brief instructions are written for the use of ConQuest. You are encouraged to go through the online ConQuest tutorials on the CCDC web site. If you perfer QUEST, please see the instructions for QUEST.

  1. Open a UNIX shell: Click the left mouse button on Desktop (in the toolbox in the upper left corner), then click on UNIX shell in the new pop-up menu.
  2. With the mouse cursor positioned in the new UNIX shell, type cq to start the ConQuest program.
  3. Start search:

Crystal structures found by the search could be visualized within ConQuest and QUEST programs, or by other more powerful crystal structure visualization software, such as Cerius2 or Mercury -- a crystal structure visualizer developed by CCDC (see http://www.ccdc.cam.ac.uk/prods/mercury). Mercury is also able to download hit lists from ConQuest and browse the entire CSD database.


General Outline of ConQuest

BUILD QUERIES

Draw
Draw a molecular structure or fragment (substructure), which is then used for a structure / substructure (2D or 3D) search e.g., find structures containing a benzene ring.
Author/Journal
Find structures mentioned in particular papers, or publications by specific author(s). You can search for the author name, journal name, volume, page number, and year of publication.
Compound Name
Use compound names to search for structures, or use partial names to find certain types of compounds or structures.
Elements
Find structures containing particular chemical elements or structures containing one of a group of element e.g., find structures containing Au and O.
Formula
Find structures with a particular molecular formula e.g., Fe1C30H28N2O2Cl1.
Space Group
Find all entries for a particular crystal system (e.g., tetragonal) or for a particular space group (e.g., C2/c).
Unit Cell
Search for structures with specific unit cell parameters (or reduced cell parameters) to find out whether a crystal structure with those particular cell dimensions was already determined.
Z/Density
Find crystals with densities in a particular range, or with a specific Z (number of molecules per unit cell) or Z' (number of molecules per asymmetric unit) values.
Experimental
Search structures by experimental details e.g., find structure determinations carried out at high or low temperatures, structures determined by neutron diffraction.
All Text
General text search e.g., find crystals of a specific colour, with specific morphology; find structures whose absolute configuration was determined by X-ray methods; find pharmaceutically or agrochemically active molecules by looking for the words drug, activity, agent etc.; search for particular experimental conditions; find polymorphs by searching for the words form phase or polymorph.
Refcode (entry ID)
Find an entry whose CSD Refcode (i.e., CSD entry identifier) is known.

COMBINE QUERIES

The Combine Queries screen can be used to find structures that satisfy a logical combination of queries e.g., structures which contain magnesium or a benzene ring, do not crystallize in space group P21/c, and were determined at a temperature below 273K.

VIEW RESULTS

Author/Journal
Bibliographic information.
Chemical
Compound name, formula, colour, melting point, etc.
Crystal
Unit cell and space group information.
Experimental
Information about experimental conditions, accuracy and precision.
Diagram
Chemical structural diagram. If the search was carried out for a substructure, the substructure will be highlighted in the diagram.
3D Visualiser
3D view of the molecule, which allows you to: rotate, translate and scale molecules; select from a variety of display styles; measure molecular geometry (distances, angles and torsions); label atoms; display crystallographic unit-cell contents.
CSD Internals
CSD internal information.
Search Overview
Summary of the query (or queries), filter(s), etc.

Output File Format of ConQuest

CIF (default file extension .cif)
Crystallographic Information File (CIF) format -- International Union of Crystallography format for crystal structure data (meet the IUCr format rules exactly).
CIFMIF (.cif)
Format which combines CIF crystallographic data names with chemical data names from the Molecular Information File, and additional non-standard fields containing CSD-specific information (e.g., information about whether the structure has unresolved errors).
COORD (.cor)
Simple ASCII listing of 3D coordinates and, where appropriate, cell data.
FDAT (.dat)
CCDC format for crystal structures, necessary for transferring structures into Pluto.
MOL2 (.mol2)
Tripos Inc. format for 3D molecules and crystal structures.
PDB (.pdb)
Protein Data Bank format for 3D molecules.
REFCODE (.gcd)
Simple ASCII listing of CSD refcodes.
SD (.sd)
SDfile (Structure-Data file) format for 3D molecules produced by MDL Information Systems Inc.
SHELX (.res)
Simulated SHEL-X .res file.
TXT (.txt)
Plain text format. Selected text items (Bibliography, Chemical, etc.) may be included.
TAB (.tsv)
Tab separated list of text items. Selected text items may be included.

Back to  |  Application Guide  |  MolViz Home  |
Last updated: 06/18/2002