IMAGE Clone Verification Data

 

Assessment of Clone Identity and Sequence Fidelity
Within a Commercially Distributed Subset of IMAGE Clones

R.G. Halgren, M.R. Fielden, C.J. Fong, and T.R. Zacharewski.
Department of Biochemistry & Molecular Biology
National Food Safety & Toxicology Center and
Institute for Environmental Toxicology
Michigan State University
East Lansing, Michigan 48824-1319 USA

This report documents the error rate in a commercially distributed subset of the IMAGE Consortium mouse cDNA clone collection. After isolation of plasmid DNA from 1189 bacterial stock cultures, only 62.2 percent were uncontaminated and contained cDNA inserts that had significant sequence identity to published data for the ordered clones.

An agarose gel electrophoresis prescreening strategy identified 361 stock cultures that appeared to contain two or more plasmid species. Isolation of individual colonies from these stocks demonstrated that 7.1 percent of the original 1189 stocks contained both a correct and an incorrect plasmid. 5.9 percent of the original 1189 stocks contained multiple, distinct, incorrect plasmids, indicating the likelihood of multiple contaminating events.

While only 739 of the stocks purchased contained the desired cDNA clone, agarose gel prescreening, colony isolation, and similarity searching of dbEST allowed for the identification of an additional 420 clones that would have otherwise been discarded. Considering the high error rate in this subset of the IMAGE cDNA clone set, the use of sequence verified clones for cDNA microarray construction is warranted.

When this is not possible, prescreening non-sequence verified clones with agarose gel electrophoresis provides an inexpensive and efficient method to eliminate contaminated clones from the probe set.

These data accompany a published manuscript entitled "An Assessment of Clone Identity and Sequence Fidelity Within Commercially Distributed Clones From the IMAGE Collection" Halgren, R.G., Fielden, M.R., Fong, C.J. and Zacharewski, T.R., Nucleic Acids Research, 29(2), 582-588, 2001. This article was featured in April 19 issue of Nature (Vol 410, 860-1, 2001. When the Chips are Down). All of the clones described in the manuscript have been provided to Research Genetics and are available from Research Genetics. Collaborations or other joint efforts using these data and clones are appropriate and encouraged. Please contact Tim Zacharewski for further information.

Ideally, all the data you would want is available here. However, should you find that you require additional data or other file formats, please do not hesitate to ask. Due to space constraints, image files for electrophoresis assays are not presented, although they are archived off-line. Files are presented in either MS Excel or HTML format. Clicking on an HTML formatted file will open the page in a new window. Please note that the Excel files are commented in the headers for each sheet.

 

Description Notes Available Files
Summary of results by stock ordered This file presents the information for the clone order submitted to Research Genetics, including the IMAGE clone ID ordered, the library from which the clone is derived, location in IMAGE stock plates, and location on the stock plates we received. Also included is an analysis of which stocks were correct, incorrect, contaminated, or failed to grow. per_stock.xls
Sequence Verification Data Summary Excel file with four summary pages.
1 - Correct sequences from prescreened clones (primary)
2 - Incorrect sequences from prescreened clones with BLAST data for clone identification
3- Correct sequences from Isolated clones (secondary)
4- Incorrect sequences from isolated clones with BLAST data for clone identification
webdata_totals.xls
BLAST Data (Isolated Clones) The results of attempts to identify unknown sequences obtained during physical isolation of clones. Up to 10 significant hits per query (significance defined as E < 1e-50) are examined against the UniGene database to obtain a UniGene cluster ID. Note that not all ESTs are found in a UniGene cluster. blast_negatives1.htm
Comparison of Negative Clones A pairwise comparison of all clones derived from a single stock well using the BLAST 2 Sequences tool. Note that short matches in the 5' region of a sequence were not considered to be evidence of identity. This is probably cloning vector sequence. compare_negatives.htm