Jump to main content.


Structure Browser Information

The EPA DSSTox Structure-Browser, developed from available structure-viewing freeware and open-source programming tools (see Specifications& Acknowledgements), delivers a simple, easy-to-use structure-searching capability through the chemical inventory of published DSSTox Data Files. The DSSTox Structure-Browser application can be accessed directly from this website, or from the external websites of DSSTox Collaborators and others through a simple URL address link (see Accessing DSSTox Structure-Browser). Additional details of the DSSTox Structure-Browser options and operations are provided below:

blue bullet graphic Version 2.0 Enhancements ** Updated 12 August 2008

blue bullet graphic Chemical Text Search Option
blue bullet graphic Data Files to Search Option
blue bullet graphic Chemical Structure Drawing Editor
blue bullet graphic SMILES Search Option
blue bullet graphic Structure Search Options ** Updated 12 August 2008
blue bullet graphic Searching at Generic Test Substance Level
blue bullet graphic Output Options

blue bullet graphic Accessing DSSTox Structure-Browser from External Websites ** Updated 12 August 2008

blue bullet graphic Specifications
blue bullet graphic Planned Upgrades & Future Enhancements
blue bullet graphic Acknowledgements
blue bullet graphic Disclaimers

feedback image Submit Problem Report or Feedback

Note: The DSSTox Structure-Browser has been optimized for use in Microsoft Explorer. We caution that users may experience some problems (e.g., text wrapping) with other Internet Browsers (e.g., Mozilla FireFox) at this time. For more information on current Structure-Browser limitations and planned improvements, see Specifications.

New Users: For more information on the EPA DSSTox project, visit the DSSTox Home page.

 

DSSTox Chemical Text Search

Chemical text search options listed below offer various types of text searching through DSSTox Standard Chemical Fields in all or selected DSSTox Data Files. The text search option is limited to exact text matches (all or partial text) through existing DSSTox Data File content and, therefore, provides a much more limited search option than performing a structure/substructure/similarity search.

Note: DSSTox Data Files do not, as a rule, contain lists of chemical name synonyms or alternate/discontinued CAS RN, although occasionally these are provided in the various Note fields (ChemicalNote or Note_NAMEID).

View of chemical text search box in browser

Chemical Name: wildcard (*) enabled; case-insensitive

Searches full or partial chemical names (common or systematic names) contained in TestSubstance_ChemicalName and STRUCTURE_ChemicalName_IUPAC fields of DSSTox Data Files. Lists of chemical name synonyms are not provided in DSSTox Data Files. Wildcard character (*) option can be used to broaden name search. If a match is found in the DSSTox inventory, a broader structure search can be initiated. Sample entries:

Chloroprene
chloroprene
2-chlorobuta-1,3-diene
2-chlorobuta*diene
chloropre*

CAS RN: formatted or unformatted (with or without hyphens)

Searches for CAS Registry Numbers contained in the TestSubstance_CASRN field of DSSTox Data Files. If a match is found in the DSSTox inventory, a broader structure search can be initiated. Alternate or retired CAS RNs are not searched. A CAS RN Check is performed to determine if entered value represents a valid CAS RN; an error message is returned if invalid CAS RN is entered in search box. Sample entries:

126-99-8
126998
117279-73-9
117279739

InChI: case-sensitive, must be proceeded by standard "InChI=..." text

Searches for InChI (version 1) text-string representations of chemical structure contained in the STRUCTURE_InChI field of DSSTox Data Files (see More on InChI). An exact match can be used to initiate a broader structure search. Sample entry for Chloroprene [126-99-8]:

InChI=1/C4H5Cl/c1-3-4(2)5/h3H,1-2H2

Note: Although InChI is a representation of chemical structure, this version of the DSSTox Structure-Browser only performs a text match search and does not automatically convert InChI directly to structure for a structure search; later versions will include this feature of automatic initiation of a broader structure search.

Formula: case-sensitive

Searches for full or partial empirical (molecular) formula contained in the STRUCTURE_Formula field of DSSTox Data Files. An exact match can be used to initiate a broader structure search. Sample entries:

C4H5Cl
C7H12Cl2N2
C16H12ClNO2
C16

Auto-detect: default selection, wildcard (*) enabled for Chemical Name only

Default selection automatically detects the type of chemical text entry as one of the 4 allowed types listed above, filtering detection in order of most sytematic/easily detected (InChI, CAS RN), to less systematic (Formula), to least systematic (Chemical Name). If the requirements of InChI, CAS RN, and Formula (which additionally checks for valid atomic symbols) are not met, then the entry is assumed to be a Chemical Name, without further validity checks.

InChI ......................must include "InChI=" at start
CAS RN .................numeric characters only, with or without hyphens inserted, satisfying CAS check
Formula .................mixture of numeric and text characters following minimal standard atomic precedence rules; no spaces or extra characters
Chemical Name ......default detection, no further requirements

Return to the list above Return to Top

 

"Data Files to Search" Options

View of Data Files to Search option box on browser

The DSSTox Structure-Browser is designed to search chemical/structure content through all or a portion of the published DSSTox Data File inventory. This is made possible because individual DSSTox Data Files are separately indexed within the system.

The default option is to search through "All DSSTox Files", i.e. the full list of published DSSTox Data Files.

Clicking on "Selected DSSTox Files" (or clicking on pull-down arrow to left) opens a list of all available current DSSTox Data Files, each one already checked for the default "All DSSTox Files" option. The user can selectively un-check one or more DSSTox Data Files in this list to restrict searching to within only checked files.

Note: When the DSSTox Structure-Browser is accessed from external websites, through a URL link to the EPA server hosting the Structure-Browser application, the specifics of the URL will specify which DSSTox Data Files the Structure-Browser is to search through. A user can change this incoming default selection using this Data Files to Search options once they are in the browser. See Accessing the DSSTox Structure-Browser for more information.

Return to the list above Return to Top

 

Chemical Structure Drawing Editor

View of Chemical Drawing Tool for entering chemical structures

Chemical Drawing Editor: exit EPA

The JME Molecular Editor exit EPAapplet was developed by Peter Ertl at the University of Bratislava (later enhanced at Ciba-Geigy, Basel and Novartis Pharma AG). The applet is freely available to the public and has become a standard for molecular structure input on the web, with more than 6500 installations worldwide. JME can be downloaded from the Molinspiration cheminformatics exit EPAwebsite. Additional information on JME is available at:

Facts about JME
JME documentation
JME tutorial

Example shown above has user-drawn structure for chlorobenzene. Once structure has been submitted for search, the original Search page will also compute and display the corresponding SMILES.

Return to the list above Return to Top

 

SMILES Search Option

Smiles Search Option Entry

SMILES: case-sensitive

SMILES are a text-based representation of chemical structure (see More on SMILES) but, unlike InChI codes, are not unique. Hence, a "SMILES" search first converts the SMILES code to a unique representation of "structure", this structure is displayed in the Chemical Structure Drawing Editor for user verification, and then the user has the option to initiate a structure search in the DSSTox Structure-Browser.

C=C(Cl)C=C
ClC4=C(C=CC=C4)C2=NC(C)C1=NN=C(C)N1C3=C2C=C(CCC5=CC=C(CC(C)C)C=C5)S3
ClC1=CC=CC=C1C=C(C#N)C#N

Example above shows SMILES entry for chlorobenzene. If the "Preview below" button is clicked, the corresponding drawn structure appears in the Chemical Structure Drawing Editor (see preceding graphic).

Return to the list above Return to Top

 

Structure Search Option

View of Search Options box in browser.

Exact match: default option - on

If this option is checked, all "exact" structure matches to the query structure will be presented on the File Incidences search results page of the Structure-Browser. This search option ignores charge differences, stereo-specific information, and does not include equivalent tautomeric forms. Exact match results are displayed on the Search Details results page and listed separately by row if their DSSTox TestSubstance (Generic Substance ID) characteristics differ.

Substructure: default option - on

If this option is checked, all structures containing the query "substructure" will be presented on the File Incidences search results page of the Structure-Browser. This search option ignores charge differences and stereo-specific information. Substructure results are displayed on the Search Details results page and listed separately by row if their DSSTox TestSubstance (Generic Substance ID) characteristics differ. The user is cautioned that this search option can return exceedingly large numbers of search hits. To produce smaller numbers of hits, the user should use a larger or more specific substructure query.

Similarity: default option - on; Threshold default 80%

If this option is checked, a commonly employed "Tanimoto similarity coefficient" is used as a search filter, with the default threshold value of 80% similarity coefficient arbitrarily chosen to provide a reasonable capture rate of hits that are likely to have obvious similar structure characteristics when viewed by the user. All hits exceeding the chosen Threshold % similarity score will be presented on the File Incidences search results page of the Structure-Browser. Increasing the Threshold % will place more stringent criteria for similarity, producing fewer hits above the threshold, whereas decreasing the Threshold % will loosen the criteria for similarity, producing more hits, but of less obvious similarity. Similarity hits presented on the Search Details results will be listed along with their Tanimoto similarity score to provide the user an approximate sense of how similar the hit is perceived to be relative to the query structure. Note that a minimum number of hits (at least 10 structures) will be displayed, regardless of the threshold entered. Hence, in the event that fewer than 10 structure hits are produced above the chosen threshold, the adjusted threshold that produced the displayed 10 hits is shown.

** New to v2.0: A minimum of 10 structure match hits are presented on the File Incidences search results page, regardless of the chosen default threshold for similarity. The Similarity cutoff displayed under the "Results Type" header will be adjusted accordingly on the FIle Incidences page (e.g., the top 10 structure matches for "atrazine" structure search yields 12 substance hits, exceeding a similarlty of 51.2%).

Of the three Search Options, Similarity is capable of casting the broadest net in a search for chemical analogs as it is not restricted to the precise characteristics of the drawn chemical structure query. Adjusting the Threshold %, a user can calibrate how stringent the similarity metric, and how many hits are returned.

A Tanimoto similarity coefficient is computed based on comparison of binary "fingerprint" representations of all structures in the search inventory to the binary fingerprint of the query structure. Such fingerprints are generally computed from fragment decompositions of each molecule as follows:

the entire search inventory (all molecules) is decomposed into all possible component fragments - for our purposes a fragment is: 1 to 8 atoms connected by a linear, consecutive path of non-overlapping covalent bonds, a 3 to 7 membered ring, or an atom center with 3 or more covalently bonded non-hydrogen atoms;
each molecule's component fragments are matched against the larger inventory of all fragments found in a search inventory (in this case, the set of all fragments from decomposition of the published DSSTox file inventory);
if a given fragment is present in the molecule, a binary value of "1" is assigned to that fragment position in the molecule's fingerprint vector, whereas if the fragment is not present, a value of "0" is assigned;
each molecule is then represented as a binary (bit) fingerprint, e.g., 1000010010000011000...
the Tanimoto similarity coefficient (S) between two molecules is computed by comparison of the bit fingerprints for the query and target molecules, with S = 1 (or 100%) indicating identical fingerprints and values approaching S = 0 (or 0%) indicating maximum dissimilarity or no common bits in the two fingerprints:

Sab = c/(a+b-c)

c is number of "on" bits (matching) in both query and the target,
a is number of "on" bits in query (matching or not),
b is number of "on" bits in target (matching or not).

Implementation of the Tanimoto similarity coefficient in the DSSTox Structure-Browser is described in Specifications.

A more general and technical discourse on similarity metrics, including Tanimoto coefficients, can be found at: http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=eurekah.section.3798 exit EPA

Return to the list above Return to Top

Searching at Generic Test Substance Level

The DSSTox Structure-Browser performs structure searches through the inventory of chemical substance records in DSSTox Data Files based on the contents of the STRUCTURE field, which is uniquely indexed by DSSTox_CID (Chemical ID). However, since different test substances (mixtures, different purity grades, etc.), which could have different toxicity characteristics, can share the same structure representation and DSSTox_CID, search results are presented to the user based on the more specific generic substance identifier, DSSTox_Generic_SID. (denoted "DSSTox Substance ID" on the "Search Details" page)

DSSTox_Generic_SID better discriminates test substance at a level nearly equivalent to the commonly employed CAS RN identifier, but also discriminates when no CAS RN are available, or when CAS RN do not discriminate different substances, such as when purity grades are known to differ. DSSTox_Generic_SID does not, however, discriminate test substance records that differ at the test sample level (e.g., lot, batch, manufacturer, plate number); hence our use of the term "Generic" in the field name. The DSSTox Record ID (DSSTox_RID) is a unique record identifier that does distinguish to the level of test sample, which is important for those DSSTox data files undergoing current testing (i.e., NTPHTS and TOXCST).

Listed on the File Incidences search results page, at the bottom of the table that includes the breakdown of search results according to DSSTox Data File, are the summarized totals for substance hits.

Total Unique Substance Hits = Total number of different DSSTox_Generic_SID 's occurring in each Search type column (i.e., Exact, Substructure, Similarity) across all listed data files.

Total Substance Hits = Total number of DSSTox_Generic_SID 's occurring in each Search type column (i.e., Exact, Substructure, Similarity) across all listed data files, counting all replicates of a DSSTox_Generic_SID found in different DSSTox data files.

The practical ramifications of displaying results by DSSTox_Generic_SID (listed as DSSTox Substance ID in figure below) are that test substance hits, even those sharing the same STRUCTURE or DSSTox_CID identifier, will be listed separately by DSSTox_Generic_SID in the same results line (with links to one or more DSSTox Data Files containing that same DSSTox_Generic_SID substance). An example of such a view from the Search Details page is shown below:

Structure search list showing different SID hits with same structure

Note: The presentation of structure search results in the DSSTox Structure-Browser by DSSTox_Generic_SID differs significantly from PubChem, which groups substance results (defined by Source depositor, not by test substance characteristics) at the level of PubChem Chemical ID (PubChem CID), a structure identifier with a 1:1 correspondence to DSSTox_CID.

Return to the list above Return to Top

Output Options
View of Search Details - Output Options box in browser.View of Substance Results - Output Options box in browser.

Save Option

.txt ..... Exports tabular text information on Search Details or Substance Results page into a text (.txt) document; does not include chemical structure images.

.csv ..... Exports tabular text information on Search Details page into a csv (comma-delimited) spreadsheet document; does not include structure images. "csv" is a standard export/import format for tabular information and can be imported into MS Excel. Substance Names, which frequently contain commas, are enclosed in quotation marks during the export process to prevent these commas from being interpreted as delimiters.

Print Option

Selecting the "Print" option uses the Internet browser print function to print the page being viewed; limitations of this print function will be browser-dependent and outside of our control.

Exporting Full DSSTox Data Files

All users of the DSSTox Structure-Browser should be aware that downloads of complete DSSTox Data Files (SD files or .xls data tables), along with documentation, are available from the DSSTox Website. These SD files are a primary product of the DSSTox Project and can be imported into any user-owned Chemical Relational Database (CRD) application to facilitate local structure-browsing and tailored data capture.

Return to the list above Return to Top

Accessing DSSTox Structure-Browser from External Websites

The DSSTox Structure-Browser, with default search settings, is accessible by EPA and external collaborators using a simple URL:
http://www.epa.gov/dsstox_structurebrowser/. Extensions to the URL allow users to modify the parameters of the search. The general forms of the URL extension are:

?dbs=[NAMEID]&hits=[search type]&thresh=[number]
?qtype=[query type]&qval=[query value]

where dbs refers to DSSTox data file name (e.g., NTPBSI, IRISTR), and

where qtype can be any of the following: auto, name, cas, inchi, formula, smiles, cid, sid, inchikey (the last new to v2.0).

To change the default setting for Data Files to Search from "All DSSTox Files" to restrict the incoming search to a particular DSSTox Data File, use the URL extension: ?dbs=NAMEID
where NAMEID is the 6 letter DSSTox Data File NAMEID code, e.g., CPDBAS, EPAFHM, etc; the default entry is dbs=all. To search through more than one file, use the modification: ?dbs=NAMEID(1)-NAMEID(2)-NAMEID(3)

For example:

http://www.epa.gov/dsstox_structurebrowser/?dbs=CPDBAS ... searches only through CPDBAS file contents.

http://www.epa.gov/dsstox_structurebrowser/?dbs=IRISTR-EPAFHM-HPVCS ... searches only through 3 EPA files: IRISTR, EPAFHM, and HPVCSI.

To add modifiers to specify search default options and search threshold, use the additional qualifiers, e.g. hits=exc-sub-sim&thresh=60; the default entries are hit=all and thresh=80

For example:

http://www.epa.gov/dsstox_structurebrowser/?dbs=NTPBSI-&hits=exc ... searches through NTPBSI for only exact matches.

http://www.epa.gov/dsstox_structurebrowser/?dbs=all&hits=sub-sim&thresh=75 ... searches through all files displaying only substructure and similarity hits above the similarity threshold = 75..

User-defined limits to the text or structure-search parameters are of the form ?qtype, which can be combined with the qval and data file restrictions above. If using data file restrictions, a hyphen should precede the ?qtype, i.e., as ?dbs=IRISTR-?qtype=... Various examples include:

http://www.epa.gov/dsstox_structurebrowser/?qtype=auto&qval=benzene ... chooses Text Search: Auto-detect = benzene

http://www.epa.gov/dsstox_structurebrowser/?qtype=name&qval=benzene ... chooses Text Search: Chemical Name = benzene

http://www.epa.gov/dsstox_structurebrowser/?qtype=cas&qval=87-86-5 ... chooses Text Search: CAS RN = benzene

http://www.epa.gov/dsstox_structurebrowser/?qtype=inchi&qval=InChI%3D1/C7H7N3O3/c1-9(8-11)6-2-4-7(5-3-6)10(12)13/h2-5H,1H3 ... chooses Text Search: InChI = InChI%3D1/C7H7N3O3/c1-9(8-11)6-2-4-7(5-3-6)10(12)13/h2-5H,1H3

http://www.epa.gov/dsstox_structurebrowser/?qtype=formula&qval=C6H12O6 ... chooses Text Search: Formula = C6H12O6

http://www.epa.gov/dsstox_structurebrowser/?qtype=smiles&qval=C1CO1 ... chooses Structure Search: SMILES = C1CO1 (see Note below)

http://www.epa.gov/dsstox_structurebrowser/?qtype=cid&qval=20238 ... submits a Structure Search based on the DSSTox_CID (compound ID) = 20238

http://www.epa.gov/dsstox_structurebrowser/?qtype=sid&qval=20429 ... submits a Structure Search based on the DSSTox_Generic_SID = 20429

http://www.epa.gov/dsstox_structurebrowser/?qtype=inchikey&qval=AMTRFSFECFPSHF-UHFFFAOYAZ ... submits a Structure Search based on the InChIKey = AMTRFSFECFPSHF-UHFFFAOYAZ

http://www.epa.gov/dsstox_structurebrowser/?dbs=IRISTR-&qtype=name&qval=benzene .... performs a Text Search only through IRISTR for Chemical Name = benzene

**Note: Some characters in search strings may be reserved for special html interpretations; hence, it may be necessary to "escape" or "encode" a url-based query when storing it or sending it to a colleague. This is particularly true of SMILES, where alternative characters may be needed for commonly used symbols such as:
= ..... %3d
@ ... %40
# ... %23
[ ... %5b
] ... %5d

More information and guidance on these restrictions are available at: http://en.wikipedia.org/wiki/URL-encodingexit EPA

Return to the list above Return to Top

DSSTox Structure-Browser Specifications exit EPA

The DSSTox Structure-Browser version 1.0 was developed using the following publicly available tools, scripting code, and applications:

JME Molecular Editor exit EPA (see Chemical Drawing Editor)

CDK (Chemistry Development Kit) exit EPA (with some modifications by Tom Transue, Lockheed Martin - Contractor to the US EPA)

JDeveloper exit EPA (freely available from Oracle exit EPA) was used to incorporate CDK source code using Java 1.4 exit EPAinto a servlet called by HTML and JavaScript pages.

Original Perl exit EPA code (developed and implemented using Perl 5.8.0) was used for pre-processing the database chemicals and for comparing query molecules (see below). This code (without documentation or further support or guarantees) can be provided to interested persons upon request (Contact Us).

Version 1.0 of the chemical structure search process is carried out using original Perl code designed by Tom Transue (Lockheed Martin - Contractor to the US EPA) with help from Joe Krahn (formerly Lockheed Martin - Contractor to the US EPA). Structures drawn or submitted as SMILES strings are first processed using the CDK toolkit to produce a "mol" file with an aromaticity model determined by CDK's implementation of Huckle's Rule. Perl code then establishes a compound's attributes or paths considering only heavy-atom (non-hydrogen) elements without consideration of charge or isotope and bond connectivity (including bond orders: 1,2,3, and "aromatic"). Each compound is assigned a set of attributes or paths consisting of linear paths (up to length 8), rings (up to 7-memeber), and atom centers (with 3 or more heavy-atom bond-partners). An original algorithm then compares paths generated from a query molecule to the 29,364 unique paths found in the 6,637 unique chemical structures found in the DSSTox data inventory (as of 15Aug2008) to determine a Tanimoto similarity coefficient and to determine candidates for superstructures of the query. A second algorithm confirms which of these candidates is in fact a superstructure and which are exact matches. Results are presented with a heavy use of client-side JavaScript to reduce network demand and improve performance. See Acknowledgements & Disclaimers.

Return to the list above Return to Top


DSSTox Structure-Browser v2.0 Enhancements

The DSSTox Structure-Browser version 2.0 includes several new enhancements and features:

Expanded "link-in" options and user-defined limits

Users or external websites are able to directly access the DSSTox Structure-Browser Main Search Page, or File Incidences Search Results Page (resulting from a structure search), by simple extensions to the DSSTox Structure Browser URL. Features include:

option to limit search to one or more DSSTox Data Files;
text-search options (Chemical Name, CAS, Formula, InChI); and
structure-search options (SMILES, InChIKey, DSSTox_CID, DSSTox_Generic_SID).

For general format of the URL extensions and several examples, see: Accessing DSSTox Structure-Browser from External Websites

Minimum of 10 structure similarity hits displayed

A minimum of 10 structure match hits are presented on the File Incidences search results page, regardless of the chosen default threshold for similarity. The Similarity cutoff displayed under the "Results Type" header will be adjusted accordingly on the FIle Incidences page (e.g., the top 10 structure matches for "atrazine" structure search yields 12 substance hits, exceeding a similarlty of 51.2%). The purpose is to override the similarity threshold only in cases where too few structure search hits are returned.

External Resources

New structure-based link-outs to external websites are provided (three are external to EPA and are currently active; the fourth, ACToR, will be activated as soon as this new EPA resource becomes publicly available, est. Sept 2008). These link-outs are intended to assist users in directly accessing structure-related content and capabilities on external public websites of potential interest to the DSSTox user community. The link-out corresponds to the displayed or indicated Query Structure on any of the DSSTox Structure Browser Search Results Pages (File Incidences, Search Details, Substance Results). Note that for the case where a text search (e.g., Chemical Name or CAS) produces a hit, the External Resources link-outs will be activated even though the initial File Incidences Search Results Page does not display a query structure (this requires a structure search resubmit, see below).

External Resources: PubChem, ChemSpider, EPA ACToR, Lazar in silico tox

PubChem exit EPA
For more information on PubChem and in relation to the DSSTox Project, see Searching DSSTox Files in PubChem. Link-out from the DSSTox Structure Browser is based on the DSSTox field, PubChem_CID (PubChem Compound IDs), available for all registered DSSTox substances.

ChemSpider exit EPA
ChemSpider is a free on-line service providing access to millions of chemical structures, computed chemical properties, chemical resources, and integration to a multitude of other online services, including link-outs to DSSTox and PubChem. In addition, ChemSpider has recently used the EPA EPI (Estimation Programs Interface) Suite™ estimation model software to generate physical/chemical and environmental fate properties for its entire on-line inventory of over 20 million structures. Link-out from the DSSTox Structure Browser is based on InChIKey (conforming to ChemSpider conventions for InChIKey generation), available also as a DSSTox Standard Chemical Field, STRUCTURE_InChIKey

Lazar Predictive Toxicity exit EPA
Lazy Structure-Activity Relationship (lazar) predictions are based on chemical structure and activity similarity calculations (activity-informed nearest neighbor approach) for a variety of summary toxicity endpoints. More information can be found on the Lazar Documentation page. exit EPA Link-out from the DSSTox Structure Browser is based on SMILES, available as a DSSTox Standard Chemical Field, STRUCTURE_SMILES

ACToR
ACToR (Aggregated Computational Toxicology Resource) is a tool being developed within the EPA's National Center for Computational Toxicology to manage large scale sets of assay associated with the EPA ToxCast project, as well as publicly available inventories and toxicology data on chemicals of interest to the EPA. The resource is slated for public internet release in September, 2008. For more information, visit the EPA ACToR Home Page. Link-out from the DSSTox Structure Browser will be based on ACToR CIDs corresponding to DSSTox registered substances.

Structure "hit" resubmit feature

As an extension of a normal structure search, this option allows a user to automatically submit any structure "hit" resulting from a text or structure search to a new structure search (exact/substructure/similarity). This feature provides structure look-up capability through the entire DSSTox Data File structure inventory and, additionally, circumvents the need for a user to recall and draw a structure to submit for a structure search. In addition, the feature enables structure analog hits resulting from the substructure or similarity search to be submitted for a structure search, i.e. converting the structure hit to the Query structure which then enables External Resource link-outs to be accessed.

An example is shown below in which a Search for Chemical Name = atrazine, was originally submitted, and Details of the Exact matches (1 Hit) was chosen on the File Incidences Page.

text search hit to structure

The text hit retrieves a structure for "Atrazine" that is listed as being present in 7 DSSTox Data Files. The yellow triangle (SS) in the lower right corner of the structure display (shown above) can be clicked to submit this structure for a standard search (i.e., exact, substructure, and similarity) across the originally specified DSSTox Data Files. The results of this structure search will display the structure in the Query box and will display similarity hits to this structure as shown below, any one of which could be resubmitted for a structure search. If the user wishes to edit this automatically generated Query structure and resubmit, they can return to the Search Tab Page where the Atrazine structure will now appear in the Chemical Drawing Editor window.

atrazine structure search results page

 

Return to the list above Return to Top

Planned Upgrades & Future Enhancements

The following upgrades and features are planned for future versions of the DSSTox Structure-Browser:

better management of browser "back arrow" or "back page" button in relation to DSSTox Structure-Browser page tabs;

an optional "any atom" (wildcard "X" atom) search in the Chemical Structure Drawing Editor to allow more complex queries;

more Output Options on the Search Results Page to include SDF file export;

highlightng of atoms matching the query substructure in the displayed structure hits on the Search Results page;

paged results on the details tab to display a limited number of chemicals (e.g., <100 at a time);

Return to the list above Return to Top

DSSTox Structure-Browser Acknowledgements

The DSSTox Structure-Browser was developed by Tom Transue (Lockheed Martin - Contractor to the US EPA), with design assistance from Ann Richard (EPA) and Michael Brown (Lockheed Martin). Others (from the US EPA or from Lockheed Martin, Contractors to the US EPA) who assisted with initial stages of development or in advisory/technical/review aspects of the DSSTox Structure-Browser implementation include: Maritja Wolf, Michael Uhl, Joseph Mack, Joseph Krahn, Joseph Retzer, Robert Kavlock, Christine Russom, Rob McCauley, Tommy Cathey, and Todd Plessel. The developers would also like to thank Peter Ertl and Novartis for making JME exit EPA publicly available, Open Eye Scientific Software exit EPA for providing an academic software license and for useful discussions regarding SMILES, Oracleexit EPA for providing the freely available JDeveloper IDE tool, and authors of the CDK Chemistry Development Kit exit EPA for providing freely available Java source code. See also Specifications.

We would also like to acknowledge collaborators who are currently using the URL link-in capability to access the DSSTox Structure Browser Main page, or to directly access the File Incidences Results page with a structure submission from their websites exit EPA:

Beth Bowden - National Toxicology Program (NTPBSI)
http://ntp-apps.niehs.nih.gov/ntp_tox/index.cfm

Antony Williams - ChemSpider, Inc
http://www.chemspider.com/

Christoph Helma - in silico toxicology, Lazar
http://lazar.in-silico.de/

To provide feedback or to report problems with the DSSTox Structure-Browser, please complete and submit the User Feedback Form. See also Disclaimers.

Return to the list above Return to Top

Disclaimers

The DSSTox Structure-Browser was developed using publicly available resources to enhance and extend the utility of the freely-downloadable DSSTox Data Files. However, the EPA DSSTox Project and Contractor-support code developers make no explicit or implied guarantees concerning the operation or results of the DSSTox Structure-Browser. The DSSTox Structure-Browser code has been reviewed within EPA's Office of Research and Development and approved for posting on the EPA DSSTox website. Approval does not signify that the contents reflect the views of the Agency, nor does mention of trade names or commercial products constitute endorsement or recommendation for use.

 

Return to the list above Return to Top


Local Navigation


Jump to main content.