Sar modeling of unbalanced data sets

H. S. Rosenkranz Department of Environmental and Occupational Health, Graduate School of Public Health, University of Pittsburgh, 111 Parran Hall, 130 DeSoto Street, Pittsburgh, P A, 15261, USA

A. R. Cunningham Department of Environmental and Occupational Health, Graduate School of Public Health, University of Pittsburgh, 111 Parran Hall, 130 DeSoto Street, Pittsburgh, P A, 15261, USA

Abstract

The increased acceptance of SAR approaches to hazard identification has led us to investigate methods to improve the predictive performance of SAR models. In the present study we demonstrate that although on theoretical grounds the ratio of active to inactive chemicals in the learning set should be unity, SAR models can ‚tolerate‘ an unbalanced range in ratios from 3 : 1 (i.e., 75% actives) to 1 : 2 (i.e., 33% actives) and still perform adequately. On the other hand SAR models derived from learning sets with ratios in excess of 4 : 1 (80% actives), even when corrected for the initial ratio do not perform satisfactorily.

Key Words:

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Sar modeling of unbalanced data sets

Information for

Open access

Opportunities

Help and information

Sar modeling of unbalanced data sets

Abstract

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature