41
Views
1
CrossRef citations to date
0
Altmetric
Original Articles

A parallel algorithm for subset selection

, &
Pages 1-17 | Received 13 May 1996, Published online: 20 Mar 2007
 

Abstract

Prior to performing an analysis of a large data set, it is often desirable to process a subset of the data only. Current methods of subset selection choose points in a random manner, which can lead to poor solutions. The method for selection described in this paper employs the Effective Independence Distribution (EID) method that chooses observations that optimize the determinant of the information matrix. Since the method requires repeated calculations of three matrix multiplications and a matrix. inverse, it is computationally intensive for extremely large data sets. A recursive form of the EID is developed here which is suitable for parallelization. The parallel method is described in detail, and load balancing and communication issues are addressed. Implementation results on the Intel Paragon show that this is an effective parallel algorithm.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.