130
Views
2
CrossRef citations to date
0
Altmetric
Special Section: Massive Datasets

Cataloging and Mining Massive Datasets for Science Data Analysis

&
Pages 589-610 | Received 01 Nov 1998, Published online: 21 Feb 2012

References

  • Aubele , J. C. and Slyuta , E. N. 1990 . “Small Domes on Venus: Characteristics and Origins” . Earth, Moon and Planets , 50/51 : 493 – 532 .
  • Barclay , T. , Eberl , R. , Gray , J. , Nordlinger , J. , Raghavendran , G. , Slutz , D. , Smith , G. , Smoot , P. , Hoffman , J. , Robb , N. III , Rossmeissl , H. , Duff , B. , Lee , G. , Mathesmier , T. , Sunne , R. , Stivers , L. and Goodman , K. 1998 . “The Microsoft TerraServer” , Redmond , WA : Microsoft . Microsoft Research Report MSR-TR-98-17
  • Blender , R. , Fraedrich , K. and Lunkeit , F. 1997 . “Identification of Cyclone Track Regimes in the North Atlantic” . Quarterly Journal of the Royal Meteorological Society , 123 : 727 – 741 .
  • Bradley , P. and Fayyad , U. “Refining Initial Points for K-Means Clustering” . Proceedings of the 15th International Conference on Machine Learning . pp. 91 – 99 . Morgan Kaufmann .
  • Bradley , P. , Fayyad , U. and Reina , C. “Scaling Clustering Algorithms to Large Databases” . Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD98) . pp. 9 – 15 . Menlo Park , CA : AAAI Press .
  • Bradley , P. , Fayyad , U. and Reina , C. 1998b . “Scaling EM (Expectation Maximization) Clustering to Large Databases” . In Microsoft Research Technical Report MSR-TR-98–35 , Redmond , WA : Microsoft .
  • Breiman , L. , Friedman , J. H. , Olshen , R. A. and Stone , C. J. 1984 . Classification and Regression Trees , Monterey , CA : Wadsworth & Brooks .
  • Brodley , C. and Smyth , P. 1997 . “Applying Classification Algorithms in Practice” . Statistics and Computing , 7 : 45 – 56 .
  • Burl , M. C. , Fayyad , U. M. , Perona , P. , Smyth , P. and Burl , M. P. “Automating the Hunt for Volcanoes on Venus” . Proceedings of the 1994 Computer Vision and Pattern Recognition Conference, CVPR-94 . pp. 302 – 309 . Los Alamitos , CA : IEEE Computer Society Press .
  • Burl , M. C. , Asker , L. , Smyth , P. , Fayyad , U. M. , Perona , Crumpler P.L. and Aubele , J. 1998 . “Learning to Recognize Volcanoes on Venus” . Machine Learning , 30 : 165 – 194 .
  • Burl , M. C. , Weber , M. , Leung , T. K. and Perona , P. “Recognition of Visual Object Classes” . In From Segmentation to Interpretation and Back: Mathematical Methods in Computer Vision , Springer-Verlag . (in press)
  • Cattermole , P. 1994 . Venus: The Geological Story , Baltimore , MD : Johns Hopkins University Press .
  • Chaudhuri , S. and Dayal , U. 1997 . “An Overview of Data Warehousing and OLAP Technology” , Association for Computing Machinery . ACM SIGMOD RECORD March 1997 issue
  • Chaudhuri , S. , Fayyad , U. M. and Bernhardt , J. “Scalable Classification over SQL Databases” . Proceedings of the International Conference on Data Engineering (ICDE-99) . Los Alamitos , CA : IEEE Press .
  • Cheng , X. and Wallace , J. M. 1993 . “Cluster Analysis of the Northern Hemisphere Wintertime 500-hPa Height Field: Spatial Patterns” . Journal of the Atmospheric Sciences , 50 : 2674 – 2696 .
  • Christensen , G. E. , Rabbitt , R. D. and Miller , M. I. 1994 . “3D Brain Mapping Using a Deformable Neuroanatomy” . Physics in Medicine and Biology , 39 : 609 – 618 .
  • Djorgovski , S. G. , Weir , N. and Fayyad , U. M. “Processing and Analysis of the Palomar—STScI Digital Sky Survey Using a Novel Software Technology” . Astronomical Data Analysis Software and Systems III, A.S.P. Conference Series . Edited by: Crabtree , D. , Hanisch , R. and Barnes , J. Vol. 61 , pp. 195
  • Dryden , I. L. and Mardia , K. V. 1998 . Statistical Shape Analysis , New York : Wiley .
  • Duda , R. O. and Hart , P. E. 1973 . Pattern Classification and Scene Analysis , New York : Wiley .
  • Fasman , K. H. , Cuticchia , A. J. and Kingsbury , D. T. 1994 . “The GDB Human Genome Database Anno 1994” . Nucleic Acid Research , 22 : 3462 – 3469 .
  • Fayyad , U. M. 1991 . On the Induction of Decision trees for Multiple Concept Learning , Ph.D. Dissertation Ann Arbor : The University of Michigan .
  • Fayyad , U. M. “Branching on Attribute Values in Decision Tree Generation” . Proceedings of the Twelfth National Conference on Artificial Intelligence AAAI-94 . pp. 601 – 606 . Cambridge , MA : MIT Press .
  • Fayyad , U. M. and Irani , K. B. “The Attribute Selection Problem in Decision Tree generation” . Proceedings of the Tenth National Conference on Artificial Intelligence . pp. 104 – 110 . Menlo Park , CA : AAAI Press .
  • Fayyad , U. M. and Irani , K. B. “Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning” . Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence . pp. 1022 – 1027 . San Mateo , CA : Morgan Kaufmann .
  • Fayyad , U. M. and Stolorz , P. 1997 . “Data Mining and KDD: Promise and Challenges” . Future Generation Computer Systems , 13 : 99 – 115 .
  • Fayyad , U. M. , Djorgovski , S. G. and Weir , N. 1996 . “Automating Analysis and Cataloging of Sky Surveys” . In Advances in Knowledge Discovery and Data Mining , Edited by: Fayyad , U. , Piatetsky , G. , Shapiro-Smyth , P. and Uthurusamy , R. 471 – 494 . Cambridge , MA : MIT Press .
  • Fayyad , U. M. , Piatetsky-Shapiro , G. and Smyth , P. 1996 . “From Data Mining to Knowledge Discovery: An Overview” . In Advances in Knowledge Discovery and Data Mining , Edited by: Fayyad , U. , Piatetsky , G. , Shapiro , Smyth P. and Uthurusamy , R. 1 – 36 . Cambridge , MA : MIT Press .
  • Fayyad , U. M. , Weir , N. and Djorgovski , S. G. “SKICAT: A Machine Learning System for Automated Cataloging of Large Scale Sky Surveys” . Proceedings of the Tenth International Conference on Machine Learning . pp. 112 – 119 . San Mateo , CA : Morgan Kaufmann .
  • Gaffney , S. and Smyth , P. “Trajectory Clustering Using Mixtures of Regression Models” . Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . pp. 63 – 72 . ACM Press .
  • Geman , S. , Bienenstock , E. and Doursat , R. 1992 . “Neural Networks and the Bias-Variance Dilemma” . Neural Computation , 4 : 1 – 58 .
  • Glymour , C. , Madigan , D. , Pregibon , D. and Smyth , P. 1997 . “Statistical Themes and Lessons for Data Mining” . Data Mining and Knowledge Discovery , : 1
  • Graefe , G. , Fayyad , U. M. and Chaudhuri , S. “On the Efficient Gathering of Sufficient Statistics for Classification from Large SQL Databases” . Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining . pp. 204 – 208 . Menlo Park , CA : AAAI Press .
  • Guest , J. E. 1992 . “Small Volcanic Edifices and Volcanism in the Plains of Venus” . Journal of Geophysical Research , 97 : 15949 – 15966 .
  • Hand , D. J. 1994 . “Deconstructing Statistical Questions” . Journal of the Royal Statistical Society , 157 : 317 – 356 . Ser. A
  • Jarvis , J. and Tyson , A. 1981 . “FOCAS: Faint Object Classification and Analysis System” . Astronomical Journal , 86 : 476
  • Kennefick , J. D. , de Carvalho , R. R. , Djorgovski , S. G. , Wilber , M. M. , Dickson , E. S. , Weir , N. , Fayyad , U. M. and Roden , J. 1995 . “The Discovery of Five Quasars at z>4 Using the Second Palomar Sky Survey” . Astronomical Journal , 110 : 78 – 86 .
  • 1992. . Magellan at Venus: Special Issue of the Journal of Geophysical Research , American Geophysical Union .
  • 1998 . NSSDC News , December available online at http://nssdc.gsfc.nasa.gov/nssdc-news
  • Quinlan , J. R. 1986 . “The Induction of Decision Trees” . Machine Learning , : 1
  • Saitta , L. and Neri , F. 1998 . “Learning in the Real World” . Machine Learning , 30 : 133 – 163 .
  • Saunders , R. S. 1992 . “Magellan Mission Summary” . Journal of Geophysical Research , 97 ( E8 ) : 13067 – 13090 .
  • 1991 . Science , April 12 special issue on Magellan data
  • Shafer , J. C. , Agrawal , R. and Mehta , M. “SPRINT: A Scalable Parallel Classifier for Data Mining” . Proceedings of the 22nd international Conference on Very Large Databases (VLDB-96) . San Francisco , CA : Morgan Kaufmann .
  • Smyth , P. 1996 . “Bounds on the Mean Classification Error Rate of Multiple Experts” . Pattern Recognition Letters , 17 : 1253 – 1257 .
  • Smyth , P. , Burl , M. C. , Fayyad , U. M. and Perona , P. 1996 . “Knowledge Discovery in Large Image Databases: Dealing With Uncertainties in Ground Truth” . In Advances in Knowledge Discovery and Data Mining , Edited by: Fayyad , U. M. , Piatetsky-Shapiro , G. , Smyth , P. and Uthurasamy , R. 517 – 539 . Cambridge , MA : MIT Press .
  • Smyth , P. , Ide , K. and Ghil , M. “Multiple Regimes in Northern Hemisphere Height Fields via Mixture Model Clustering” . Journal of the Atmospheric Sciences , (in press)
  • Stolorz , P. “Fast Spatio-Temporal Data Mining of Large Geophysical Datasets” . Proceedings of the First International Conference on Knowledge Discovery and Data Mining . Edited by: Fayyad , U. M. and Uthurasamy , R. pp. 300 – 305 . Menlo Park , CA : AAAI Press .
  • Stolorz , P. and Cheeseman , P. 1998 . “Onboard Science Data Analysis: Applying Data Mining to Science-Directed Autonomy” . IEEE Intelligent Systems , : 62 – 68 . September/October 1998
  • Turmon , M. “Identification of Solar Features via Markov Random Fields” . Proceedings Second Meeting of International Association for Statistical Computing (IASC-2) .
  • Turmon , M. , Pap , J. and Mukhtar , S. “Bayesian Inference for Identifying Solar Active Regions” . Proceedings Third International Conference on Knowledge Discovery and Data Mining . pp. 267 – 270 . Menlo Park , CA : AAAI Press .
  • Turmon , M. and Mukhtar , S. “Representing Solar Active Regions With Triangulations” . Proceedings in Computational Statistics, COMPSTAT-98 . Edited by: Payne , R. and Green , P. pp. 473 – 478 . Vienna : Physica-Verlag .
  • Uebersax , J. S. 1993 . “Statistical Modeling of Expert Ratings on Medical Treatment Appropriateness” . Journal of the American Statistical Association , 88 : 421 – 427 .
  • Valdes , F. 1982 . “The Resolution Classifier” . In Instrumentation in Astronomy IV , vol. 331 , 465 Bellingham , WA : SPIE .
  • Valdes-Perez , R. E. 1999 . “Principles of Human Computer Collaboration for Knowledge Discovery in Science” . Artificial Intelligence , 107 : 335 – 346 .
  • Weir , N. , Djorgovski , S. G. and Fayyad , U. M. 1995 . “Initial Galaxy Counts From Digitized POSS-II” . The Astronomical Journal , 110 : 1 – 20 .
  • Weir , N. , Fayyad , U. M. and Djorgovski , S. G. 1995 . “Automated Star/Galaxy Classification for Digitized POSS-II” . The Astronomical Journal , 109 : 2401 – 2414 .
  • Wilks , D. S. 1995 . Statistical Methods in the Atmospheric Sciences , San Diego : Academic Press .
  • Wilson , G. S. and Backlund , P. W. 1992 . “Mission to Planet Earth” . Photo. Eng. Rem. Sens. , 58 : 1133 – 1135 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.