The Java package Data-To-Knowledge, available from NCSA, is being used as the basis of a knowledge discovery in databases (KDD) approach to classifying objects in the SDSS Data Release 3, as part of the Laboratory for Cosmological Data Mining at the Department of Astronomy and NCSA. Decision trees are the current algorithm of choice.
The combination of supercomputing resources allocated through peer review on the Linux cluster tungsten, the expertise of the Automated Learning Group at NCSA, and local expertise in handling large data sets places us in a very strong position to carry out this type of analysis.
The current results are presented in Ball et al. 2006 (ApJ 650 497), also available in preprint form as astro-ph/0606541. There is also a poster, which was displayed at the 208th meeting of the American Astronomical Society in June 2006.
Last modified: Mar 27th 2008