Title: Data mining for WDMS in SDSS DR12 archive
Authors: Jiang Bin; Ma Chunyu; Wang Wenyu; Wang Wei; Gao Jun
Author affiliations: [Jiang Bin; Ma Chunyu; Wang Wenyu; Wang Wei; Gao Jun] Shandong Univ Weihai, Sch Mech Elect & Informat Engn, Weihai, Peoples R China.
Conference: 3rd International Conference on Information Science and Control Engineering (ICISCE)
Conference dates: July 8-10, 2016
Source: 2016 3rd International Conference on Information Science and Control Engineering (ICISCE)
Keywords: Data mining; Spectra; PCA
Abstract: Data Release 12 is the final data release of SDSS-III and contains all SDSS observations. Its massive collection of spectra can be used not only for research on the structure and evolution of the Galaxy but also for multi-waveband identification. In addition, the spectra are an ideal sample for mining rare and special objects such as white dwarf main-sequence (WDMS) binaries. A WDMS binary consists of a white dwarf primary and a low-mass main-sequence companion, and such systems are valuable for studying the evolution and parameters of close binaries. In this paper, after feature extraction by PCA, a clustering approach is proposed based on the idea that cluster centers are characterized by a higher density than their neighbors and by a relatively large distance from points with higher densities. A total of 2,340 WDMS candidates are selected by the method, some of which are new discoveries, showing that our approach to finding special celestial bodies in massive spectral data is feasible.
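The two-stage idea in the abstract (PCA features, then clustering around points that are both dense and far from any denser point) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `pca_project` and `density_peaks`, the Gaussian density kernel, the cutoff `d_c`, the choice of centers by the product of density and distance, and the synthetic two-group data standing in for spectra are all assumptions made for illustration.

```python
import numpy as np

def pca_project(X, n_components):
    """Project the rows of X onto the top principal components (via SVD)."""
    Xc = X - X.mean(axis=0)
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ vt[:n_components].T

def density_peaks(Z, d_c, n_centers):
    """Cluster points in Z around density peaks.

    rho   : local density of each point (Gaussian kernel, an assumption)
    delta : distance to the nearest point of higher density
    Centers are the points with the largest rho * delta.
    """
    n = len(Z)
    # Full pairwise distance matrix: O(n^2) memory, fine for a small demo only.
    dist = np.linalg.norm(Z[:, None, :] - Z[None, :, :], axis=-1)
    # Subtract 1 to remove each point's own contribution exp(0).
    rho = np.exp(-(dist / d_c) ** 2).sum(axis=1) - 1.0

    order = np.argsort(-rho)            # indices by decreasing density
    delta = np.zeros(n)
    nearest_denser = np.full(n, -1)
    for rank, i in enumerate(order):
        if rank == 0:
            delta[i] = dist[i].max()    # convention for the densest point
            continue
        denser = order[:rank]
        j = denser[np.argmin(dist[i, denser])]
        delta[i] = dist[i, j]
        nearest_denser[i] = j

    gamma = rho * delta
    gamma[order[0]] = np.inf            # the densest point is always a center
    centers = np.argsort(-gamma)[:n_centers]

    labels = np.full(n, -1)
    labels[centers] = np.arange(n_centers)
    # Visit points from dense to sparse so the nearest denser point
    # already carries a label when it is needed.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[nearest_denser[i]]
    return rho, delta, centers, labels

# Synthetic stand-in for spectra: two well-separated groups in 10 dimensions.
rng = np.random.default_rng(0)
group_a = rng.normal(0.0, 0.3, size=(40, 10))
group_b = rng.normal(0.0, 0.3, size=(40, 10)) + 5.0
X = np.vstack([group_a, group_b])

Z = pca_project(X, n_components=2)
rho, delta, centers, labels = density_peaks(Z, d_c=1.0, n_centers=2)
```

On real spectra, the number of retained components, the cutoff `d_c`, and the number of centers would all need tuning, and the full distance matrix would have to be replaced by a neighbor search to scale to a survey-sized sample.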