Identification of Interesting Objects in Large Spectral Surveys Using Highly Parallelized Machine Learning

Petr Škoda; Andrej Palička; Jakub Koza; Ksenia Shakurova

doi:10.1017/S1743921317000047

Identification of Interesting Objects in Large Spectral Surveys Using Highly Parallelized Machine Learning

Published online by Cambridge University Press: 30 May 2017

Jakub Koza and

Petr Škoda: Affiliation:
Astronomical Institute of the Czech Academy of Sciences, Fričova 298, 251 65 Ondřejov, Czech Republic email: skoda@sunstel.asu.cas.cz
Andrej Palička: Affiliation:
Faculty of Information Technology, Czech Technical University in Prague, Thákurova 9, 160 00 Prague 6, Czech Republic
Jakub Koza: Affiliation:
Faculty of Information Technology, Czech Technical University in Prague, Thákurova 9, 160 00 Prague 6, Czech Republic
Ksenia Shakurova: Affiliation:
Faculty of Information Technology, Czech Technical University in Prague, Thákurova 9, 160 00 Prague 6, Czech Republic

Article contents

Abstract
References

Rights & Permissions

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

The current archives of LAMOST multi-object spectrograph contain millions of fully reduced spectra, from which the automatic pipelines have produced catalogues of many parameters of individual objects, including their approximate spectral classification. This is, however, mostly based on the global shape of the whole spectrum and on integral properties of spectra in given bandpasses, namely presence and equivalent width of prominent spectral lines, while for identification of some interesting object types (e.g. Be stars or quasars) the detailed shape of only a few lines is crucial. Here the machine learning is bringing a new methodology capable of improving the reliability of classification of such objects even in boundary cases.

We present results of Spark-based semi-supervised machine learning of LAMOST spectra attempting to automatically identify the single and double-peak emission of Hα line typical for Be and B[e] stars. The labelled sample was obtained from archive of 2m Perek telescope at Ondřejov observatory. A simple physical model of spectrograph resolution was used in domain adaptation to LAMOST training domain. The resulting list of candidates contains dozens of Be stars (some are likely yet unknown), but also a bunch of interesting objects resembling spectra of quasars and even blazars, as well as many instrumental artefacts. The verification of a nature of interesting candidates benefited considerably from cross-matching and visualisation in the Virtual Observatory environment.

Keywords

stars: emission-line Be surveys methods: statistical techniques: spectroscopic

Type: Contributed Papers
Information: Proceedings of the International Astronomical Union , Volume 12 , Symposium S325: Astroinformatics , October 2016 , pp. 180 - 185

DOI: https://doi.org/10.1017/S1743921317000047 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]

References

Chapelle, O., Schölkopf, B., & Zien, A. 2006, Semi-supervised learning, MIT press, Cambridge, Massachusetts Google Scholar

Cui, X. Q., et al. 2012, Research in Astronomy and Astrophysics, 12, 1197 Google Scholar

Luo, A. L., et al. 2015, Research in Astronomy and Astrophysics, 15, 1095 CrossRef Google Scholar

Nandrekar-Heinis, D., Michel, L., Louys, M., & Bonnarel, F. 2014, Astronomy and Computing, 7, 37 Google Scholar

Palička, A. 2016, Master Thesis, Czech Technical University in Prague, Faculty of ITGoogle Scholar

Porter, J. M. & Rivinius, T. 2003, PASP, 115, 1153 CrossRef Google Scholar

Shakurova, K. 2016, Master Thesis, Czech Technical University in Prague, Faculty of ITGoogle Scholar

Silaj, J., Jones, C. E., Tycner, C., Sigut, T. A. A., & Smith, A. D. 2010, ApJS, 187, 228 Google Scholar

Tody, D. et al. 2012, IVOA Recommendation: Simple Spectral Access Protocol Version 1.1, ArXiv:1203.5725Google Scholar

Zickgraf, F.-J. 2003, A&A, 408, 257 Google Scholar

Article contents

Identification of Interesting Objects in Large Spectral Surveys Using Highly Parallelized Machine Learning

Abstract

Keywords

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests