Adapting Predictive Models for Cepheid Variable Star Classification Using Linear Regression and Maximum Likelihood

Kinjal Dhar Gupta; Ricardo Vilalta; Vicken Asadourian; Lucas Macri

doi:10.1017/S1743921314013775

Adapting Predictive Models for Cepheid Variable Star Classification Using Linear Regression and Maximum Likelihood

Published online by Cambridge University Press: 01 July 2015

Kinjal Dhar Gupta ,

Ricardo Vilalta ,

Vicken Asadourian and

Lucas Macri

Show author details

Kinjal Dhar Gupta: Affiliation:
Dept. of Computer Science, University of Houston.
Ricardo Vilalta: Affiliation:
Dept. of Computer Science, University of Houston.
Vicken Asadourian: Affiliation:
Dept. of Mathematics, University of Houston. 4800 Calhoun Road, Houston TX-70004, USA. email: kinjal13@cs.uh.edu, vilalta@cs.uh.edu, vmasadourian@uh.edu
Lucas Macri: Affiliation:
Dept. of Physics and Astronomy, Texas A&M University. 4242 TAMU, College Station, TX 77843-4242, USA. email: lmacri@tamu.edu

Article contents

Abstract
References

Rights & Permissions

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

We describe an approach to automate the classification of Cepheid variable stars into two subtypes according to their pulsation mode. Automating such classification is relevant to obtain a precise determination of distances to nearby galaxies, which in addition helps reduce the uncertainty in the current expansion of the universe. One main difficulty lies in the compatibility of models trained using different galaxy datasets; a model trained using a training dataset may be ineffectual on a testing set. A solution to such difficulty is to adapt predictive models across domains; this is necessary when the training and testing sets do not follow the same distribution. The gist of our methodology is to train a predictive model on a nearby galaxy (e.g., Large Magellanic Cloud), followed by a model-adaptation step to make the model operable on other nearby galaxies. We follow a parametric approach to density estimation by modeling the training data (anchor galaxy) using a mixture of linear models. We then use maximum likelihood to compute the right amount of variable displacement, until the testing data closely overlaps the training data. At that point, the model can be directly used in the testing data (target galaxy).

Keywords

(stars: variables:) Cepheids (galaxies:) Magellanic Clouds methods: statistical infrared: stars methods: data analysis

Type: Contributed Papers
Information: Proceedings of the International Astronomical Union , Volume 10 , Symposium S306: Statistical Challenges in 21st Century Cosmology , May 2014 , pp. 319 - 321

DOI: https://doi.org/10.1017/S1743921314013775 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]

References

Ben-David, S., Blitzer, J., Crammer, K., & Pereira, F. 2006, Neural Information Processing Systems 19 137–144Google Scholar

Ben-David, S., Blitzer, J., Crammer, K., Pereira, F., & Wortman, J. 2010, Machine Learning 79 151–175CrossRef Google Scholar

Faria, S. & Soromenho, G. 2010, Journal of Statistical Computation and Simulation, Vol. 80, No. 2, 201–225Google Scholar

Storkey, A. 2009, Dataset Shift in Machine Learning, MIT Press, 3-28CrossRef Google Scholar

Vilalta, R., Dhar Gupta, K., & Macri, L. 2013, Astronomy and Computing 2 46–53Google Scholar

Article contents

Adapting Predictive Models for Cepheid Variable Star Classification Using Linear Regression and Maximum Likelihood

Abstract

Keywords

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests