Latest revision as of 17:59, 13 December 2011

Based on materials originally developed by Peter Plavchan (NExScI/IPAC/Caltech) for the 2010 Sagan Exoplanet Workshop. This group project was entitled "Identifying and Classifying Variables in the Kepler Data Set."

NOTE: this page is still under construction

Introduction

Every star has a story, told by its light curve. How do you listen to 150,000 stories at the same time? How can you (easily and accurately) tell which stories are interesting? In the figures below, one is a transit, one is noise, and one is indicative of stellar rotation, but they all have similar statistical properties. How can you tell without looking at each one by hand?

The goals for this exercise are to learn about variability statistics, periodograms, machine learning classification schemes, and the utility of ancillary information.

Time domain astronomy

All of spatially unresolved astronomy can be thought of as the exploration of a two-dimensional space: frequency and time. Astronomers have learned a lot from exploring the frequency domain, but have under-sampled the time domain in sample size, wavelength coverage, and cadence. The future of time-domain astronomy is bountiful: LSST, Pan-STARRS, Kepler, CoRoT, YSOVAR, radial velocities, ASTrO/TESS/PLATO, etc.

Every star is variable at some level. Once you have a variability time-scale, you have clues to its physical mechanism.

Astronomers trip over themselves in the time-domain, because the analysis can be tricky. The signatures of non-ideal observations in time-series data are often detectable, often un-avoidable, and always annoying. For example:

CoRoT unavoidably moves into Earth's shadow
When observing from ground-based telescopes, your object rises and sets (or at least the Sun rises), introducing 1-day period aliasing
Seeing variations and blends, as well as simply a source rising (being seen through different airmasses) may introduce low-frequency noise. And, if your PSF encompasses more than one object, which one is varying? Or are they both varying?
Systematic sources of variability can dominate and/or mask intrinsic variability in a survey. Watch out for the dreaded “red noise”! (e.g., low frequency noise)

The project

Use:

Kepler data set
NASA Exoplanet Archive standard variability statistics
NASA Exoplanet Archive periodogram and visualization tools
Harvard Time-Series Center classification tool
Or, “roll your own code”

To:

Define and identify a subset of ~50 interesting light curves
Identify and handle systematics
Identify important time-scales
Classify the light curves

Questions you can address by working through this project

What time-scales are seen, e.g., what physical mechanisms are of interest? Ex: eclipses, flares, rotation, asteroseismology, young stars
What are the amplitudes of variability? (and how to identify them) Ex: How can you tell the difference between a 0.1 mag amplitude sinusoid variable with a V= 9 and 0.01 mag photon noise, vs. a V=16 object with 0.1 mag photon amplitude noise?
How can you handle multi-periodic signals?
How to identify, flag and remove systematic sources of variability?
What algorithms do you use?
How do you get and prepare the data?
How can you use ancillary diagnostics to classify an object:
- Colors? (e.g. red = M star, giant or dwarf?)
- Phased light curve shapes ? (e.g. phased Fourier decomposition)
How do you scale these methods to large/multiple data sets?

@@ Line 1: / Line 1: @@
-''Based on materials originally developed by Peter Plavchan (NStED/IPAC) for the 2010 Sagan Exoplanet Workshop. This group project was entitled "Identifying and Classifying Variables in the Kepler Data Set."''
+''Based on materials originally developed by Peter Plavchan (NExScI/IPAC/Caltech) for the [http://nexsci.caltech.edu/workshop/2010/ 2010 Sagan Exoplanet Workshop]. This group project was entitled "Identifying and Classifying Variables in the Kepler Data Set."''
+'''NOTE: this page is still under construction'''
 =Introduction=
@@ Line 10: / Line 12: @@
 ==Time domain astronomy==
-All of spatially unresolved astronomy can be thought of as the exploration of a two-dimensional space:  frequency and time.
-Astronomers have learned a lot from exploring the frequency domain, but have under-sampled the time domain in sample size, wavelength coverage, and cadence.
-The future of time-domain astronomy is bountiful: LSST, Pan-STARRS, Kepler, CoRoT, YSOVAR, radial velocities, ASTrO/TESS/PLATO,  etc.
-''Every'' star is variable at some level.
-Once you have a variability time-scale, you have clues to its physical mechanism.
+All of spatially unresolved astronomy can be thought of as the exploration of a two-dimensional space:  frequency and time.  Astronomers have learned a lot from exploring the frequency domain, but have under-sampled the time domain in sample size, wavelength coverage, and cadence. The future of time-domain astronomy is bountiful: LSST, Pan-STARRS, Kepler, CoRoT, YSOVAR, radial velocities, ASTrO/TESS/PLATO,  etc.
-Astronomers trip over themselves in the time-domain, because the analysis can be tricky.
+''Every'' star is variable at some level. Once you have a variability time-scale, you have clues to its physical mechanism.
-The signatures of non-ideal observations in time-series data are often detectable, often un-avoidable, and always annoying.  For example:
+Astronomers trip over themselves in the time-domain, because the analysis can be tricky.  The signatures of non-ideal observations in time-series data are often detectable, often un-avoidable, and always annoying.  For example:
 *CoRoT unavoidably moves into Earth's shadow
 *When observing from ground-based telescopes, your object rises and sets (or at least the Sun rises), introducing 1-day period aliasing
@@ Line 31: / Line 26: @@
 Use:
 *Kepler data set
-*NStED standard variability statistics
+*NASA Exoplanet Archive standard variability statistics
-*NStED periodogram and visualization tools
+*NASA Exoplanet Archive periodogram and visualization tools
 *Harvard Time-Series Center classification tool
 *Or, “roll your own code”
@@ Line 41: / Line 36: @@
 *Classify the light curves
-==Questions to answer==
+==Questions you can address by working through this project==
 *What time-scales are seen, e.g., what physical mechanisms are of interest?  Ex: eclipses, flares, rotation, asteroseismology, young stars
 *What are the amplitudes of variability? (and how to identify them) Ex:  How can you tell the difference between a 0.1 mag amplitude sinusoid variable with a V= 9 and 0.01 mag photon noise, vs. a V=16 object with 0.1 mag photon amplitude noise?

Difference between revisions of "Classifying variables in the Kepler data set"

Latest revision as of 17:59, 13 December 2011

Contents

Introduction

Time domain astronomy

The project

Questions you can address by working through this project

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools