Accessing Kepler and CoRoT data

From CoolWiki
Revision as of 05:47, 11 August 2010 by Rebull (talk | contribs)
Jump to navigationJump to search

Based on materials developed by Peter Plavchan, NStED/IPAC

File Formats

First, a necessary aside on file formats. Modern time-series data sets are served in one of two formats:

  • Binary FITS Tables
    • Pros:
      • Standard data format
      • Compact file size
      • Standard headers (FITS keywords)
      • Supports inclusion of ancillary information
    • Cons:
      • Difficult to work with other visualization and manipulation tools -- e.g., how to plot, load into Excel?
      • Fixed precision in data values – float or double – vs. formatting specified in somewhat obscure keywords
      • 64-bit vs. 32-bit wonkiness from heritage tools (e.g. long doubles) (e.g., some tools assume 64-bit numbers and some assume 32-bit)
      • Real life example: Original Kepler public data release introduced machine precision errors into time and position values larger than the other noise sources. (e.g., the real number should have been, say, 6.0045123056 and what got written to the file was shorter, say, 6.00451 -- that truncation introduced errors into the data that were much larger than any other noise source).
  • ASCII
    • Pros:
      • (relatively) Easy to apply multiple tools -- IDL, Excel, mathematica, other plotting utilities can easily grab these files
      • Precision tailored to data set -- if you want 6.0045123056 then you write 6.0045123056.
      • Easier to directly investigate data values with simple file viewers
    • Cons:
      • Lack of standardized formatting -- e.g. tab vs. space vs. comma vs. fixed width, etc.
      • Lack of standardized headers -- e.g. NStED IPAC ASCII keywords and formatting
      • Larger file sizes

Kepler data

Kepler stares at one field continuously. It produces light curves at a specific cadence for selected objects in its field of view.

MAST is the official archive for Kepler. MAST=Multimission Archive at STScI, http://archive.stsci.edu This serves FITS light curves.

NStED also serves Kepler data. NStED=NASA/IPAC/NExScI Star and Exoplanet Database, http://nsted.ipac.caltech.edu This serves FITS and ASCII light curves. We also have a "value-added" statistics and periodogram tool (e.g., this is something you can't get from MAST)

MAST

Here is a screenshot -- there are three different ways to find Kepler data, which are circled in orange here: Mast1.gif

When you get to the Kepler archive, these are the relevant files (circled in orange) that you want: Mast2.gif

e.g., the interactive data search, the target search, the public light curves, and the data release notes.

You can download all the public light curves at once in one GIGANTIC tarfile -- like 11 GB.

Their search form looks like this: Mast3.gif

And the search results look like this (the buttons circled in orange are the ones you want): Mast4.gif

This is what happens when you ask it to plot up that light curve: Mast5.gif

You can then choose to download the data; that screen looks like this: Mast6.gif

NStED

Unsurprisingly (because we developed it), we like this option better. :) We think it's easier to understand what is going on.

Here is a screenshot of the search page: Nsted1.gif

Of course, for Kepler data, hit the Kepler button.

Here is a screenshot of the Kepler search page: Nsted2.gif

Note that you can search with a wide variety of constraints, like color, dispersion, etc.: Nsted3.gif

Here is an example of the search results. The most important things in this figure have an orange box around them. Nsted4.gif

This is an example of the light curve viewer. In this figure, the orange box indicates where to go to download the light curve itself: Nsted5.gif

NStED also allows you to compute a periodogram from the light curve. Here is an example of what happens when you do that. The orange boxes in this figure indicate how to download the results of the periodogram analysis: Nsted6.gif


CoRoT data

CoRoT stares at a field for a long periods of time (up to 6 months), and then picks another field to observe.

CoRoT has two channels -- Asteroseismology (~10 targets/run) & Exoplanet (~10k targets/run) Exoplanet light curves have two categories -- CHRomatic & MONochromatic, in separate binary FITS table files Asteroseismology light curves come in three flavors --RAW, HELiocentric, & HELiocentric with REGularized time-sampling, all in one binary FITS file with 3 table extensions.

NStED is the official US portal for CoRoT data -- http://nsted.ipac.caltech.edu -- It serves FITS light curves (ASCII coming soon).

IAS Data and Operations Center also serves the data, but in binary FITS table format -- http://idoc-corotn2-public.ias.u-psud.fr/index.jsp

Finally, there is also the CoRoT archive at LAEFF (in Spain); they serve binary FITS tables & ASCII format -- http://sdc.laeff.inta.es/corotfa/jsp/frontpage.jsp