Difference between revisions of "Accessing Kepler and CoRoT data"

From CoolWiki
Jump to navigationJump to search
m
m
Line 65: Line 65:
 
Here is a screenshot of the Kepler search page:
 
Here is a screenshot of the Kepler search page:
 
[[image:nsted2.gif]]
 
[[image:nsted2.gif]]
 +
 +
Note that you can search with a wide variety of constraints, like color, dispersion, etc.:
 +
[[image:nsted3.gif]]

Revision as of 05:31, 11 August 2010

Based on materials developed by Peter Plavchan, NStED/IPAC

File Formats

First, a necessary aside on file formats. Modern time-series data sets are served in one of two formats:

  • Binary FITS Tables
    • Pros:
      • Standard data format
      • Compact file size
      • Standard headers (FITS keywords)
      • Supports inclusion of ancillary information
    • Cons:
      • Difficult to work with other visualization and manipulation tools -- e.g., how to plot, load into Excel?
      • Fixed precision in data values – float or double – vs. formatting specified in somewhat obscure keywords
      • 64-bit vs. 32-bit wonkiness from heritage tools (e.g. long doubles) (e.g., some tools assume 64-bit numbers and some assume 32-bit)
      • Real life example: Original Kepler public data release introduced machine precision errors into time and position values larger than the other noise sources. (e.g., the real number should have been, say, 6.0045123056 and what got written to the file was shorter, say, 6.00451 -- that truncation introduced errors into the data that were much larger than any other noise source).
  • ASCII
    • Pros:
      • (relatively) Easy to apply multiple tools -- IDL, Excel, mathematica, other plotting utilities can easily grab these files
      • Precision tailored to data set -- if you want 6.0045123056 then you write 6.0045123056.
      • Easier to directly investigate data values with simple file viewers
    • Cons:
      • Lack of standardized formatting -- e.g. tab vs. space vs. comma vs. fixed width, etc.
      • Lack of standardized headers -- e.g. NStED IPAC ASCII keywords and formatting
      • Larger file sizes

Kepler data

MAST is the official archive for Kepler. MAST=Multimission Archive at STScI, http://archive.stsci.edu This serves FITS light curves.

NStED also serves Kepler data. NStED=NASA/IPAC/NExScI Star and Exoplanet Database, http://nsted.ipac.caltech.edu This serves FITS and ASCII light curves. We also have a "value-added" statistics and periodogram tool (e.g., this is something you can't get from MAST)

MAST

Here is a screenshot -- there are three different ways to find Kepler data, which are circled in orange here: Mast1.gif

When you get to the Kepler archive, these are the relevant files (circled in orange) that you want: Mast2.gif

e.g., the interactive data search, the target search, the public light curves, and the data release notes.

You can download all the public light curves at once in one GIGANTIC tarfile -- like 11 GB.

Their search form looks like this: Mast3.gif

And the search results look like this (the buttons circled in orange are the ones you want): Mast4.gif

This is what happens when you ask it to plot up that light curve: Mast5.gif

You can then choose to download the data; that screen looks like this: Mast6.gif

NStED

Unsurprisingly (because we developed it), we like this option better. :) We think it's easier to understand what is going on.

Here is a screenshot of the search page: Nsted1.gif

Of course, for Kepler data, hit the Kepler button.

Here is a screenshot of the Kepler search page: Nsted2.gif

Note that you can search with a wide variety of constraints, like color, dispersion, etc.: Nsted3.gif