Accessing Kepler and CoRoT data
Based on materials developed by Peter Plavchan, NStED/IPAC
File Formats
First, a necessary aside on file formats. Modern time-series data sets are served in one of two formats:
- Binary FITS Tables
- Pros:
- Standard data format
- Compact file size
- Standard headers (FITS keywords)
- Supports inclusion of ancillary information
- Cons:
- Difficult to work with other visualization and manipulation tools -- e.g., how to plot, load into Excel?
- Fixed precision in data values – float or double – vs. formatting specified in somewhat obscure keywords
- 64-bit vs. 32-bit wonkiness from heritage tools (e.g. long doubles) (e.g., some tools assume 64-bit numbers and some assume 32-bit)
- Real life example: Original Kepler public data release introduced machine precision errors into time and position values larger than the other noise sources. (e.g., the real number should have been, say, 6.0045123056 and what got written to the file was shorter, say, 6.00451 -- that truncation introduced errors into the data that were much larger than any other noise source).
- Pros:
- ASCII
- Pros:
- (relatively) Easy to apply multiple tools -- IDL, Excel, mathematica, other plotting utilities can easily grab these files
- Precision tailored to data set -- if you want 6.0045123056 then you write 6.0045123056.
- Easier to directly investigate data values with simple file viewers
- Cons:
- Lack of standardized formatting -- e.g. tab vs. space vs. comma vs. fixed width, etc.
- Lack of standardized headers -- e.g. NStED IPAC ASCII keywords and formatting
- Larger file sizes
- Pros:
Kepler data
MAST is the official archive for Kepler. MAST=Multimission Archive at STScI, http://archive.stsci.edu This serves FITS light curves.
NStED also serves Kepler data. NStED=NASA/IPAC/NExScI Star and Exoplanet Database, http://nsted.ipac.caltech.edu This serves FITS and ASCII light curves. We also have a "value-added" statistics and periodogram tool (e.g., this is something you can't get from MAST)
MAST
Here is a screenshot -- there are three different ways to find Kepler data, which are circled in orange here:
When you get to the Kepler archive, these are the relevant files (circled in orange) that you want:
e.g., the interactive data search, the target search, the public light curves, and the data release notes.
You can download all the public light curves at once in one GIGANTIC tarfile -- like 11 GB.