Source code for special.chi

#! /usr/bin/env python

"""
Function defining the goodness of fit.

.. [OLO16]
   | Olofsson et al. 2016
   | **Azimuthal asymmetries in the debris disk around HD 61005. A massive collision of planetesimals?**
   | *Astronomy & Astrophysics, Volume 591, p. 108*
   | `https://arxiv.org/abs/1601.07861
     <https://arxiv.org/abs/1601.07861>`_

.. [GRE16]
   | Greco & Brandt 2016
   | **The Measurement, Treatment, and Impact of Spectral Covariance and 
     Bayesian Priors in Integral-field Spectroscopy of Exoplanets**
   | *The Astrophysical Journal, Volume 833, Issue 1, p. 134*
   | `https://arxiv.org/abs/1602.00691
     <https://arxiv.org/abs/1602.00691>`_
     
"""

__author__ = 'Valentin Christiaens'
__all__ = ['goodness_of_fit',
           'gof_scal']

import numpy as np
from matplotlib import pyplot as plt
from .config import figsize
from .model_resampling import resample_model
from .utils_spec import extinction

[docs]def goodness_of_fit(lbda_obs, spec_obs, err_obs, lbda_mod, spec_mod, dlbda_obs=None, instru_corr=None, instru_res=None, instru_idx=None, use_weights=True, filter_reader=None, plot=False, outfile=None): """ Function to estimate the goodness of fit indicator defined as in [OLO16]_ (Eq. 8). In addition, if a spectral correlation matrix is provided, it is used to take into account the correlated noise between spectral channels (see [GRE16]_). The goodness of fit indicator is identical to a :math:`\chi^2` if all points are obtained with the same instrument (or ``use_weights`` set to False) and in absence of spectrally correlated noise. Parameters ---------- lbda_obs : numpy 1d ndarray or list Wavelengths of observed spectrum. If several instruments were used, the wavelengths should be ordered per instrument, not necessarily as monotonically increasing wavelength. Hereafter, :math:`n_{ch}` is the length of ``lbda_obs``. spec_obs : numpy 1d ndarray or list Observed spectrum for each value of ``lbda_obs``. Should have a length of :math:`n_{ch}`. err_obs : numpy 1d/2d ndarray or list Uncertainties on the observed spectrum. The array (list) can have either a length of :math:`n_{ch}`, or a shape of :math:`(2,n_{ch})` for lower (first column) and upper (second column) uncertainties provided. lbda_mod : numpy 1d ndarray or list Wavelengths of tested model. Should have a wider (or equal) wavelength extent than the observed spectrum. spec_mod : numpy 1d ndarray Model spectrum. It does not require the same wavelength sampling as the observed spectrum. If higher spectral resolution, it will be convolved with the instrumental spectral PSF (if ``instru_res`` is provided) and then binned to the same sampling. If lower spectral resolution, a linear interpolation is performed to infer the value at the observed spectrum wavelength sampling. dlbda_obs : numpy 1d ndarray or list, optional Respective spectral channel width or FWHM of the photometric filters used for each point of the observed spectrum. This vector is used to infer which part(s) of a combined spectro+photometric spectrum should involve convolution+subsampling (model resolution higher than measurements), interpolation (the opposite), or convolution by the transmission curve of a photometric filter. If not provided, it will be inferred from the difference between consecutive lbda_obs points (i.e. inaccurate for a combined spectrum). It must be provided IF one wants to weigh each measurement based on the spectral resolution of each instrument (as in [OLO16]_), through the ``use_weights`` argument. instru_corr : numpy 2d ndarray, optional Spectral correlation between post-processed images in which the spectrum is measured. It is specific to the instrument, PSF subtraction algorithm and radial separation of the companion from the central star. Can be computed using ``special.spec_corr.spectral_correlation``. In case of a spectrum obtained with different instruments, it is recommended to construct the final spectral_correlation matrix with ``special.spec_corr.combine_corrs``. If ``instru_corr`` is not provided, the uncertainties in each spectral channel will be considered independent. See [GRE16]_ for more details. instru_res : float or list of floats/strings, optional The mean instrumental resolving power(s) OR filter names. This is used to convolve the model spectrum. If several instruments are used, provide a list of resolving power values / filter names, one for each instrument used. instru_idx: numpy 1d array, optional 1d array containing an index representing each instrument used to obtain the spectrum, label them from 0 to the number of instruments (:math:`n_{ins}`). Zero for points that don't correspond to any of the ``instru_res`` values provided, and i in :math:`[1,n_{ins}]` for points associated to instru_res[i-1]. This parameter must be provided if the spectrum consists of points obtained with different instruments. use_weights: bool, optional For the likelihood calculation, whether to weigh each point of the spectrum based on the spectral resolution or bandwidth of photometric filters used. Weights will be proportional to ``dlbda_obs/lbda_obs`` if ``dlbda_obs`` is provided, or set to 1 for all points otherwise. filter_reader: python routine, optional External routine that reads a filter file and returns a 2D numpy array, where the first column corresponds to wavelengths, and the second contains transmission values. Important: if not provided, but strings are detected in instru_res, the default file reader will be used. It assumes the following format for the files: - first row contains headers (titles of each column) - starting from 2nd row: 1st column: wavelength, 2nd col.: transmission - Unit of wavelength can be provided in parentheses of first header \ key name: e.g. "WL(AA)" for angstrom, "wavelength(mu)" for micrometer \ or "lambda(nm)" for nanometer. Note: only what is in parentheses \ matters for the units. plot : bool, optional Whether to plot the measured spectrum and the model spectrum. outfile : string, optional Path+filename for the plot to be saved (won't be saved if not provided). Returns ------- chi_sq : float Goodness of fit indicator. """ lbda_obs = np.array(lbda_obs) spec_obs = np.array(spec_obs) err_obs = np.array(err_obs) lbda_mod = np.array(lbda_mod) spec_mod = np.array(spec_mod) if lbda_obs.ndim != 1 or spec_obs.ndim != 1: raise TypeError('Observed lbda and spec must be lists or 1d arrays') if lbda_obs.shape[0] != spec_obs.shape[0]: raise TypeError('Obs lbda and spec need same shape') if lbda_obs.shape[0] != err_obs.shape[-1]: raise TypeError('Obs lbda and err need same shape') if lbda_mod.ndim != 1 or spec_mod.ndim != 1: raise TypeError('The input model lbda/spec must be lists or 1d arrays') else: if lbda_mod.shape != spec_mod.shape: raise TypeError('The input model lbda and spec need same shape') if dlbda_obs is not None: if isinstance(dlbda_obs, list): dlbda_obs = np.array(dlbda_obs) if lbda_obs.shape != dlbda_obs.shape: raise TypeError('The input lbda_obs and dlbda_obs need same shape') n_ch = lbda_obs.shape[0] # interpolate OR convolve+bin model spectrum if not same sampling if len(lbda_obs) != len(lbda_mod): _,spec_mod_fin = resample_model(lbda_obs, lbda_mod, spec_mod, dlbda_obs, instru_res, instru_idx, filter_reader) elif not np.allclose(lbda_obs, lbda_mod): _,spec_mod_fin = resample_model(lbda_obs, lbda_mod, spec_mod, dlbda_obs, instru_res, instru_idx, filter_reader) else: spec_mod_fin = spec_mod # compute covariance matrix if err_obs.ndim == 2: err_obs = (np.absolute(err_obs[0])+np.absolute(err_obs[1]))/2. if instru_corr is None: instru_corr = np.diag(np.ones(n_ch)) cov = np.ones_like(instru_corr) for ii in range(n_ch): for jj in range(n_ch): cov[ii,jj] = instru_corr[ii,jj]*err_obs[ii]*err_obs[jj] # replace potential NaN values nnan_idx = np.where(np.isfinite(spec_mod_fin)) if len(nnan_idx[0])>0: cov_crop = [] for i in range(len(lbda_obs)): if i in nnan_idx[0]: tmp = cov[i] cov_crop.append(tmp[nnan_idx]) cov = np.array(cov_crop) spec_obs = spec_obs[nnan_idx] spec_mod_fin = spec_mod_fin[nnan_idx] if dlbda_obs is not None: dlbda_obs = dlbda_obs[nnan_idx] lbda_obs = lbda_obs[nnan_idx] delta = spec_obs-spec_mod_fin wi = np.ones_like(lbda_obs) if use_weights and dlbda_obs is not None: if np.sum(np.power((dlbda_obs[1:]/lbda_obs[1:])-(dlbda_obs[:-1]/lbda_obs[:-1]),2))!=0: # normalize weights for their sum to be equal to the number of points wi = np.sqrt(((dlbda_obs/lbda_obs)/np.sum(dlbda_obs/lbda_obs))*dlbda_obs.shape[0]) chi_sq = np.linalg.multi_dot((wi*delta,np.linalg.inv(cov),wi*delta)) if plot: _, ax = plt.subplots(figsize=figsize) ax.plot(lbda_obs, lbda_obs*spec_obs, 'o', alpha=0.6, color='#1f77b4', label="Measured spectrum") ax.plot(lbda_obs, lbda_mod*spec_mod, '-', alpha=0.4, color='#1f77b4', label="Model") plt.xlabel('Wavelength') plt.ylabel(r'Flux density ($\lambda F_{\lambda}$)') plt.xlim(xmin=0.9*lbda_obs[0], xmax=1.1*lbda_obs[-1]) plt.minorticks_on() plt.legend(fancybox=True, framealpha=0.5, fontsize=12, loc='best') plt.grid(which='major', alpha=0.2) if outfile: plt.savefig(outfile, bbox_inches='tight') plt.show() return chi_sq
[docs]def gof_scal(params, lbda_obs, spec_obs, err_obs, lbda_tmp, spec_tmp, dlbda_obs, instru_corr, instru_res, instru_idx, use_weights, filter_reader, ext_range): """ Wrapper of routine ``special.chi.goodness_of_fit`` for the goodness of fit used to search for the best-fit library template spectrum to an observed spectrum. It has a ``params`` argument to consider the scaling factor and optionnally the differential extinction as free parameter(s). Parameters ---------- params: tuple Tuple of 1 or 2 elements: scaling factor and (optionally) differential optical extinction :math:`\Delta A_V` (:math:`\Delta A_V` can be negative if template spectra are not dereddened). lbda_obs : numpy 1d ndarray or list Wavelengths of observed spectrum. If several instruments were used, the wavelengths should be ordered per instrument, not necessarily as monotonically increasing wavelength. Hereafter, :math:`n_{ch}` is the length of ``lbda_obs``. spec_obs : numpy 1d ndarray or list Observed spectrum for each value of ``lbda_obs``. Should have a length of :math:`n_{ch}`. err_obs : numpy 1d/2d ndarray or list Uncertainties on the observed spectrum. The array (list) can have either a length of :math:`n_{ch}`, or a shape of :math:`(2,n_{ch})` for lower (first column) and upper (second column) uncertainties provided. lbda_tmp : numpy 1d ndarray or list Wavelengths of tested template. Should have a wider wavelength extent than the observed spectrum. spec_tmp : numpy 1d ndarray Template spectrum. It does not require the same wavelength sampling as the observed spectrum. If higher spectral resolution, it will be convolved with the instrumental spectral PSF (if ``instru_res`` is provided) and then binned to the same sampling. If lower spectral resolution, a linear interpolation is performed to infer the value at the observed spectrum wavelength sampling. dlbda_obs : numpy 1d ndarray or list, optional Respective spectral channel width or FWHM of the photometric filters used for each point of the observed spectrum. This vector is used to infer which part(s) of a combined spectro+photometric spectrum should involve convolution+subsampling (model resolution higher than measurements), interpolation (the opposite), or convolution by the transmission curve of a photometric filter. If not provided, it will be inferred from the difference between consecutive lbda_obs points (i.e. inaccurate for a combined spectrum). It must be provided IF one wants to weigh each measurement based on the spectral resolution of each instrument (as in [OLO16]_), through the ``use_weights`` argument. instru_corr : numpy 2d ndarray, optional Spectral correlation between post-processed images in which the spectrum is measured. It is specific to the instrument, PSF subtraction algorithm and radial separation of the companion from the central star. Can be computed using ``special.spec_corr.spectral_correlation``. In case of a spectrum obtained with different instruments, it is recommended to construct the final spectral_correlation matrix with ``special.spec_corr.combine_corrs``. If ``instru_corr`` is not provided, the uncertainties in each spectral channel will be considered independent. See [GRE16]_ for more details. instru_res : float or list of floats/strings, optional The mean instrumental resolving power(s) OR filter names. This is used to convolve the model spectrum. If several instruments are used, provide a list of resolving power values / filter names, one for each instrument used. instru_idx: numpy 1d array, optional 1d array containing an index representing each instrument used to obtain the spectrum, label them from 0 to the number of instruments (:math:`n_{ins}`). Zero for points that don't correspond to any of the ``instru_res`` values provided, and i in :math:`[1,n_{ins}]` for points associated to instru_res[i-1]. This parameter must be provided if the spectrum consists of points obtained with different instruments. use_weights: bool, optional For the likelihood calculation, whether to weigh each point of the spectrum based on the spectral resolution or bandwidth of photometric filters used. Weights will be proportional to ``dlbda_obs/lbda_obs`` if ``dlbda_obs`` is provided, or set to 1 for all points otherwise. filter_reader: python routine, optional External routine that reads a filter file and returns a 2D numpy array, where the first column corresponds to wavelengths, and the second contains transmission values. Important: if not provided, but strings are detected in instru_res, the default file reader will be used. It assumes the following format for the files: - first row contains headers (titles of each column) - starting from 2nd row: 1st column: wavelength, 2nd col.: transmission - Unit of wavelength can be provided in parentheses of first header \ key name: e.g. "WL(AA)" for angstrom, "wavelength(mu)" for micrometer \ or "lambda(nm)" for nanometer. Note: only what is in parentheses \ matters for the units. ext_range : tuple or None, opt - If None: differential extinction is not considered as a free parameter. - If a tuple: it should contain 2 floats (for simplex \ ``search_mode``) or 3 floats (for grid search ``search_mode``) \ corresponding to the lower limit, upper limit (and step for the grid \ search). For the simplex search, the lower and upper limits are used \ to set a chi square of infinity outside of the range. Returns ------- chi_sq : float Goodness of fit indicator. Note ---- If several filter filenames are provided in ``instru_res``, the filter files must all have the same format and wavelength units (for reading by the same ``filter_reader`` snippet or default function). """ tmp_spec = spec_tmp*params[0] if len(params) == 2: if ext_range is None: raise TypeError("ext_range should be a tuple of 3 elements") else: if params[1]<ext_range[0] or params[1]>ext_range[1]: return np.inf AV=params[1] if len(ext_range) == 3 and abs(AV) < ext_range[-1]: # round to zero if smaller than step AV = 0 Albdas = extinction(lbda_tmp,abs(AV)) extinc_fac = np.power(10.,-Albdas/2.5) if AV>0: tmp_spec *= extinc_fac elif AV<0: tmp_spec /= extinc_fac elif len(params) > 2: raise TypeError("params tuple should have length 1 or 2") return goodness_of_fit(lbda_obs, spec_obs, err_obs, lbda_tmp, tmp_spec, dlbda_obs=dlbda_obs, instru_corr=instru_corr, instru_res=instru_res, instru_idx=instru_idx, use_weights=use_weights, filter_reader=filter_reader)