您选择的条件: Zechang Sun
  • An Unsupervised Learning Approach for Quasar Continuum Prediction

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: Modeling quasar spectra is a fundamental task in astrophysics as quasars are the tell-tale sign of cosmic evolution. We introduce a novel unsupervised learning algorithm, Quasar Factor Analysis (QFA), for recovering the intrinsic quasar continua from noisy quasar spectra. QFA assumes that the Ly$\alpha$ forest can be approximated as a Gaussian process, and the continuum can be well described as a latent factor model. We show that QFA can learn, through unsupervised learning and directly from the quasar spectra, the quasar continua and Ly$\alpha$ forest simultaneously. Compared to previous methods, QFA achieves state-of-the-art performance for quasar continuum prediction robustly but without the need for predefined training continua. In addition, the generative and probabilistic nature of QFA paves the way to understanding the evolution of black holes as well as performing out-of-distribution detection and other Bayesian downstream inferences.

  • Quasar Factor Analysis -- An Unsupervised and Probabilistic Quasar Continuum Prediction Algorithm with Latent Factor Analysis

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: Since their first discovery, quasars have been essential probes of the distant Universe. However, due to our limited knowledge of its nature, predicting the intrinsic quasar continua has bottlenecked their usage. Existing methods of quasar continuum recovery often rely on a limited number of high-quality quasar spectra, which might not capture the full diversity of the quasar population. In this study, we propose an unsupervised probabilistic model, \textit{Quasar Factor Analysis} (QFA), which combines factor analysis (FA) with physical priors of the intergalactic medium (IGM) to overcome these limitations. QFA captures the posterior distribution of quasar continua through generatively modeling quasar spectra. We demonstrate that QFA can achieve the state-of-the-art performance, $\sim 2\%$ relative error, for continuum prediction in the Ly$\alpha$ forest region compared to previous methods. We further fit 90,678 $22$ quasar spectra from Sloan Digital Sky Survey Data Release 16 and found that for $\sim 30\%$ quasar spectra where the continua were ill-determined with previous methods, QFA yields visually more plausible continua. QFA also attains $\lesssim 1\%$ error in the 1D Ly$\alpha$ power spectrum measurements at $\mathrm{z}\sim 3$ and $\sim 4\%$ in $\mathrm{z}\sim 2.4$. In addition, QFA determines latent factors representing more physically motivated than PCA. We investigate the evolution of the latent factors and report no significant redshift or luminosity dependency except for the Baldwin effect. The generative nature of QFA also enables outlier detection robustly; we showed that QFA is effective in selecting outlying quasar spectra, including damped Ly$\alpha$ systems and potential Type II quasar spectra.

  • DESI survey validation data in the COSMOS/HSC field: Cool gas trace main sequence star-forming galaxies at the cosmic noon

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: We present the first result in exploring the gaseous halo and galaxy correlation using the Dark Energy Spectroscopic Instrument (DESI) survey validation data in the Cosmic Evolution Survey (COSMOS) and Hyper Suprime-Cam (HSC) field. We obtain the multiphase gaseous halo properties in the circumgalactic medium (CGM) by using 115 quasar spectra (S/N > 3). We detect MgII absorption at redshift 0.6 < z < 2.5, CIV absorption at 1.6 < z < 3.6, and HI absorption associated with the MgII and CIV. The CGM is mixed by a higher density phase of detectable MgII and CIV and a lower density of CIV-only phase. By cross-matching the COSMOS2020 catalog, we identify the MgII and CIV host galaxies at 0.9 < z < 3.1 in ten quasar fields. We find that within the impact parameter of 250 kpc, a tight correlation is seen between strong MgII equivalent width and the host galaxy star formation rate. The covering fraction fc of strong MgII selected galaxies, which is the ratio of absorbing galaxy in a certain galaxy population, shows significant evolution in the main-sequence galaxies and marginal evolution in all the galaxy populations within 250 kpc at 0.9 < z < 2.2. The fc increase in the main-sequence galaxies likely suggests the co-evolution of strong MgII absorbing gas and the main-sequence galaxies at the cosmic noon. Furthermore, several MgII and CIV absorbing gas is detected out of the galaxy virial radius, tentatively indicating the feedback produced by the star formation and/or the environmental effects.

  • Quasar Factor Analysis -- An Unsupervised and Probabilistic Quasar Continuum Prediction Algorithm with Latent Factor Analysis

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: Since their first discovery, quasars have been essential probes of the distant Universe. However, due to our limited knowledge of its nature, predicting the intrinsic quasar continua has bottlenecked their usage. Existing methods of quasar continuum recovery often rely on a limited number of high-quality quasar spectra, which might not capture the full diversity of the quasar population. In this study, we propose an unsupervised probabilistic model, \textit{Quasar Factor Analysis} (QFA), which combines factor analysis (FA) with physical priors of the intergalactic medium (IGM) to overcome these limitations. QFA captures the posterior distribution of quasar continua through generatively modeling quasar spectra. We demonstrate that QFA can achieve the state-of-the-art performance, $\sim 2\%$ relative error, for continuum prediction in the Ly$\alpha$ forest region compared to previous methods. We further fit 90,678 $22$ quasar spectra from Sloan Digital Sky Survey Data Release 16 and found that for $\sim 30\%$ quasar spectra where the continua were ill-determined with previous methods, QFA yields visually more plausible continua. QFA also attains $\lesssim 1\%$ error in the 1D Ly$\alpha$ power spectrum measurements at $\mathrm{z}\sim 3$ and $\sim 4\%$ in $\mathrm{z}\sim 2.4$. In addition, QFA determines latent factors representing more physically motivated than PCA. We investigate the evolution of the latent factors and report no significant redshift or luminosity dependency except for the Baldwin effect. The generative nature of QFA also enables outlier detection robustly; we showed that QFA is effective in selecting outlying quasar spectra, including damped Ly$\alpha$ systems and potential Type II quasar spectra.

  • Deep Learning of DESI Mock Spectra to Find Damped Ly{\alpha} Systems

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: We have updated and applied a convolutional neural network (CNN) machine learning model to discover and characterize damped Ly$\alpha$ systems (DLAs) based on Dark Energy Spectroscopic Instrument (DESI) mock spectra. We have optimized the training process and constructed a CNN model that yields a DLA classification accuracy above 99$\%$ for spectra which have signal-to-noise (S/N) above 5 per pixel. Classification accuracy is the rate of correct classifications. This accuracy remains above 97$\%$ for lower signal-to-noise (S/N) $\approx1$ spectra. This CNN model provides estimations for redshift and HI column density with standard deviations of 0.002 and 0.17 dex for spectra with S/N above 3 per pixel. Also, this DLA finder is able to identify overlapping DLAs and sub-DLAs. Further, the impact of different DLA catalogs on the measurement of Baryon Acoustic Oscillation (BAO) is investigated. The cosmological fitting parameter result for BAO has less than $0.61\%$ difference compared to analysis of the mock results with perfect knowledge of DLAs. This difference is lower than the statistical error for the first year estimated from the mock spectra: above $1.7\%$. We also compared the performance of CNN and Gaussian Process (GP) model. Our improved CNN model has moderately 14$\%$ higher purity and 7$\%$ higher completeness than an older version of GP code, for S/N $>$ 3. Both codes provide good DLA redshift estimates, but the GP produces a better column density estimate by $24\%$ less standard deviation. A credible DLA catalog for DESI main survey can be provided by combining these two algorithms.