按提交时间
按主题分类
按作者
按机构
您选择的条件: Jun Han
  • Surveillance of SARS-CoV-2 at the Huanan Seafood Market

    分类: 医学、药学 >> 预防医学与公共卫生学 提交时间: 2023-03-29

    摘要: Emerging in December 2019, coronavirus disease 2019 (COVID-19) eventually became a pandemic and posed a tremendous threat to global public health. However, the origins of SARS-CoV-2, the causative agent of COVID-19, remain to be determined. It has been reported that a certain number of the early case clusters had a contact history with the Huanan Seafood Market. Therefore, surveillance of SARS-CoV-2 within the market is of vital importance. Herein, we presented the SARS-CoV-2 detection results of 1380 samples collected from the environment since 1st Jan and animals since 18th Jan within the market in early 2020. By SARS-CoV-2-specific RT-qPCR, 73 environmental samples tested positive for SARS-CoV-2 and three live viruses were successfully isolated. The viruses from the market shared nucleotide identity of 99.99% to 100% with the human isolate HCoV-19/Wuhan/IVDC-HB-01/2019. The A lineage (8782T and 28144C), as the likely ancestral SARS-CoV-2 lineage, was found in an environmental sample. No virus was detected in the animal swabs covering 18 species of animals in the market. The RNA-seq analysis of SARS-CoV-2 positive/negative environmental samples showed the abundance of different vertebrata genera. In summary, this study provided convincing evidence of the prevalence of SARS-CoV-2 in the Huanan Seafood Market during the early stage of COVID-19 outbreak.

  • Photometric redshift estimation of galaxies in the DESI Legacy Imaging Surveys

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: The accurate estimation of photometric redshifts plays a crucial role in accomplishing science objectives of the large survey projects. The template-fitting and machine learning are the two main types of methods applied currently. Based on the training set obtained by cross-correlating the DESI Legacy Imaging Surveys DR9 galaxy catalogue and SDSS DR16 galaxy catalogue, the two kinds of methods are used and optimized, such as EAZY for template-fitting approach and CATBOOST for machine learning. Then the created models are tested by the cross-matched samples of the DESI Legacy Imaging SurveysDR9 galaxy catalogue with LAMOST DR7, GAMA DR3 and WiggleZ galaxy catalogues. Moreover three machine learning methods (CATBOOST, Multi-Layer Perceptron and Random Forest) are compared, CATBOOST shows its superiority for our case. By feature selection and optimization of model parameters, CATBOOST can obtain higher accuracy with optical and infrared photometric information, the best performance ($MSE=0.0032$, $\sigma_{NMAD}=0.0156$ and $O=0.88$ per cent) with $g \le 24.0$, $r \le 23.4$ and $z \le 22.5$ is achieved. But EAZY can provide more accurate photometric redshift estimation for high redshift galaxies, especially beyond the redhisft range of training sample. Finally, we finish the redshift estimation of all DESI DR9 galaxies with CATBOOST and EAZY, which will contribute to the further study of galaxies and their properties.

  • Photometric redshift estimation of galaxies in the DESI Legacy Imaging Surveys

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: The accurate estimation of photometric redshifts plays a crucial role in accomplishing science objectives of the large survey projects. The template-fitting and machine learning are the two main types of methods applied currently. Based on the training set obtained by cross-correlating the DESI Legacy Imaging Surveys DR9 galaxy catalogue and SDSS DR16 galaxy catalogue, the two kinds of methods are used and optimized, such as EAZY for template-fitting approach and CATBOOST for machine learning. Then the created models are tested by the cross-matched samples of the DESI Legacy Imaging SurveysDR9 galaxy catalogue with LAMOST DR7, GAMA DR3 and WiggleZ galaxy catalogues. Moreover three machine learning methods (CATBOOST, Multi-Layer Perceptron and Random Forest) are compared, CATBOOST shows its superiority for our case. By feature selection and optimization of model parameters, CATBOOST can obtain higher accuracy with optical and infrared photometric information, the best performance ($MSE=0.0032$, $\sigma_{NMAD}=0.0156$ and $O=0.88$ per cent) with $g \le 24.0$, $r \le 23.4$ and $z \le 22.5$ is achieved. But EAZY can provide more accurate photometric redshift estimation for high redshift galaxies, especially beyond the redhisft range of training sample. Finally, we finish the redshift estimation of all DESI DR9 galaxies with CATBOOST and EAZY, which will contribute to the further study of galaxies and their properties.

  • Photometric Redshift Estimation of BASS DR3 Quasars by Machine Learning

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: Correlating BASS DR3 catalogue with ALLWISE database, the data from optical and infrared information are obtained. The quasars from SDSS are taken as training and test samples while those from LAMOST are considered as external test sample. We propose two schemes to construct the redshift estimation models with XGBoost, CatBoost and Random forest. One scheme (namely one-step model) is to predict photometric redshifts directly based on the optimal models created by these three algorithms; the other scheme (namely two-step model) is to firstly classify the data into low- and high- redshift datasets, and then predict photometric redshifts of these two datasets separately. For one-step model, the performance of these three algorithms on photometric redshift estimation is compared with different training samples, and CatBoost is superior to XGBoost and Random forest. For two-step model, the performance of these three algorithms on the classification of low- and high-redshift subsamples are compared, and CatBoost still shows the best performance. Therefore CatBoost is regard as the core algorithm of classification and regression in two-step model. By contrast with one-step model, two-step model is optimal when predicting photometric redshift of quasars, especially for high redshift quasars. Finally the two models are applied to predict photometric redshifts of all quasar candidates of BASS DR3. The number of high redshift quasar candidates is 3938 (redshift $\ge 3.5$) and 121 (redshift $\ge 4.5$) by two-step model. The predicted result will be helpful for quasar research and follow up observation of high redshift quasars.

  • Identification of BASS DR3 Sources as Stars, Galaxies and Quasars by XGBoost

    分类: 天文学 >> 天文学 提交时间: 2023-02-19

    摘要: The Beijing-Arizona Sky Survey (BASS) Data Release 3 (DR3) catalogue was released in 2019, which contains the data from all BASS and the Mosaic z-band Legacy Survey (MzLS) observations during 2015 January and 2019 March, about 200 million sources. We cross-match BASS DR3 with spectral databases from the Sloan Digital Sky Survey (SDSS) and the Large Sky Area Multi-object Fiber Spectroscopic Telescope (LAMOST) to obtain the spectroscopic classes of known samples. Then, the samples are cross-matched with ALLWISE database. Based on optical and infrared information of the samples, we use the XGBoost algorithm to construct different classifiers, including binary classification and multiclass classification. The accuracy of these classifiers with the best input pattern is larger than 90.0 per cent. Finally, all selected sources in the BASS DR3 catalogue are classified by these classifiers. The classification label and probabilities for individual sources are assigned by different classifiers. When the predicted results by binary classification are the same as multiclass classification with optical and infrared information, the number of star, galaxy and quasar candidates is separately 12 375 838 (P_S>0.95), 18 606 073 (P_G>0.95) and 798 928 (P_Q>0.95). For these sources without infrared information, the predicted results can be as a reference. Those candidates may be taken as input catalogue of LAMOST, DESI or other projects for follow up observation. The classified result will be of great help and reference for future research of the BASS DR3 sources.