Submitted Date
Subjects
Authors
Institution
  • Parallel Corpus Sentence Alignment Scoring for Low-Resource Language Machine Translation

    Subjects: Linguistics and Applied Linguistics >> Linguistics and Applied Linguistics Subjects: Computer Science >> Natural Language Understanding and Machine Translation submitted time 2024-06-05

    Abstract: Objective This paper aims to quantify the sentence alignment scores of low-resource parallel corpora to obtain high-quality parallel corpora, improving machine translation performance. Methods We propose NeuroAlign, a neural network-based unsupervised sentence embedding method for scoring bilingual parallel sentence alignment. Parallel sentence pairs are embedded into the same vector space, and alignment scores for given candidate sentence pairs in the parallel corpus are calculated. Based on these scores, low-scoring sentence pairs are filtered out, resulting in high-quality bilingual parallel corpora for low-resource languages. Results In the BUCC2018 parallel text mining task, the F1 score can be improved by 0.5-0.8. In the CCMT2021 low-resource language neural machine translation task, the BLEU score can be improved by 0.1-10.9. The sentence alignment scores can approach human evaluation. Limitations Due to the scarcity of low-resource bilingual parallel corpora, research has not been conducted on language pairs other than Tibetan-Chinese, Uyghur-Chinese, and Mongolian-Chinese. Conclusions This method can be effectively applied to sentence alignment scoring for low-resource language machine translation parallel corpora, improving the quality of the data source, and thereby enhancing machine translation performance.

  • A Study on the Applicability of Author Identification Numbers in Scientific and Technical Paper Databases

    Subjects: Management Science >> Science ology and Management submitted time 2024-06-05

    Abstract: Purpose To evaluate the coverage and accuracy of author identification number (author ID) of the major bibliographic databases and to assess whether they could be directly used in empirical research.
    Methods The ground truth data set consists of articles from 825 Chinese scientists. The coverage, accuracy, and robustness of each author ID are calculated by retrieving and collecting the IDs of scientists and their respective publication information in the bibliographic databases. The validity of the author IDs for empirical research is assessed by replicating a top journal empirical article using the data collected through author IDs.
    Results First, WOS, Scopus, AMiner, and OpenAlex can retrieve more than 90% of Chinese scientists’ identifiers, while ORCID’s coverage is less than 50%. Second, the accuracy of Scopus is the highest at 85.2%, and the accuracy of OpenAlex is the lowest at only 51.2%. Third, directly using the publication data collected through author IDs for empirical research will introduce non-negligible bias.
    Limitations The ground truth data set is limited, because it is mainly composed of young scientists, and lack scientists from social sciences and humanities.
    Conclusion At present, the author identification number of the major databases cannot be directly applied to the empirical research of large-scale data. A standardized information platform for scientists’ publications is needed to overcome the author-name disambiguation problem.

  • Prediction of Clock Bias for BeiDou Satellites Using a Combination of Variational Mode Decomposition and Long Short-Term Memory Network

    Subjects: Physics >> Geophysics, Astronomy, and Astrophysics Subjects: Information Science and Systems Science >> Basic Disciplines of Information Science and Systems Science submitted time 2024-06-05

    Abstract: The precise estimation of the satellite clock bias (SCB) holds considerable importance in ensuring accurate timekeeping, navigation, and positioning. This studyintroduces a novel SCB prediction approach that integrates variational mode decomposition (VMD) and long short-term memory (LSTM) network techniques, combining signal decomposition with deep learning methodologies. Initially, the raw SCB data undergoespreprocessing, followed by decomposition using the VMD method to generate multiple intrinsic mode functions (IMFs). These decomposed IMFs serve as inputs for LSTM, where several independent LSTM models are established for training and prediction purposes. Subsequently, the predicted outcomes are aggregated and reconstructed to derive the finalSCB prediction. Experimental findings demonstrate notable advancements in clock bias prediction for the spaceborne hydrogen atomic clock for BDS, with prediction accuracies of 0.048 ns, 0.204 ns and 1.397 ns for 6 hours, 3 days and 15 days, respectively. These results exhibit significant enhancements compared to both the LSTM network and the Back Propagation (BP) neural network, with improvements of 56%, 84% and 83% for the aforementioned time intervals in comparison to LSTM, and enhancements of 59%, 82% and 83% relative to the BP neural network. 

  • Analysis of the batch retraction of publications in Hindawi journals

    Subjects: Library Science,Information Science >> Library Science submitted time 2024-06-05

    Abstract: Objective To analyze some of the characteristics of papers retracted in batch by Hindawi journals. The aim is to provide a reference for Chinese scientific and technical journals on how to deal with the systematic manipulation of the publication process.
    Methods Retraction statements and retracted papers were obtained from the Hindawi XML corpus. Based on the Web of Science core collection, we analyzed the author affiliations and citations of the retracted papers.
    Results Hindawi journals retracted a total of 11,371 papers between November 16, 2022 and March 20, 2024. The majority (74.7%) of these retracted papers were published in 2022, with an average of 46 days from submission to acceptance, 77 days from submission to publication, and 526 days from publication to retraction. The 50 institutions in mainland China with the highest number of publications in the same period were involved in publishing 18.6% of the retracted papers.
    Conclusion Batched retraction of papers involving systematic manipulation of the publication process is a practice by publishers to combat “paper mills”. Although most retracted papers had authors from mainland China, the authors of retracted papers cannot be considered representative of Chinese researchers. Chinese scientific journals should avoid manipulation of the publication process due to editorial incompetence, and research institutions should strengthen education on research integrity to prevent Chinese authors from being involved in similar incidents.

  • A LLM-based RPA System for Optimizing Workflows in Financial Risk Management

    Subjects: Management Science >> Enterprise Management submitted time 2024-06-05

    Abstract: This paper aims to leverage the advancements in General Computer Control (GCC) to improve the efficiency and effectiveness of risk management operations in financial institutions. Specifically, we introduce an LLM-based Robotic Process Automation (RPA) framework designed to enhance front-line employee work, adapt to the specific needs of financial institutions, and automate tasks requiring minimal cognitive effort. To demonstrate the effectiveness of our proposed framework, stress testing, a common task for risk management department, is used as a case study. The results show that the RPA system can improve efficiency, reduce costs, and minimize errors, all without significantly altering the existing workflow. Moreover, to address customer information security and prompts copyright protection issues, a storage method that separates the server from the client is used. Finally, empirical evidence implies that even models with weaker capabilities can achieve the desired work objectives when guided by detailed prompts.

  • A case report of integrated treatment and nursing of acute giant cell infection secondary to intracranial venous sinus thrombosis

    Subjects: Nursing >> Nursing Subjects: Medicine, Pharmacy >> Traditional Chinese Medicine and Chinese Materia Medica submitted time 2024-06-05

    Abstract: A case of giant cell infection secondary to intracranial venous sinus thrombosis was summarized and analyzed. In this case, headache was the main clinical manifestation, accompanied by vomiting and cold sweat, without fever or cough. After admission, relevant examinations were improved, treatment such as reducing intracranial pressure, anticoagulation and antiviral were provided, lumbar puncture and intracranial angiography were improved, vital signs of patients were closely monitored, the formation of cerebral hernia was prevented by basic nursing such as decubitus position, diet and urine and bowel movements, medication and postoperative signs were monitored, and various treatments and nursing were conducted to promote the recovery of patients.

  • Wide-bandgap semiconductor of three-dimensional unconventional stoichiometric NaCl2 crystal

    Subjects: Physics >> Condensed Matter: Structural, Mechanical and Thermal Properties submitted time 2024-06-04

    Abstract: The expanding applications call for novel new-generation wide-bandgap semiconductors.  Here, we show that a compound only composed of the ordinary elements Na and Cl, namely three-dimensional NaCl2 crystal, is a wide-bandgap semiconductor. This finding benefits from the breaking of conventional stoichiometry frameworks in the theoretical design, leading to the discovery of three-dimensional XY2 (X = Na, Li, K; Y = Cl, F, Br, I) crystals, with covalent bonds of Y pairs inducing the wide bandgap from 2.24 to 4.45 eV. Crucially, such an unexpected NaCl2 crystal was successfully synthesized under ambient conditions. The unconventional stoichiometric strategy with other chemical elements potentially yields more wide-bandgap semiconductors, offering the capability for bandgap tuning. These unconventional stoichiometric materials may also exhibit superconductivity, transparent inorganic electrides, high-energy-density, and beyond.
     

  • Self-adjoint operators and nontrivial zeros of Dirichlet L-function

    Subjects: Mathematics >> Algebra and Number Theory submitted time 2024-06-03

    Abstract: We give a kind of self-adjoint operator, whose spectrums are the set $S_ chi= {i( rho- frac{1}{2}) mid rho text{ is nontrivial zeros of $L$-function } L( chi,s) }$.

  • Radio Frequency Interference Mitigation Methods for Fast Radio Burst Observational Data

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: To quickly search for rare fast radio bursts (FRBs) from massive astronomical observation data, Radio Frequency Interference (RFI) mitigation is one of the crutial and challenging tasks. RFI will produce a large number of false positive candidates, and even submerge real astronomical events. Due to the complexity of the sources and types of RFI, there is currently no universal method to solve this problem. In order to reduce the impact of RFI on FRB search, the RFI in L band observational data of the Nanshan 26 m radio telescope was analyzed and studied. A three-level RFI mitigation procedure was established for the main narrowband RFI and broadband RFI, which effectively alleviated the RFI pollution of observational data. Embedding this procedure into the FRB DDSS (Dispersion Dynamic Spectra Search) pipeline, experimental results show that the detection rate and accuracy of the search algorithm have been further improved. This method provides valuable reference for RFI mitigation of FRB observational data.

  • Dynamical Characteristics of Active Asteroid 311P/PANSTARRS

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: Active asteroids have received lots of attention since they were discovered. 311P/PANSTARRS is one of the most interesting active asteroids in China recently and one of the targets of Tianwen-2. Because of its small size of about 400\;m, the Yarkovsky effect may have a significantly influence on its long-term dynamics. This paper discussed the changes in the long-term motion of 311P/PANSTARRS caused by the Yarkovsky effect. By assuming different surface compositions, this simulation introduced the semi-major axis drift by propagating orbits of orbital clones, and the effects of other factors such as close encounters, meteoroid impacts and YORP effect on the orbit evolution of 311P/PANSTARRS were also discussed. Additionally, we estimated the time scale for 311P/PANSTARRS to reach its rotation period of splitting limit. The results of the simulations show that the Yarkovsky effect may cause 311P/PANSTARRS to exit from the resonance region faster when compared to a purely gravitational model. 311P/PANSTARRS will leave the current resonance region after roughly 10\;Myr and have a chance to become a Mars-crossing asteroid through $\nu_{6}$ secular resonance due to the diurnal Yarkovsky effect if its surface is covered by a regolith layer. It is concluded that 311P/PANSTARRS is stable at least 10\;Myr time scale even if taking the Yarkovsky effect and the YORP effect into account. Furthermore, the YORP effect may not significantly affect the semi-major axis drift of 311P/PANSTARRS.

  • Adjustment Method of Main Reflector for a Large Radio Telescope

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: The performance of the antennas, which plays an important role in millimeter-wave astronomy and space communication, is often limited by their reflector surface accuracy. Microwave holography is a fast and effective technique for measuring the surface profile of reflector antenna. In this paper, the antenna aperture phase profile is obtained by microwave holography to estimate the deviation between the reflector and the ideal paraboloid of the Tianma 65 m radio telescope. The panels of the Tianma 65 m radio telescope is in radial pattern with 14 rings. Each corner of the panel is fixed on the screw of the actuator to move up and down, and the adjacent corners of the four panels share an actuator. We use the method of plane fitting to calculate the adjustment value of every panels corner. But one actuator, which simultaneously controls the common corner of the adjacent panels, will have different adjustment values according to the plane fitting equation based on adjacent panels. In this paper, an adjustment value at the corner of the adjacent panel is obtained by plane fitting and adjustment calculation method with the antenna illumination function as the weight, that is, the optimal adjustment value of 1104 actuators of the Tianma 65 m radio telescope. Through many adjustments and the application of new algorithms, the surface accuracy of the reflector of the Tianma 65 m radio telescope has gradually improved to the current 0.24 mm.

  • Optimal Design of the Wind Barrier for Large-aperture and High-precision Radio Telescope Site Based on Numerical Simulation

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: With the increase in aperture and observation frequency of radio telescopes, there is a growing demand for enhanced pointing accuracy. However, telescope is operated at the field station, and the influence of wind disturbances on telescope pointing accuracy cannot be ignored in high frequency observation. Due to the time-varying characteristics of wind disturbances, the existing wind resistance methods cannot guarantee the effective observation time of large aperture and high pointing accuracy telescope in high frequency. Therefore, a method to improve the wind environment of the site based on the precise arrangement of wind barriers is proposed. The wind barrier simulation model is built, and the average error of the two porosity wind barriers is 3.7\% and 6.1\% respectively by comparing the simulation results with the measured data of the wind tunnel, which ensures the reliability of the wind barrier model. The QiTai radio Telescope (QTT) site in Xinjiang is taken as an example. The computational domain model is developed based on the slope terrain of the QTT site, and a series of wind field simulation tests are conducted to investigate the impact of different heights and porosities of a single wind barrier on downstream wind protection effectiveness. The parameters for simulating the north-south wind barrier are determined based on an appropriate height and optimal porosity. The results indicate that combining the determined height with a porosity of 0.1--0.1 yields the most effective wind reduction in the antenna area, with south winds experiencing a decrease in speed by over 75\%.

  • Research on the Centering Algorithm of Dwarf Planet Haumea System

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: Affected by the Earth's atmosphere, the image of primary and satellite system observed by ground-based telescopes often appears unresolved. Therefore, there may be some deviation between the system photocenter and the center of mass. The accurate measurement of the center of mass is helpful to improve the orbital parameters and to reveal the formation and evolution of the solar system. Taking the dwarf planet Haumea and its brighter satellite Hi'iaka as an example, we simulate its motion of the photocenter around the center of mass and explore whether the track of photocenter is the same with different seeings. The simulation results show that when using the two-dimensional Gaussian centering algorithm, the photocenter track is changing with the change of seeing, however, this phenomenon will not occur when using the modified moment centering algorithm. Based on the simulation results and the influence of noise, a new centering algorithm is proposed in this paper, which can effectively reduce the influence of changing seeing on the accurate photocenter position measurement. In addition, we also consider the influence of primary's changing luminosity on the photocenter position measurement. The images of dwarf planet Haumea were observed over 29 nights from February 7, 2022 to May 25, 2022 by three telescopes located in Yunnan Province. Our results show that the fitted error of the new centering algorithm's results is smaller than the two-dimensional Gaussian algorithm's results. In addition, we find that theoretical position of the bright satellite from Jet Propulsion Laboratory(JPL) and Institut de M\'{e}canique C\'{e}leste et de Calcul des \'{E}ph\'{e}m\'{e}rides(IMCCE) ephemeris has a large deviation.

  • Classification of Galaxy Morphology Based on FPN-ViT Model

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: With the development of artificial intelligence technology, the research of galaxy morphology classification using deep learning methods has made great progress, but there are still shortcomings in classification accuracy, automation and spatial characteristics representation of galaxies. The Vision Transformer model has good robustness in galaxy morphology classification, but has limitations in handling multi-scale images. In this paper, we propose to introduce the Feature Pyramid Networks(FPN) into the Vision Transformer(ViT) model to classify galaxies. The results show that the average accuracy, precision, recall, and F1-score of the FPN-ViT model are above 95\%, and the indexes are improved compared with the traditional ViT model. Meanwhile, we add different levels of Gaussian noise and pretzel noise to the original galaxy images to verify that the FPN-ViT model can obtain better classification performance for low signal-to-noise ratio data. In addition, to evaluate the model comprehensively, the t-distributed Stochastic Neighbor Embedding(t-SNE) algorithm is used to visualize and analyze the classification results, which can show the effect of FPN-ViT model on galaxy morphology classification more directly. The application of FPN network to the classification of galaxy morphology by ViT model is a new attempt, which is of great importance for the subsequent research.

  • Design and Realization of the Calorimeter Test System for the Hard X-ray Imager

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: The Advanced Space-based Solar Observatory(ASO-S), as one of the second series scientific satellites in the space science pilot project, was proposed by the Purple Mountain Observatory of Chinese Academy of Sciences. ASO-S aims to observe and investigate the two solar eruptive phenomena, i.e., solar flares and coronal mass ejections, and their relationships with magnetic field. Hard X-ray Imager(HXI) is one of the payloads aboard ASO-S. As an important part of HXI, the calorimeter is responsible for measuring solar hard X-rays in the energy range of 30--200 keV. Before the launch of ASO-S, a ground test system was necessary to verify the function and performance of the HXI calorimeter. There are 99 lanthanum bromide detectors in the HXI calorimeter, each controlled by 8 identical front-end electronic boards. In addition to testing the performance of each channel, the ground test system also needs to simulate the operation of the calorimeter under different solar activities in orbit, conducting a comprehensive test. Moreover, the ground test system needs to be stable enough to meet the long-term testing needs of the calorimeter in different tests, including standalone testing, environmental experiments, thermal vacuum and vibration tests. According to the test requirements, a ground test board and a software in the host computer were designed, combined with radioactive sources, a DC(Direct Current) power supply, and a high-voltage module to form a ground test system for the HXI calorimeter. It can simultaneously configure and manage 8 front-end electronic boards, efficiently complete command sending and data receiving, and meet the maximum data output bandwidth of 400 Mbps of the calorimeter. The ground test system can fully meet the requirement of automatic control, status monitoring, data acquisition and data analysis for the HXI calorimeter. Using this ground test system, all function and performance tests for the HXI calorimeter were completed with the acquisition of the noise, the dead time, the energy resolution and other performance data. The ground test system provides strong guarantee for the high performance operation of the HXI calorimeter in orbit.

  • Discrete Element Method Simulation System for Asteroid

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: Asteroid detection is now a hot spot of solar system exploration. The understanding of the evolution of asteroids has a great benefit on researches to the origin of the solar system. An important topic of the evolution research is the evolution of inner structures of asteroids, in other words, asteroids' evolution of shapes and structures under multiple mechanisms. A common method to simulate the dynamical evolution of asteroids is Discrete Element Method(DEM) under the assumption that asteroids are in rubble-pile structures. Some teams have developed several kinds of softwares for DEM simulations. The basics, realization, algorithms of our software ``Multi-particle system simulation software based on DEM algorithm'' are introduced in this article and the software is verified using two body collision, sound wave propagation, inner stress of asteroid, and asteroid spin disruption setups.

  • Research on Fault Identification Method of Elevation Bearing for Large Aperture Radio Telescope Based on Meta Learning

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: The prolonged operation of the large aperture radio telescope will lead to slight distortion of elevation axis, increasing the risk of fatigue of the rolling bearing which is the core component of the elevation axis. This can lead to a decline in the bearing life and the pointing accuracy of the telescope, which will greatly affect the telescope's high performance service. Investigating the fault identification method for the elevation bearing can provide an important support for the high-performance operation of the telescope antenna. In this paper, a few-shot meta-learning fault identification(FMFI) method based on meta-learning is proposed in order to achieve accurate fault identification of elevation bearing under limited data and complex working conditions. The raw signals of different working conditions are first converted to time-frequency images data and then randomly sampled for different learning tasks according to the meta-learning protocol. Under limited sample data condition, the FMFI method can obtain universal prior knowledge from the sample in the training task to achieve accurate and fast fault identification in unknown testing tasks. The variable load bearing data set which is similar to the working condition of telescope elevation bearing is selected for experiment, and the experimental results show that the FMFI method is high accurate and reliable, providing strong technical support for the operation, maintenance, and high-quality service of large aperture radio telescope.

  • Intensity Calibration for the Mingantu Spectral Radioheliograph Images

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: The Mingantu Spectral Radioheliograph(MUSER), a new generation of solar dedicated radio imaging spectroscopic telescope, has realized high-time, high-angular, and high-frequency resolution imaging of the Sun in the 0.4--15 GHz ultra-broadband frequency range. The radio brightness temperature is an important parameter to describe the solar physical process. It plays a very important role in the study of different radio radiation mechanisms, solar magnetic field and the acceleration of non-thermal particles in the solar burst process. Therefore, the image of radioheliograph must be calibrated for brightness temperature. This paper introduces a method suitable for radioheliograph image intensity calibration. The solar radio image contains the structural information of the solar disk. The radio radius and intensity of the quiet solar disk in the image can be obtained by fitting the first kind of Bessel function with the visibilities of short baselines of the radioheliograph. Then, the Rayleigh-Jeans law and the daily solar radio flux can be used to calculate the calibration factor $G_{\rm c}$ of the daily image, so as to realize the calibration of the MUSER image intensity. Applying this method to the actual observation data of MUSER, including different situations such as the quiet sun and solar radio bursts, the error of the daily calibration factor $G_{\rm c}$ is within 10\% of its value, and the obtained bright temperature of the quiet sun has a high correlation with the result obtained by other people. These indicate the feasibility and effectiveness of this method.

  • Combination of an Optical Clock and Hydrogen Masers for Accurate Time Scale Calculation

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: The frequency stability and uncertainty of the optical clock have reached a magnitude of $10^{-18}$, making it a promising candidate for the next generation of time and frequency standards, and it may be used to redefine the international unit ``second''. The time scale serves as a benchmark for accurately and continuously marking the passage of time, being the foundation of high-precision time generation. The production of a time scale relies on the continuous and stable operation of atomic clocks; however, optical clocks, as laboratory prototype devices, generally cannot operate continuously, thus involving optical clocks in time scale calculation presents a challenging issue. The proposed application of the Vondrak-Cepek combined filtering algorithm for joint time scale calculations with an optical clock and hydrogen masers aims to address the challenges posed by the intermittent operation of an optical clock. Initially, the ALGOS algorithm is used on the time difference data of the hydrogen masers to calculate and obtain a continuous and stable hydrogen maser clock time scale. Subsequently, the Vondrak-Cepek combined filtering algorithm is used to integrate the hydrogen maser time scale with the optical clock data to acquire a combined time scale that involves optical clocks in the calculation. Finally, the experimental results prove that the Vondrak-Cepek combined filtering algorithm effectively improves the performance of the combined time scale of optical clocks and hydrogen masers, with the time deviation of this time scale reaching the sub-nanosecond magnitude when compared with Coordinated Universal Time.

  • Batch Measurement and Calibration Method of DSRT Three-axis Low Frequency Antenna Pointing Error

    submitted time 2024-06-03 Cooperative journals: 《天文学报》

    Abstract: Daocheng Solar Radio Telescope(DSRT) is an important part of the Solar Interplanetary exploration subsystem of the second phase of The Chinese Meridian Project. It operates in the 150\;MHz to 450\;MHz frequency band and provides high-resolution spatial and temporal images of solar eruption brightness and temperature. Aiming at the high precision pointing measurement of the DSRT antenna and the requirement of batch calibration and correction of pointing errors, this paper establishes the 3-parameter, antenna encoder zero point offset pointing error model by quaternion rotation transformation method according to the unique three-axis mount system of DSRT. In this paper, the drift scanning method based on radio source is proposed to obtain the radiation pattern of 16 antennas and determine the boresight according to the two-dimensional power pattern to accurately measure the pointing error of the DSRT antenna. Finally, the least square method is used to get the model parameters, and the antenna control software is used to adjust the encoder zero point of each axis, and then the adjustment results are verified. The results show that the pointing calibration method is reliable and effective. The pointing accuracy of 16 antennas after correction is within 0.5$^\circ$, which is significantly better than the pointing error of 3.5$^\circ$ before calibration, and the error is less than one-tenth of HBPW(half power beam width) under the maximum working frequency of the DSRT antenna.