Your conditions: 王继民
  • Twitter Text Topic Mining and Sentiment Analysis Under the Belt and Road Initiative

    Subjects: Library Science,Information Science >> Library Science submitted time 2023-07-26 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/significance] The Belt and Road Initiative has attracted widespread attention around the world, and users in many countries have expressed their opinions, comments and discussed with each other on twitter, the most representative social media. The discussion topic and emotional tendency of "the Belt And Road" in the world extracted from the tweets will be helpful for the government to optimize their propaganda strategies and increase the exposure and attention of the Belt and Road Initiative.[Method/process] This paper collected more than 60 000 tweets related to the Belt and Road Initiative in 2017, and respectively carried out data preprocessing, data description, topic mining, and sentiment analysis in Chinese and English, and realized cross-analysis of topics and emotions to draw conclusions.[Result/conclusion] The tweet theme in 2017 is mainly around the "Belt and Road Forum for International Cooperation". Chinese tweets pay more attention to the planning and implementation of the forum, as well as security issues, visits by the leadership, etc. The emotional value fluctuates greatly, especially the negative emotions on security issues. English tweets are more concerned with the facts of holding the summit forum and the economic effects brought by the forum. The emotional fluctuations are small, and the emotional value of the economic aspect is that the positive proportion is significantly higher than the negative and neutral emotional values.

  • An Analysis of News Topics Mining Based on LDA Model: Taking “The Belt and Road” Related News as an Example

    Subjects: Library Science,Information Science >> Library Science submitted time 2023-07-26 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/significance] This paper conducted a LDA topic analysis on "the Belt and Road" related news content in official medias and built a basic framework of news topic analysis using LDA model to help the public understand the dynamics and progress of the initiative and its focus in different periods.[Method/process] This paper selected "the Belt and Road" related news on the Chinese government Website during 2015 to 2017, and conducted the topic extraction and heat evolution analysis using LDA model.[Result/conclusion] A total of 30 topics were extracted and summarized as seven categories called policy coordination, facilities connectivity, unimpeded trade, financial integration, people-to-people bond, economic impact and government work. Among them, the policy coordination category has the highest heat during whole time period. Unimpeded trade category and economic impact category are the second and third highest. The heat of some topics, such as "reform and transformation", decline over time, while others like "import and export" increase. These results reflect the changes in the attention of the official media to different news topics related with "the Belt and Road".

  • Healthcare Data Mining: Word Segmentation and Named Entity Recognition in Chinese Electronic Medical Record

    Subjects: Library Science,Information Science >> Library Science submitted time 2023-07-26 Cooperative journals: 《图书情报工作》

    Abstract: [Purpose/significance] Healthcare big data is an important basic strategic resource in China. Word segmentation and entity recognition of Chinese electronic medical record(EMR) is helpful in extracting important information from a large number of unstructured text.[Method/process] In this study, a Chinese medical thesaurus is firstly built in terms of authoritative medical subject headings, official standards and health website data; then, the effect of four segmentation methods is compared based on the corpus of artificial segmentation and manual annotation; finally, CRF model is used to identify 5 entities, including disease, symptom, test, drug and treatment.[Result/conclusion] Results show that (i)AC automaton model has the best F-measure in EMR word segmentation, which is 82%; (ii) compared with Western medical record, it's difficult to identify medical entities in the record of traditional Chinese medicine. Besides, "Test" and "Disease" entities have better F-measure, while the F-measure of "Symptom" entity is not that ideal.

  • A Review of Scholar Profiling Research

    Subjects: Library Science,Information Science >> Library Science submitted time 2022-11-26 Cooperative journals: 《图书情报工作》

    Abstract:

    This paper summarizes the research on scholar profile, and provides a refer­ ence for the related  research.  Through  literature research  and  analysis,  this paper discriminated the scholar profile and  its related  concepts,  summarized  the construction  process,key  technologies and  main  applica­ tions of the scholar profile,  and  analyzed  the challenges faced  by  the current research.  The construction process of scholar profile includes data collection,data preprocessing,scholar label construction and vis­ ual analysis.  The main  practical applications include expert recommendation,  academic resource recommendation  and scientific research  ability  evaluation.  At present,  there are still some challenges in  related  research,  such  as the diffi­ culty  of multi-source data acquisition  and  fusion,  difficulties in  research  on  dynamic update of the scholar profile and the lack of effective evaluation mechanism.