Optimization of the TF-IDF Algorithm Combined with an Improved CHI Statistical Method

Author: 马莹 ¹ 赵辉 ¹ 李万龙 ¹ 庞海龙 ¹ 崔岩 ¹
Institute:

1. School of Computer Science and Engineering, Changchun University of Technology
Submit Time: 2018-05-24 21:08:12

Abstract: Feature selection and feature weight calculation are two crucial steps in text classification and play a key role in its results. To overcome the problems of the traditional CHI statistical method relating to the frequency of feature items, negative correlation between feature items and categories, and the probability that a feature item occurs in a text, the traditional CHI statistical method is improved by introducing factors such as a negative-correlation judgment and frequency, and the TF-IDF algorithm is optimized by combining it with a semantic similarity calculation. The K-nearest neighbor (KNN) classifier and the support vector machine (SVM) classifier are each used in the WEKA software to classify a Weibo sentiment corpus. The experimental results show that the new method noticeably improves the accuracy of text classification.

Keywords: text classification; CHI statistics; TF-IDF algorithm; feature selection
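
For orientation, the two baseline quantities named in the abstract can be sketched as follows. This is a minimal illustration of the classical CHI (chi-square) statistic and a plain TF-IDF weight, not the authors' improved method, whose formulas are not given on this page. The function names, the toy corpus, the unsmoothed `log(N / df)` form of IDF, and the use of the sign of `a*d - b*c` as a negative-correlation indicator are all assumptions made for the example.

```python
import math

def chi_square(docs, labels, term, category):
    """Classical CHI statistic for (term, category) from a 2x2 contingency table.

    Returns (chi, sign); sign is +1, -1 or 0 according to a*d - b*c.
    The sign is an illustrative negative-correlation indicator (an assumption).
    """
    a = b = c = d = 0
    for tokens, label in zip(docs, labels):
        present = term in tokens
        in_cat = label == category
        if present and in_cat:
            a += 1          # term present, document in category
        elif present:
            b += 1          # term present, document outside category
        elif in_cat:
            c += 1          # term absent, document in category
        else:
            d += 1          # term absent, document outside category
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        return 0.0, 0
    diff = a * d - b * c
    chi = n * diff ** 2 / denom
    sign = (diff > 0) - (diff < 0)
    return chi, sign

def tf_idf(docs, term, doc_index):
    """Plain TF-IDF: relative term frequency times log(N / document frequency)."""
    tokens = docs[doc_index]
    df = sum(1 for t in docs if term in t)
    if df == 0 or not tokens:
        return 0.0
    tf = tokens.count(term) / len(tokens)
    return tf * math.log(len(docs) / df)

# Hypothetical toy corpus: tokenized documents with sentiment labels.
docs = [["good", "happy"], ["good", "great"], ["bad", "sad"], ["bad", "awful"]]
labels = ["pos", "pos", "neg", "neg"]

print(chi_square(docs, labels, "good", "pos"))  # (4.0, 1)
print(chi_square(docs, labels, "bad", "pos"))   # (4.0, -1)
print(tf_idf(docs, "good", 0))
```

In this toy corpus, "good" and "bad" receive the same CHI score for the category "pos" even though "bad" never occurs in it, because the squared term discards the direction of the association. Only the sign separates the two cases, which is one way to read the "negative correlation judgment" mentioned in the abstract.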

Journal: Application Research of Computers (计算机应用研究)
Subject: Computer Science >> Integration Theory of Computer Science
Cite as: ChinaXiv:201805.00488 (or this version ChinaXiv:201805.00488V1)
DOI:10.12074/201805.00488V1
CSTR:32003.36.ChinaXiv.201805.00488.V1
Recommended reference: 马莹, 赵辉, 李万龙, 庞海龙, 崔岩. (2018). Optimization of the TF-IDF Algorithm Combined with an Improved CHI Statistical Method. Application Research of Computers (计算机应用研究). [ChinaXiv:201805.00488]

