Subjects: Other Disciplines >> Synthetic discipline submitted time 2023-03-28 Cooperative journals: 《中国科学院院刊》
Abstract: In recent years, there are such significant improvements on the performance and efficiency of big data technology and system. As it is widely applied in various fields, big data has empowered industrial intelligence, and is the key step into the intelligent stage of information society. Therefore, we are facing greater challenges nowadays, such as the paradox of data flooding and high-value data lacking, the complexity and uncertainty of big data analysis, and the difficulty to balance the data on sharing and circulation, and trustworthiness and security. Moreover, these challenges will not only promote the innovation and change of big data technology, but also develop and establish a new technology system. With respect to the requirements of new architecture, new paradigm, new model and security and trustworthiness, this study proposes to build a new big data analyzing and processing system stack, explore the new paradigm of extracting big data value, and outlook on the pioneer applications as the traction to a broad range of fields.
Subjects: Other Disciplines >> Synthetic discipline submitted time 2023-03-28 Cooperative journals: 《中国科学院院刊》
Abstract: The development of data science is valuable to clarify the theoretical boundary of data science, and provides new possibilities and opportunities for the sustainable development of computing intelligence. Meanwhile, the development of computing intelligence and the emergence of new intelligence paradigms can offer new chance for applications of big data in various industries and fields. This paper discusses the connotation of data science, the development of computing intelligence, the new intelligence paradigm, and lists the key applications leading the development of data science and computing intelligence. Furthermore, based on the discussion during the 667th Xiangshan Science Conference, seven key problems of data science and computing technology are proposed, anticipating to attract attentions of both researchers and applications in related fields, grasping the opportunity of the era, and promoting sustainable development of data science and computing intelligence.
Subjects: Computer Science >> Computer Network submitted time 2017-03-10
Abstract:随着网络数据的爆炸性增长,信息处理技术面临着前所未有的巨大挑战。如何从体量巨大、增长迅速、结构复杂、良莠不齐的数据中发掘潜在价值成为了关键难题。面向网络大数据的信息检索与挖掘技术,旨在通过对大数据的深度分析与建模,有效弥合用户需求与网络数据之间的信息鸿沟。本文介绍了面向网络大数据的深度检索与挖掘的一系列关键技术,包括用户查询理解与处理、文档建模与理解及检索模型等。
Peer Review Status:Awaiting Review
Subjects: Computer Science >> Computer Application Technology submitted time 2017-03-10
Abstract:
Peer Review Status:Awaiting Review
Subjects: Computer Science >> Computer Application Technology submitted time 2017-03-09
Abstract:
Peer Review Status:Awaiting Review
Subjects: Computer Science >> Computer Application Technology submitted time 2017-03-09
Abstract:
Peer Review Status:Awaiting Review
Subjects: Computer Science >> Computer Network submitted time 2017-03-09
Abstract:随着网络在线视频的广泛应用,对等传输(P2P)技术越来越受到业界的重视。我们开发的CoolFish是一个基于对等传输技术,集视频点播、直播于一体的流媒体系统。在本文中,我们基于CoolFish 系统,对目前流行的视频传输技术做了较为详细的探讨和阐述,并对CoolFish 的架构、功能和模块设计进行了全面介绍,另外,我们对CoolFish 系统中涉及到的对等传输关键技术和算法进行了深入探讨。
Peer Review Status:Awaiting Review
Subjects: Computer Science >> Computer Application Technology submitted time 2017-03-09
Abstract:随着互联网信息的指数增长,为了提高信息挖掘的效率,信息检索与话题检测等技术近年得到了广泛关注。本文首先回顾了话题检测与跟踪技术发展的历史,并在介绍传统话题检测方法的基础上,从突发性检测与基于社会网络的话题检测与跟踪方法两个方面进行深入探讨;对话题检测与跟踪的评价方法进行了分析;最后展望了话题检测与跟踪方法的发展趋势。
Peer Review Status:Awaiting Review
Subjects: Computer Science >> Computer Application Technology submitted time 2017-03-09
Abstract:网络等技术的快速发展,使人们能够访问的数据规模急剧增加。如何从海量信息中找到需要的信息成为难题。信息检索技术是应对该问题的有效手段,可以快速有效地帮助人们找到自己需要的信息。本文介绍了检索技术中使用的索引组织、检索模型、查询分析等关键技术及本课题组开发和维护的高性能开源检索系统FirteX。
Peer Review Status:Awaiting Review
Subjects: Computer Science >> Computer Application Technology submitted time 2017-03-09
Abstract:如何对大规模富含情感信息的文本进行倾向性分析是当前web应用一个亟待解决的问题。本文在分析目前国内外情感倾向性分析研究现状的基础上,介绍了我们为进行中文情感倾向性分析所构建的语料集及开发的实验平台,然后重点介绍我们的工作,包括整篇文本的倾向性分析、领域情感词典构建、跨领域情感倾向性分析等方面的关键技术,从而通过不同角度提高文本倾向性分析精度。最后总结了我们已有的工作,并展望下一步我们将深入开展的研究工作。
Peer Review Status:Awaiting Review
Subjects: Computer Science >> Computer Application Technology submitted time 2017-03-09
Abstract:信息抽取是当前搜索引擎与自然语言处理研究领域的核心技术之一,它用来对文本做匹配,以获得其中包含的各种实体以及它们的属性及关系。本文对实体及其属性的抽取做了简单介绍,包括基于规则的抽取技术和基于统计的抽取技术,并介绍了几个典型的系统实例,如:IE2、GATE和SystemT及它们的原理,最后简单介绍了我们在这个领域的工作成果。
Peer Review Status:Awaiting Review