Category: Library and Information Science >> Library Science  Submitted: 2016-02-02
Abstract: If they find some words closely related to their target task in titles or abstracts, which are defined as intelligent sensitive words in this paper, they will decide to read these resources in depth to find more useful information. These sensitive words may refer to special persons, organisations, programmes, terms and so on. All of these sensitive words and their featured information are included in our object. Based on the above processes, we have determined that some features could be used for automatic computation. [Profiling science and innovation policy by object-based computing] Authority of source: if one piece of news is released on an official website, then this resource is more reliable than other resources from no
Category: Library and Information Science >> Library Science  Submitted: 2016-02-02
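Abstract: [Objective] To build a Web archiving system for important international research institutions. [Methods] The crawl-and-archive framework is extended on the basis of IIPC open-source software: a three-layer extension strategy is adopted on the crawling side, management functions such as automatic upload and reporting are added to the crawl client, a WARC file content parsing module is developed, and Solr is used for indexing. [Results] The three-layer extension is implemented on the crawling side; the added crawl-client functions increase the automation of the archiving workflow; the added WARC file content parsing extracts more information, extending the indexing and retrieval services. [Limitations] The framework has not been tested with large-scale crawling and archiving. [Conclusions] The extended crawl-and-archive framework is, at a preliminary level, distributed, scalable, and fully automated.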
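To make the WARC content-parsing step concrete, here is a minimal sketch in Python. It is not the authors' module. It assumes an uncompressed WARC stream, reads each record's named headers and payload, and maps them to a flat dictionary that could later be sent to a search index such as Solr. The function names `iter_warc_records` and `to_index_doc`, the output field names, and the sample record are all hypothetical.

```python
import io


def iter_warc_records(stream):
    """Yield (headers, payload) for each record in an uncompressed WARC stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of stream
        if not line.strip():
            continue  # blank separator lines between records
        if not line.startswith(b"WARC/"):
            raise ValueError(f"expected a WARC version line, got {line!r}")
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # blank line ends the header block
            name, _, value = line.decode("utf-8").partition(":")
            headers[name.strip()] = value.strip()
        # Content-Length gives the payload size in bytes.
        payload = stream.read(int(headers["Content-Length"]))
        yield headers, payload


def to_index_doc(headers, payload):
    """Map one record to a flat dict (hypothetical index field names)."""
    return {
        "id": headers.get("WARC-Record-ID"),
        "url": headers.get("WARC-Target-URI"),
        "date": headers.get("WARC-Date"),
        "type": headers.get("WARC-Type"),
        "size": len(payload),
    }


# Hypothetical single-record sample, built by hand for illustration.
BODY = b"hello archive"
SAMPLE = (
    b"WARC/1.0\r\n"
    b"WARC-Type: resource\r\n"
    b"WARC-Record-ID: <urn:uuid:00000000-0000-0000-0000-000000000001>\r\n"
    b"WARC-Date: 2016-02-02T00:00:00Z\r\n"
    b"WARC-Target-URI: http://example.org/\r\n"
    b"Content-Length: " + str(len(BODY)).encode() + b"\r\n"
    b"\r\n" + BODY + b"\r\n\r\n"
)

docs = [to_index_doc(h, p) for h, p in iter_warc_records(io.BytesIO(SAMPLE))]
```

Real archives are usually gzip-compressed and contain HTTP response records whose payloads need further parsing, so a production system would more likely rely on an established WARC library. The sketch only shows the record structure that such a parsing module works with.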